RUS  ENG
Полная версия
ЖУРНАЛЫ // Компьютерная оптика // Архив

Компьютерная оптика, 2022, том 46, выпуск 6, страницы 980–987 (Mi co1094)

ЧИСЛЕННЫЕ МЕТОДЫ И АНАЛИЗ ДАННЫХ

Arrhythmia detection using resampling and deep learning methods on unbalanced data

E. Yu. Shchetinina, A. G. Glushkovab

a Financial University under the Government of the Russian Federation, Moscow
b Endeavor, London W4 5HR, Chiswick Park, 566 Chiswick High Road, United Kingdom

Аннотация: Due to cardiovascular diseases millions of people die around the world. One way to detect abnormality in the heart condition is with the help of electrocardiogram signal (ECG) analysis. This paper’s goal is to use machine learning and deep learning methods such as Support Vector Machines (SVM), Random Forests, Light Gradient Boosting Machine (LightGBM), Convolutional Neural Network (CNN), Long Short-Term Memory (LSTM) and Bidirectional Long Short-Term Memory (BLSTM) to classify arrhythmias, where particular interest represent the rare cases of disease. In order to deal with the problem of imbalance in the dataset we used resampling methods such as SMOTE Tomek-Links and SMOTE ENN to improve the representation ration of the minority classes. Although the machine learning models did not improve a lot when trained on the resampled dataset, the deep learning models showed more impressive results. In particular, LSTM model fitted on dataset resampled using SMOTE ENN method provides the most optimal precision-recall trade-off for the minority classes Supraventricular beat and Fusion of ventricular and normal beat, with recall of 83% and 88% and precision of 74% and 66% for the two classes re-spectively, whereas the macro-weighted recall is 92% and precision is 82%.

Ключевые слова: machine learning, deep learning, ECG, resampling, arrhythmia

Поступила в редакцию: 21.02.2022
Принята в печать: 13.06.2022

Язык публикации: английский

DOI: 10.18287/2412-6179-CO-1112



© МИАН, 2025