Imbalance Reduction Techniques Applied to ECG Classification Problem

Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 11872)


In this work we explored capabilities of improving deep learning models performance by reducing the dataset imbalance. For our experiments a highly imbalanced ECG dataset MIT-BIH was used. Multiple approaches were considered. First we introduced mutliclass UMCE, the ensemble designed to deal with imbalanced datasets. Secondly, we studied the impact of applying oversampling techniques to a training set. smote without prior majority class undersampling was used as one of the methods. Another method we used was smote with noise introduced to synthetic learning examples. The baseline for our study was a single ResNet network with undersampling of the training set. Mutliclass UMCE proved to be superior compared to the baseline model, but failed to beat the results obtained by a single model with smote applied to training set. Introducing perturbations to signals generated by smote did not bring significant improvement. Future work may consider combining multiclass UMCE with smote.


Machine learning ECG classification Imbalanced data 



This work is supported by the Polish National Science Center under the Grant no. UMO-2015/19/B/ST6/01597 as well the statutory funds of the Department of Systems and Computer Networks, Faculty of Electronics, Wrocław University of Science and Technology.

We also wanna thank Michał Leś for lending his computing power resources. Thanks to him this results could be collected and presented.


  1. 1.
    Abadi, M.: TensorFlow: Large-scale machine learning on heterogeneous systems (2015).
  2. 2.
    Li, H., et al.: Visualizing the loss landscape of neural nets. CoRR abs/1712.09913 (2017).
  3. 3.
    He, K., et al.: Deep residual learning for image recognition. CoRR abs/1512.03385 (2015).
  4. 4.
    Bowyer, K.W., et al.: SMOTE: synthetic minority over-sampling technique. CoRR abs/1106.1813 (2011).
  5. 5.
    Jun, T.J., et al.: ECG arrhythmia classification using a 2-D convolutional neural network. CoRR abs/1804.06812 (2018).
  6. 6.
    Chollet, F.E.A.: Keras (2015).
  7. 7.
    Goodfellow, I., Bengio, Y., Courville, A.: Deep Learning. MIT Press, Cambridge (2016).
  8. 8.
    Kachuee, M., Fazeli, S., Sarrafzadeh, M.: ECG heartbeat classification: a deep transferable representation. CoRR abs/1805.00794 (2018).
  9. 9.
    Ksieniewicz, P.: Undersampled majority class ensemble for highly imbalanced binary classification. Proc. Mach. Learn. Res. 1, 1–13 (2010)Google Scholar
  10. 10.
    Pedregosa, F.E.A.: Scikit-learn: machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830 (2011)MathSciNetzbMATHGoogle Scholar
  11. 11.
    van der Walt, S., Colbert, S.C., Varoquaux, G.: The numpy array a structure for efficient numerical computation. Comput. Sci. Eng. 13(2), 22–30 (2011)CrossRefGoogle Scholar
  12. 12.
    Xiong, Z.E.A.: ECG signal classification for the detection of cardiac arrhythmias using a convolutional recurrent neural network. Physiol. Meas. (2018). Scholar
  13. 13.
    Xu, S.S., Mak, M.W., Cheung, C.C.: Towards end-to-end ECG classification with rawsignal extraction and deep neural networks. IEEE J. Biomed. Health Inform. (2019)Google Scholar

Copyright information

© Springer Nature Switzerland AG 2019

Authors and Affiliations

  1. 1.Department of Systems and Computer NetworksWroclaw University of Science and TechnologyWroclawPoland

Personalised recommendations