A novel proposed CNN–SVM architecture for ECG scalograms classification

Ozaltin, Oznur; Yeniay, Ozgur

doi:10.1007/s00500-022-07729-x

A novel proposed CNN–SVM architecture for ECG scalograms classification

Data Analytics and Machine Learning
Published: 15 December 2022

Volume 27, pages 4639–4658, (2023)
Cite this article

Download PDF

Soft Computing Aims and scope Submit manuscript

A novel proposed CNN–SVM architecture for ECG scalograms classification

Download PDF

3641 Accesses
9 Citations
Explore all metrics

Abstract

Nowadays, the number of sudden deaths due to heart disease is increasing with the coronavirus pandemic. Therefore, automatic classification of electrocardiogram (ECG) signals is crucial for diagnosis and treatment. Thanks to deep learning algorithms, classification can be performed without manual feature extraction. In this study, we propose a novel convolutional neural networks (CNN) architecture to detect ECG types. In addition, the proposed CNN can automatically extract features from images. Here, we classify a real ECG dataset using our proposed CNN which includes 34 layers. While this dataset is one-dimensional signals, these are transformed into images (scalograms) using continuous wavelet transform (CWT). In addition, the proposed CNN is compared to known architectures: AlexNet and SqueezeNet for classifying ECG images, and we find it more effective than others. This study, which not only performed CWT but also implemented short-time Fourier transform, examines the success in recognizing ECG types for the proposed CNN. Besides, different split methods: training and testing, and cross-validation are applied in this study. Eventually, CWT and cross-validation are the best pre-processing and split methods for the proposed CNN, respectively. Although the results are quite good, we benefit from support vector machines (SVM) to obtain the best algorithm and for detecting ECG types. Essentially, the main aim of the study increases classification results. In this way, the proposed CNN is utilized as deep feature extractor and combined with SVM. As a conclusion of this study, we achieve the highest accuracy of 99.21% from the proposed CNN–SVM when using CWT. Therefore, we can express that this framework can be used as an aid to clinicians for ECG-type identification.

Detection of heart arrhythmia based on UCMFB and deep learning technique

Article 23 November 2022

Automated ECG multi-class classification system based on combining deep learning features with HRV and ECG measures

Article Open access 29 January 2022

Scalar invariant transform based deep learning framework for detecting heart failures using ECG signals

Article Open access 01 February 2024

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

The qualitative processing and classification of biomedical signals is very important for diagnosis and therapy. Many methods are used to process biomedical signals. Some important methods are discrete Fourier transform (DFT), short-time Fourier transform (STFT), continuous wavelet transform (CWT), and discrete wavelet transform. The Fourier transformation provides a very good frequency range for stationary signals (Haberl et al. 1989). However, the time domain is almost non-existent. This can lead to serious problems, especially if time-dependent characteristics are to be inferred. However, when signals are transformed with the wavelet transform, both frequency and time domains are distinguishable (Li et al. 1995). In other words, wavelet transform (WT) is a transformation technique that splits signals into different frequency components and processes each component with the time domain of the respective scale. In this study, we focus on electrocardiogram (ECG) signals. The signals resulting from the electrical activity of the heart, the main vital organ in the human body, are called an electrocardiogram (ECG). Sudden deaths from heart disease with coronavirus (COVID-19) are currently on the rise (https://www.chss.org.uk/media-release/new-nhs-figures-show-dangerous-domino-effect-of-pandemic-on-progress-made-with-strokes-and-heart-disease/). For this reason, the processing and analysis of the signals received by the heart are very important for rapid diagnosis and treatment. In conventional methods, a suitable sampling method is used in the pre-processing phase of ECG signals and the signals are cleaned of noise. Then, the manual feature extraction phase begins, where it is very important to seek expert opinions. This phase is very critical as incorrect feature extraction can lead to misclassification of signals and serious errors in diagnosis and treatment. After all these phases are completed, classification is done using traditional classification algorithms. However, the studies show that the situation for deep learning algorithms has changed in recent years (Ozaltin et al. 2022; Özaltın and Yeniay 2021; Koc et al. 2022). Thanks to deep learning algorithms, successful classifications can be made automatically. In this way, the state of health of patients can be monitored with smartphones, watches, etc., even without an expert opinion.

The aim of the study was to recognize type of ECG efficiently via deep learning algorithm. Firstly, we collect the dataset from PhysioNet databases (Physionet 2020). The dataset consists of three different types: arrhythmia (ARR), congestive heart failure (CHF), and normal sinus rhythm (NSR). In this study, a novel convolutional neural networks (CNN) architecture, which is one of the deep learning algorithms, is proposed for automatic ECG signal classification. This newly proposed 34-layer CNN architecture is designed for two-dimensional images. In fact, the newly proposed CNN is considered not only ECG classification, but also other biomedical signals, images, etc. classification. In this context, the ECG signals are naturally transformed from one-dimensional signals into images by using a continuous wavelet transform (CWT) in the pre-processing phase. This wavelet transform has three different mother wavelet functions: Amor, Bump, and Morse, which are the most commonly used. The impact of these functions on classification performance is also examined. In this study, 360 Hz, 500 Hz and 1000 Hz sample lengths are examined whether the wave characteristics become more evident. Figure 1 shows the images (scalograms) obtained with different sampling lengths of ECG signals, 360 Hz, 500 Hz, and 1000 Hz, respectively. Therefore, a total of nine different datasets are obtained under these conditions. These datasets are classified separately with the same training options parameters using the proposed CNN, AlexNet, and SqueezeNet. After identifying the best wavelet function, sample length, and architecture, we additionally investigate another pre-processing method: STFT to measure ECG classification performance via different split methods: training and testing, and cross-validation. Finally, the proposed CNN is used as a deep feature extractor from images and merged with support vector machines (SVM) to get trusted results.

In this study, a hybrid algorithm is proposed to detect ECG types from acquired images based on a deep learning algorithm and a machine learning algorithm. The main contributions and novelties of this study are as follows:

When using CWT, 500 Hz is observed as an efficient sample length while converting.
Amor wavelet function has higher performance than others while applying CWT.
A new CNN architecture called proposed CNN is presented and compared with AlexNet and SqueezeNet. Eventually, the proposed CNN has the highest performance.
To measure the performance of the proposed CNN, STFT is also used as pre-processing method via different splitting methods: training and testing (80:20, 70:30), and k-fold cross-validation (5, 10). Finally, CWT is higher than it and cross-validation is the best splitting method.
To improve classification performance, the proposed CNN is utilized as feature extractor and benefited from both fully connected layer and maximum pooling layer.
Reduced features are classified using SVM.
Consequently, the highest performance to recognize ECG types is acquired thanks to the proposed CNN–SVM hybrid algorithm.

1.1 Related studies

Nowadays, artificial intelligence is evolving day by day, and many studies are also being conducted to classify ECG signals and other biomedical signals using CNN architectures. Khorrami and Moavenian (2010) applied the CWT, discrete wavelet transform (DWT), and discrete cosine transform (DCT) to ECG signals. In addition, they compared SVM with multi-layer perceptron (MLP) algorithms in the classification phase. In particular, they found that combinations made with MLP (CWT-MLP, DWT-MLP, DCT-MLP) are superior to SVM. Al Rahhal et al. (2018) transformed signals from different datasets using CWT to identify arrhythmias in ECG signals. Also, they used the CNN algorithm and achieved an accuracy of 99% in the classification phase. Huang et al. (2019) converted ECG signals with STFT and obtained two-dimensional scalograms in their study. Moreover, they benefited from the CNN architecture for classifying these scalograms and achieved an accuracy of 99%. In addition, they also classified the one-dimensional ECG signals using CNN and found an accuracy of 90.93%. Krak et al. (2020) transformed ECG signals into the images using CWT and DWT in their study. Furthermore, they classified the images using the CNN architecture and obtained an accuracy of 96% in the classification phase. Baloglu et al. (2019) designed a 10-layer end-to-end CNN architecture for the classification of multiclass one-dimensional ECG data and achieved an accuracy of a 99.78%. Mahmud et al. (2020) created a CNN architecture for multiclass one-dimensional ECG data and obtained an accuracy rate of 99.28%. Salem et al. (2018) utilized DenseNet architecture to classify transformed two-dimensional ECG data and achieved an accuracy of 97.23%. Zhao et al. (2020) proposed a CNN containing 24 layers for classifying transformed ECG data and achieved an accuracy of 87.1%. Xu and Liu (2020) created a CNN architecture in order to analyze ECG data recorded from a Holter device and achieved an accuracy of 99.4%. Rajkumar et al. (2019) suggested a CNN architecture for one-dimensional ECG data by using exponential linear unit (ELU) activation layers and achieved an accuracy of 93.6%. Hua et al. (2020) developed a CNN architecture for one-dimensional ECG signals and achieved an accuracy of 97.45%. Kiranyaz et al. (2015) proposed a CNN architecture for patient-specific real-time one-dimensional ECG classification and achieved an accuracy of 96.4%. Chen et al. (2020) suggested CNN + long short-term memory (LSTM) which can classify six kinds of ECG fragments. They have classified two ECG databases: MIT-BIH arrhythmia database and MIT-BIH arrhythmia database + Challenge2017, and achieved an accuracy of 99.32% and 97.15%, respectively, using CNN + LSTM. Sandeep et al. (2019) utilized the CNN architecture to classify ECG data and also achieved an accuracy of 90.63%. Furthermore, machine learning algorithms such as support vectors machine (SVM), K-nearest neighbors (KNN), decision tree (DT), extreme learning machine (ELM), ensemble learning, and multi-layer perceptron (MLP) to classify ECG signals by many other researchers (Alickovic and Subasi 2015; Qaisar and Subasi 2020; Tuncer et al. 2022; Ceylan and Özbay 2007; Pławiak and Acharya 2020). Additionally, Table 1 shows recent studies on ECG signals classification.

Table 1 Recent Studies on ECG signals classification

A novel proposed CNN–SVM architecture for ECG scalograms classification

Abstract

Similar content being viewed by others

Detection of heart arrhythmia based on UCMFB and deep learning technique

Automated ECG multi-class classification system based on combining deep learning features with HRV and ECG measures

Scalar invariant transform based deep learning framework for detecting heart failures using ECG signals

1 Introduction

1.1 Related studies

2 Materials and methods

2.1 Pre-processing methods

2.1.1 Max–min normalization

2.1.2 Continuous wavelet transform

2.1.3 Short-time Fourier transform (STFT)

2.2 Convolutional neural network (CNN)

2.3 Pre-trained architectures: AlexNet and SqueezeNet

2.4 Novel proposed CNN architecture

2.5 Deep feature extraction

2.6 Support vector machine (SVM)

3 Results

3.1 ECG dataset

3.2 Experimental setup

3.3 Performance metrics

3.4 Experimental results

4 Discussion

5 Conclusion

Data availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation