COVID-19 Respiratory Sound Signal Detection Using HOS-Based Linear Frequency Cepstral Coefficients and Deep Learning

Sangle, Sandeep B.; Gaikwad, Chandrakant J.

doi:10.1007/s00034-023-02474-4

Published: 20 August 2023

Volume 43, pages 331–347, (2024)

Circuits, Systems, and Signal Processing

Abstract

COVID-19 remains a serious hazard to human health. Many variants have been reported, and because the virus is still mutating, new strains continue to emerge. Detecting COVID-19 at an early stage helps in managing the disease efficiently. This work studies COVID-19 audio signals originating from breathing, coughing, and vowel sounds. Most works in the literature on this topic use MFCC-based features. In this work, three new methods are proposed for COVID-19 detection. They use accumulated bispectrum features that capture the distinctive properties of COVID-19 in these signals. The performance of the proposed methods is analyzed in detail and compared with state-of-the-art methods. For various signals, the proposed methods show considerable performance improvement. The CNN and ResNet-50 network models are used in this study.
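As a rough illustration of the kind of feature the abstract refers to, the sketch below estimates a bispectrum with the standard direct (FFT-based) estimator, B(f1, f2) = E[X(f1) X(f2) X*(f1 + f2)], averaged over windowed frames. The abstract does not define how the bispectrum is accumulated, so the `accumulate` helper (summing |B| along one frequency axis) is a hypothetical choice and not the authors' method; the function names, frame length, and hop size are likewise assumptions.

```python
import numpy as np

def bispectrum(x, nfft=64, hop=32):
    """Direct bispectrum estimate, averaged over Hann-windowed frames.

    nfft and hop are illustrative values, not taken from the paper.
    """
    x = np.asarray(x, dtype=float)
    window = np.hanning(nfft)
    idx = np.arange(nfft)
    # Lookup table for the bin index of f1 + f2, wrapped modulo nfft.
    sum_idx = (idx[:, None] + idx[None, :]) % nfft
    acc = np.zeros((nfft, nfft), dtype=complex)
    n_frames = 0
    for start in range(0, len(x) - nfft + 1, hop):
        frame = x[start:start + nfft]
        X = np.fft.fft((frame - frame.mean()) * window)
        # Triple product X(f1) * X(f2) * conj(X(f1 + f2)) for all bin pairs.
        acc += X[:, None] * X[None, :] * np.conj(X[sum_idx])
        n_frames += 1
    if n_frames == 0:
        raise ValueError("signal is shorter than one frame")
    return acc / n_frames

def accumulate(B):
    """Hypothetical accumulation: sum |B(f1, f2)| over f2 for each f1."""
    return np.abs(B).sum(axis=1)
```

A 1-D profile such as the one returned by `accumulate` could then feed a cepstral-style pipeline or be stacked into an image for a CNN; which of these the paper uses cannot be determined from the abstract alone.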

Data Availability

The datasets used and analyzed in this study are described in "COSWARA–A Database of Breathing, Cough, and Voice Sounds for COVID-19 Diagnosis," available at https://doi.org/10.48550/arXiv.2005.10548. The code for this work can be made available to readers upon request via email.

References

M. Al-Khassaweneh, R.B. Abdelrahman, A signal processing approach for the diagnosis of asthma from cough sounds. J. Med. Eng. Technol. 37, 165–171 (2013)
I. Al-Shourbaji, P.H. Kachare, L. Abualigah, M.E. Abdelhag, B. Elnaim, A.M. Anter, A.H. Gandomi, A deep batch normalized convolution approach for improving COVID-19 detection from chest X-ray images. Pathogens 12, 17 (2022)
M. B. Alsabek, I. Shahin, A. Hassan, Studying the similarity of COVID-19 sounds based on correlation analysis of MFCC. In Proceedings International Conference on Communications, Computing, Cybersecurity, and Informatics (CCCI) (2020), pp. 1–5
M. Aly, K.H. Rahouma, S.M. Ramzy, Pay attention to the speech, COVID-19 diagnosis using machine learning and crowdsourced respiratory and speech recordings. Alexandria Eng. J. 61, 3487–3500 (2022)
V. Bairagi, EEG signal analysis for early diagnosis of Alzheimer disease using spectral and wavelet based features. Int. J. Inf. Technol. 10, 403–412 (2018)
V. Chandran, S. Elgar, A general procedure for the derivation of principal domains of higher-order spectra. IEEE Trans. Signal Process. 42, 229–233 (1994)
N.V. Chawla, K.W. Bowyer, L.O. Hall, W.P. Kegelmeyer, SMOTE: synthetic minority over-sampling technique. J. Artif. Intell. Res. 16, 321–357 (2002)
G. Deshpande, B. W. Schuller, COVID-19 biomarkers in speech: on source and filter components. In Proceedings 43rd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC) (2021), pp. 800–803
H. J. Dou, Z. Y. Wu, Y. Feng, Y. Z. Qian, Voice activity detection based on the bispectrum. In Proceedings IEEE 10th International Conference on Signal Processing (2010), pp. 502–505
K. Feng, F. He, J. Steinmann, I. Demirkiran, Deep-learning based approach to identify COVID-19. In Proceedings SoutheastCon (2021), pp. 1–4
C.J. Gaikwad, P. Sircar, Bispectrum-based technique to remove cross-terms in quadratic systems and Wigner-Ville distribution. Signal Image Video Process. 12, 703–710 (2018)
M. Gilke, P. Kachare, R. Kothalikar, V. P. Rodrigues, M. Pednekar, MFCC-based vocal emotion recognition using ANN. In International Conference on Electronics Engineering and Informatics (2012), pp. 150–154
J.A. Gordon, D.F. Buscher, Detection noise bias and variance in the power spectrum and bispectrum in optical interferometry. Astron. Astrophys. 541, A46 (2012)
I. Goodfellow, Y. Bengio, A. Courville, Deep Learning (MIT Press, 2016)
P. Henríquez, J.B. Alonso, M.A. Ferrer, C.M. Travieso, J.I. Godino-Llorente, F. Díaz-de-María, Characterization of healthy and pathological voice through measures based on nonlinear dynamics. IEEE Trans. Audio Speech Lang. Process. 17, 1186–1195 (2009)
A. Imran, I. Posokhova, H.N. Qureshi, U. Masood, M.S. Riaz, K. Ali, M. Nabeel, AI4COVID-19: AI enabled preliminary diagnosis for COVID-19 from cough samples via an app. Informat. Med. Unlocked 20, 100378 (2020)
R. Islam, E. Abdel-Raheem, M. Tarique, A study of using cough sounds and deep neural networks for the early detection of COVID-19. Biomed. Eng. Adv. 3, 100025 (2022)
R. Kakarala, The bispectrum as a source of phase-sensitive invariants for Fourier descriptors: a group-theoretic approach. J. Math. Imaging Vis. 44, 341–353 (2012)
S. Li, Y. Liu, Feature extraction of lung sounds based on bispectrum analysis. In IEEE Third International Symposium on Information Processing (2010), pp. 393–397
M. Loey, S. Mirjalili, COVID-19 cough sound symptoms classification from scalogram image representation using deep learning models. Comput. Biol. Med. 139, 105020 (2021)
T. Matsuoka, T.J. Ulrych, Phase estimation using the bispectrum. Proc. IEEE 72, 1403–1411 (1984)
J.M. Mendel, Tutorial on higher-order statistics (spectra) in signal processing and system theory: theoretical results and some applications. Proc. IEEE 79, 278–305 (1991)
M. Pahar, M. Klopper, R. Warren, T. Niesler, COVID-19 cough classification using machine learning and global smartphone recordings. Comput. Biol. Med. 135, 104572 (2021)
M. Pahar, M. Klopper, R. Warren, T. Niesler, COVID-19 detection in cough, breath and speech using deep transfer learning and bottleneck features. Comput. Biol. Med. 141, 105153 (2022)
M. Sanaullah, A review of higher order statistics and spectra in communication systems. Global J. Sci. Front. Res. 13, 31–50 (2013)
S. B. Sangle, C. J. Gaikwad, Covid-19 detection using spectral and statistical features of cough and breath sounds. In IEEE International Conference on Decision Aid Sciences and Application (DASA) (2021), pp. 182–186
S. B. Sangle, C. J. Gaikwad, Accumulated bispectral image-based respiratory sound signal classification using deep learning. Signal Image Video Process. 1–8 (2023)
S. Shahnawazuddin, A. Kumar, S. Kumar, W. Ahmad, Enhancing robustness of zero resource children’s speech recognition system through bispectrum based front-end acoustic features. Digital Signal Process. 118, 103226 (2021)
N. Sharma, P. Krishnan, R. Kumar, S. Ramoji, S. R. Chetupalli, P. K. Ghosh, S. Ganapathy, Coswara–a database of breathing, cough, and voice sounds for COVID-19 diagnosis. (2020) arXiv preprint arXiv:2005.10548
A. Unnikrishnan, V. Sowmya, K.P. Soman, Deep AlexNet with reduced number of trainable parameters for satellite image classification. Proced. Comput. Sci. 143, 931–938 (2018)
WHO Coronavirus (COVID-19) Dashboard. https://covid19.who.int (Date Accessed: 14 Jan 2022)
M. Xu, Y. Han, X. Sun, Y. Shao, F. Gu, A.D. Ball, Vibration characteristics and condition monitoring of internal radial clearance within a ball bearing in a gear-shaft-bearing system. Mech. Syst. Signal Process. 165, 108280 (2022)

Author information

Authors and Affiliations

Department of Electronics and Telecommunication Engineering, Ramrao Adik Institute of Technology, DY Patil Deemed to be University, Navi Mumbai, India
Sandeep B. Sangle & Chandrakant J. Gaikwad

Corresponding author

Correspondence to Chandrakant J. Gaikwad.

Ethics declarations

Conflict of interest

The authors declare that they have no competing interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

About this article

Cite this article

Sangle, S.B., Gaikwad, C.J. COVID-19 Respiratory Sound Signal Detection Using HOS-Based Linear Frequency Cepstral Coefficients and Deep Learning. Circuits Syst Signal Process 43, 331–347 (2024). https://doi.org/10.1007/s00034-023-02474-4

Received: 28 December 2022
Revised: 19 July 2023
Accepted: 20 July 2023
Published: 20 August 2023
Issue Date: January 2024
DOI: https://doi.org/10.1007/s00034-023-02474-4
