Abstract
The COVID-19 virus has become a critical hazard to human health. Many variants have been reported, and the virus continues to mutate, so new strains keep emerging. Detecting COVID-19 at an early stage is important because it helps in managing the disease efficiently. This work studies COVID-19 audio signals originating from breathing, coughing, and vowel sounds. Most existing work on this topic uses MFCC-based features. In this work, three new methods are proposed for COVID-19 detection. They use accumulated bispectrum features that capture the distinctive properties of COVID-19 in these signals. The CNN and ResNet-50 network models are used in this study. The performance of the proposed methods is analyzed in detail and compared with that of state-of-the-art methods. For various signals, the proposed methods show a considerable performance improvement.
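To make the notion of bispectrum features concrete, the following is a minimal sketch of a direct (FFT-based) bispectrum estimate, B(f1, f2) = E[X(f1) X(f2) X*(f1 + f2)], averaged over windowed frames. The abstract does not define how the bispectrum is accumulated, so the accumulation step here (summing the magnitude along one frequency axis) is only an assumption for illustration, and the function names, frame length, and hop size are hypothetical rather than taken from the paper.

```python
import numpy as np

def bispectrum(x, nfft=64, hop=32):
    """Direct bispectrum estimate averaged over Hann-windowed frames."""
    x = np.asarray(x, dtype=float)
    window = np.hanning(nfft)
    k = np.arange(nfft)
    # Index of the bin f1 + f2, wrapped around the FFT length.
    idx_sum = (k[:, None] + k[None, :]) % nfft
    acc = np.zeros((nfft, nfft), dtype=complex)
    n_frames = 0
    for start in range(0, len(x) - nfft + 1, hop):
        frame = x[start:start + nfft]
        frame = (frame - frame.mean()) * window  # remove DC, then window
        X = np.fft.fft(frame)
        # Triple product X(f1) X(f2) X*(f1 + f2) for every bin pair.
        acc += X[:, None] * X[None, :] * np.conj(X[idx_sum])
        n_frames += 1
    if n_frames == 0:
        raise ValueError("signal is shorter than one frame")
    return acc / n_frames

def accumulated_bispectrum(B):
    """Assumed accumulation: sum |B(f1, f2)| over f2 for each f1."""
    return np.abs(B).sum(axis=1)

# Usage on a synthetic signal (a stand-in for a cough or breath recording).
rng = np.random.default_rng(0)
signal = rng.standard_normal(1024)
B = bispectrum(signal)
features = accumulated_bispectrum(B)
print(B.shape, features.shape)
```

In a pipeline like the one the abstract outlines, such a feature vector or the bispectrum magnitude itself would then be passed to a network such as a CNN. That step is omitted here because the abstract does not specify it.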
Data Availability
The datasets used and analyzed in this study are described in "Coswara: A Database of Breathing, Cough, and Voice Sounds for COVID-19 Diagnosis" and are available at https://doi.org/10.48550/arXiv.2005.10548. The code for this work can be made available to readers upon request via email.
Ethics declarations
Conflict of interest
The authors declare that they have no competing interest.
About this article
Cite this article
Sangle, S.B., Gaikwad, C.J. COVID-19 Respiratory Sound Signal Detection Using HOS-Based Linear Frequency Cepstral Coefficients and Deep Learning. Circuits Syst Signal Process 43, 331–347 (2024). https://doi.org/10.1007/s00034-023-02474-4
DOI: https://doi.org/10.1007/s00034-023-02474-4