Abstract
This work analyzes the peak to side-lobe ratio (PSR) around each glottal closure instant (GCI) in the Hilbert envelope (HE) of linear prediction (LP) residual as an excitation source-based cue for the hypernasality detection. PSR is defined as the ratio of peak value around GCI to the mean of sample values around GCI in the 3 ms range of HE of LP residual. The coupling between nasal and oral tract occurs during the production of voiced sound in hypernasal speech. The air leakage from nasal tract affects the abruptness of glottal closure, which in turn affects the peak strength around the GCIs. The nasal tract adds zeros in the spectrum of voiced sound. Since the LP model is poor in modeling the zeros in the spectrum, the zeros get filtered in the LP residual signal. This increases the side-lobe strength around the peak in the HE of LP residual. Hence, the PSR gets affected in hypernasal speech. Classification between pre-known normal and hypernasal sound based on a threshold value of PSR gives the accuracy of 70.49, 78.19, 63.15, 60.67, and 67.27% for high vowel, low vowel, glides, liquids, and voicebar sounds, respectively.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Anderson SR, Keating PA, Huffman MK, Krakow RA (2014) Nasals, nasalization, and the velum, vol. 5. Elsevier
Cairns D, Hansen JH, Riski JE et al (1996) A noninvasive technique for detecting hypernasal speech using a nonlinear operator. IEEE Trans Biomed Eng 43(1):35–45
Dubey AK, Prasanna SM, Dandapat S (2016) Zero time windowing analysis of hypernasality in speech of cleft lip and palate children. In: 2016 twenty second national conference on communication (NCC). IEEE, pp 1–6
Dubey AK, Prasanna SM, Dandapat S (2016) Zero time windowing based severity analysis of hypernasal speech. In: Region 10 conference (TENCON), 2016 IEEE, pp 970–974
Eshghi M, Alemi MM, Eshghi M (2015) Vowel nasalization might affect the envelop of the vowel signal by reducing the magnitude of the rising and falling slope amplitude. J Acoust Soc Am 137(4):2304
He L, Zhang J, Liu Q, Yin H, Lech M (2014) Automatic evaluation of hypernasality and consonant misarticulation in cleft palate speech. Signal Process Lett IEEE 21(10):1298–1301
Henningsson G, Kuehn DP, Sell D, Sweeney T, Trost-Cardamone JE, Whitehill TL (2008) Universal parameters for reporting speech outcomes in individuals with cleft palate. Cleft Palate Craniofac J 45(1):1–17
Kummer AW, Lee L (1996) Evaluation and treatment of resonance disorders. Lang Speech Hear Serv Sch 27(3):271–281
Lee GS, Wang CP, Yang CC, Kuo TB (2006) Voice low tone to high tone ratio: a potential quantitative index for vowel [a:] and its nasalization. IEEE Trans Biomed Eng 53(7):1437–1439
Maier A, Hönig F, Bocklet T, Nöth E, Stelzle F, Nkenke E, Schuster M (2009) Automatic detection of articulation disorders in children with cleft lip and palate. J Acoust Soc Am 126(5):2589–2602
Medina E, Solorio T (2006) Wavesurfer: a tool for sound analysis
Murty KSR, Yegnanarayana B (2008) Epoch extraction from speech signals. IEEE Trans Audio Speech Lang Process 16(8):1602–1613
Orozco-Arroyave JR, Rendón SM, Álvarez-Meza AM, Arias-Londoño JD, Delgado-Trejos E, Bonilla JFV, Castellanos-Domínguez CG (2011) Automatic selection of acoustic and non-linear dynamic features in voice signals for hypernasality detection. In: Interspeech. Citeseer, pp 529–532
Orozco-Arroyave JR, Arias-Londoño JD, Bonilla JFV, Nöth E (2012) Automatic detection of hypernasal speech signals using nonlinear and entropy measurements. In: INTERSPEECH, pp 2029–2032
Rah DK, Ko YI, Lee C, Kim DW (2001) A noninvasive estimation of hypernasality using a linear predictive model. Ann Biomed Eng 29(7):587–594
Raykar VC, Yegnanarayana B, Prasanna SM, Duraiswami R (2005) Speaker localization using excitation source information in speech. IEEE Trans Speech Audio Process 13(5):751–761
Rendón SM, Arroyave JO, Bonilla JV, Londoño JA, Domínguez CC (2011) Automatic detection of hypernasality in children. In: International work-conference on the interplay between natural and artificial computation. Springer, pp 167–174
Sharma B, Prasanna SM (2016) Sonority measurement using system, source and suprasegmental information. IEEE/ACM Trans Audio Speech Lang Process
Vijayalakshmi P, Reddy MR, O’Shaughnessy D (2007) Acoustic analysis and detection of hypernasality using a group delay function. IEEE Trans Biomed Eng 54(4):621–629
Acknowledgements
The authors are very much thankful to Prof. M. Pushpavathi and Prof. Ajish K. Abraham from AIISH Mysore for sharing the hyernasal speech of children with cleft palate.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Dubey, A.K., Singh, D.K., Tiwari, B.B. (2022). Significance of Source Information in Hypernasality Detection. In: Kaiser, M.S., Bandyopadhyay, A., Ray, K., Singh, R., Nagar, V. (eds) Proceedings of Trends in Electronics and Health Informatics. Lecture Notes in Networks and Systems, vol 376. Springer, Singapore. https://doi.org/10.1007/978-981-16-8826-3_7
Download citation
DOI: https://doi.org/10.1007/978-981-16-8826-3_7
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-16-8825-6
Online ISBN: 978-981-16-8826-3
eBook Packages: EngineeringEngineering (R0)