Significance of Source Information in Hypernasality Detection

Dubey, Akhilesh Kumar; Singh, Deepak Kumar; Tiwari, B. B.

doi:10.1007/978-981-16-8826-3_7

Akhilesh Kumar Dubey¹⁴,
Deepak Kumar Singh¹⁵ &
B. B. Tiwari¹⁵

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 376))

646 Accesses
1 Citations

Abstract

This work analyzes the peak to side-lobe ratio (PSR) around each glottal closure instant (GCI) in the Hilbert envelope (HE) of linear prediction (LP) residual as an excitation source-based cue for the hypernasality detection. PSR is defined as the ratio of peak value around GCI to the mean of sample values around GCI in the 3 ms range of HE of LP residual. The coupling between nasal and oral tract occurs during the production of voiced sound in hypernasal speech. The air leakage from nasal tract affects the abruptness of glottal closure, which in turn affects the peak strength around the GCIs. The nasal tract adds zeros in the spectrum of voiced sound. Since the LP model is poor in modeling the zeros in the spectrum, the zeros get filtered in the LP residual signal. This increases the side-lobe strength around the peak in the HE of LP residual. Hence, the PSR gets affected in hypernasal speech. Classification between pre-known normal and hypernasal sound based on a threshold value of PSR gives the accuracy of 70.49, 78.19, 63.15, 60.67, and 67.27% for high vowel, low vowel, glides, liquids, and voicebar sounds, respectively.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 189.00; Price excludes VAT (USA)

Softcover Book: USD 249.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Anderson SR, Keating PA, Huffman MK, Krakow RA (2014) Nasals, nasalization, and the velum, vol. 5. Elsevier
Google Scholar
Cairns D, Hansen JH, Riski JE et al (1996) A noninvasive technique for detecting hypernasal speech using a nonlinear operator. IEEE Trans Biomed Eng 43(1):35–45
Article Google Scholar
Dubey AK, Prasanna SM, Dandapat S (2016) Zero time windowing analysis of hypernasality in speech of cleft lip and palate children. In: 2016 twenty second national conference on communication (NCC). IEEE, pp 1–6
Google Scholar
Dubey AK, Prasanna SM, Dandapat S (2016) Zero time windowing based severity analysis of hypernasal speech. In: Region 10 conference (TENCON), 2016 IEEE, pp 970–974
Google Scholar
Eshghi M, Alemi MM, Eshghi M (2015) Vowel nasalization might affect the envelop of the vowel signal by reducing the magnitude of the rising and falling slope amplitude. J Acoust Soc Am 137(4):2304
Article Google Scholar
He L, Zhang J, Liu Q, Yin H, Lech M (2014) Automatic evaluation of hypernasality and consonant misarticulation in cleft palate speech. Signal Process Lett IEEE 21(10):1298–1301
Article Google Scholar
Henningsson G, Kuehn DP, Sell D, Sweeney T, Trost-Cardamone JE, Whitehill TL (2008) Universal parameters for reporting speech outcomes in individuals with cleft palate. Cleft Palate Craniofac J 45(1):1–17
Article Google Scholar
Kummer AW, Lee L (1996) Evaluation and treatment of resonance disorders. Lang Speech Hear Serv Sch 27(3):271–281
Article Google Scholar
Lee GS, Wang CP, Yang CC, Kuo TB (2006) Voice low tone to high tone ratio: a potential quantitative index for vowel [a:] and its nasalization. IEEE Trans Biomed Eng 53(7):1437–1439
Article Google Scholar
Maier A, Hönig F, Bocklet T, Nöth E, Stelzle F, Nkenke E, Schuster M (2009) Automatic detection of articulation disorders in children with cleft lip and palate. J Acoust Soc Am 126(5):2589–2602
Article Google Scholar
Medina E, Solorio T (2006) Wavesurfer: a tool for sound analysis
Google Scholar
Murty KSR, Yegnanarayana B (2008) Epoch extraction from speech signals. IEEE Trans Audio Speech Lang Process 16(8):1602–1613
Article Google Scholar
Orozco-Arroyave JR, Rendón SM, Álvarez-Meza AM, Arias-Londoño JD, Delgado-Trejos E, Bonilla JFV, Castellanos-Domínguez CG (2011) Automatic selection of acoustic and non-linear dynamic features in voice signals for hypernasality detection. In: Interspeech. Citeseer, pp 529–532
Google Scholar
Orozco-Arroyave JR, Arias-Londoño JD, Bonilla JFV, Nöth E (2012) Automatic detection of hypernasal speech signals using nonlinear and entropy measurements. In: INTERSPEECH, pp 2029–2032
Google Scholar
Rah DK, Ko YI, Lee C, Kim DW (2001) A noninvasive estimation of hypernasality using a linear predictive model. Ann Biomed Eng 29(7):587–594
Article Google Scholar
Raykar VC, Yegnanarayana B, Prasanna SM, Duraiswami R (2005) Speaker localization using excitation source information in speech. IEEE Trans Speech Audio Process 13(5):751–761
Article Google Scholar
Rendón SM, Arroyave JO, Bonilla JV, Londoño JA, Domínguez CC (2011) Automatic detection of hypernasality in children. In: International work-conference on the interplay between natural and artificial computation. Springer, pp 167–174
Google Scholar
Sharma B, Prasanna SM (2016) Sonority measurement using system, source and suprasegmental information. IEEE/ACM Trans Audio Speech Lang Process
Google Scholar
Vijayalakshmi P, Reddy MR, O’Shaughnessy D (2007) Acoustic analysis and detection of hypernasality using a group delay function. IEEE Trans Biomed Eng 54(4):621–629
Article Google Scholar

Download references

Acknowledgements

The authors are very much thankful to Prof. M. Pushpavathi and Prof. Ajish K. Abraham from AIISH Mysore for sharing the hyernasal speech of children with cleft palate.

Author information

Authors and Affiliations

Koneru Lakshmaiah Education Foundation (Deemed to be University), Vaddeswaram, Guntur, Andhra Pradesh, 522502, India
Akhilesh Kumar Dubey
VSB Purvanchal University, Jaunpur, Uttar Pradesh, 222002, India
Deepak Kumar Singh & B. B. Tiwari

Authors

Akhilesh Kumar Dubey
View author publications
You can also search for this author in PubMed Google Scholar
Deepak Kumar Singh
View author publications
You can also search for this author in PubMed Google Scholar
B. B. Tiwari
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Akhilesh Kumar Dubey .

Editor information

Editors and Affiliations

Jahangirnagar University, Dhaka, Bangladesh
M. Shamim Kaiser
National Institute for Materials Science, Tsukuba, Japan
Anirban Bandyopadhyay
Amity University, Jaipur, India
Kanad Ray
Pranveer Singh Institute Of Technology, Kanpur, India
Raghvendra Singh
Pranveer Singh Institute Of Technology, Kanpur, India
Vishal Nagar

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Dubey, A.K., Singh, D.K., Tiwari, B.B. (2022). Significance of Source Information in Hypernasality Detection. In: Kaiser, M.S., Bandyopadhyay, A., Ray, K., Singh, R., Nagar, V. (eds) Proceedings of Trends in Electronics and Health Informatics. Lecture Notes in Networks and Systems, vol 376. Springer, Singapore. https://doi.org/10.1007/978-981-16-8826-3_7

Download citation

DOI: https://doi.org/10.1007/978-981-16-8826-3_7
Published: 22 March 2022
Publisher Name: Springer, Singapore
Print ISBN: 978-981-16-8825-6
Online ISBN: 978-981-16-8826-3
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics

Significance of Source Information in Hypernasality Detection