Articulation Analysis in the Speech of Children with Cleft Lip and Palate
Hypernasality is a speech deficit that affects children with cleft lip and palate (CLP). It is characterized by poor control of the velum, which makes it difficult to regulate the amount of air passing from the oral to the nasal cavity during speech. The automatic evaluation of hypernasality could help in monitoring speech-language therapies and in designing better-targeted exercises. Several articulation features have been used for the automatic detection of hypernasal speech. This paper evaluates the suitability of classical articulation features for the automatic classification of hypernasal and healthy speech recordings. Two databases are considered, with recordings collected under different acoustic conditions and with different audio settings. Besides the evaluation of the proposed approach on each database separately, non-parametric statistical tests are performed to assess whether features from the two databases can be merged, with the aim of building more robust systems that could be used in different acoustic conditions. The results show that the proposed approach has high sensitivity, which suggests that it is suitable for detecting hypernasal speech samples. We believe that promising results could be obtained with this approach in future experiments where the degree of hypernasality is evaluated.
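To make the sensitivity claim concrete, the following is a minimal sketch of how sensitivity and specificity are typically computed for a two-class problem such as hypernasal versus healthy speech. It is not the authors' evaluation code: the function name, the label encoding (1 = hypernasal, 0 = healthy), and the toy labels are illustrative assumptions.

```python
def sensitivity_specificity(y_true, y_pred):
    """Return (sensitivity, specificity) for binary labels.

    Assumed encoding (hypothetical): 1 = hypernasal, 0 = healthy.
    """
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")

    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # hypernasal, detected
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # hypernasal, missed
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # healthy, accepted
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # healthy, flagged

    # Sensitivity: fraction of hypernasal recordings that are detected.
    sensitivity = tp / (tp + fn) if (tp + fn) else float("nan")
    # Specificity: fraction of healthy recordings that are accepted.
    specificity = tn / (tn + fp) if (tn + fp) else float("nan")
    return sensitivity, specificity


# Toy labels, invented purely for illustration.
y_true = [1, 1, 1, 1, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 0, 0, 1, 1]
sens, spec = sensitivity_specificity(y_true, y_pred)
print(f"sensitivity={sens:.2f}, specificity={spec:.2f}")
```

A classifier with high sensitivity misses few hypernasal recordings, which is the property the abstract highlights. Specificity is reported alongside it here only to show the complementary error on healthy recordings.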
Keywords: Cleft lip and palate, Hypernasality, Articulation measures, Classification
This work was partially funded by CODI at UdeA grant # PRG2018-23541 and SOS18-2-01_ES84180137.