Automated Diagnosis and Assessment of Dysarthric Speech Using Relevant Prosodic Features

Kadi, Kamil Lahcene; Selouani, Sid Ahmed; Boudraa, Bachir; Boudraa, Malika

doi:10.1007/978-94-017-8832-8_38

Abstract

In this paper, linear discriminant analysis (LDA) is combined with two automatic classification approaches, the Gaussian mixture model (GMM) and the support vector machine (SVM), to automatically assess dysarthric speech. The front-end processing uses a set of prosodic features selected by LDA on the basis of their discriminative ability, with Wilks’ lambda as the significance measure of discriminant power. More than eight hundred sentences produced by nine American dysarthric speakers from the Nemours database are used throughout the experiments. The best classification rate, 93%, is achieved by the LDA/SVM system over four severity levels of dysarthria, ranging from not affected to the most seriously ill. This tool can help speech therapists and other clinicians diagnose, assess, and monitor dysarthria. Furthermore, it may reduce some of the costs associated with subjective tests.
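
As a rough illustration of the feature-ranking idea mentioned in the abstract, the following minimal Python sketch computes a univariate Wilks’ lambda (within-group sum of squares divided by total sum of squares) for each candidate feature and ranks the features by it. Values near 0 suggest strong separation between groups, and values near 1 suggest little or none. This is a sketch under stated assumptions, not the chapter's implementation: the function names, the feature names `mean_f0` and `jitter`, and the toy numbers are all hypothetical, and the multivariate, stepwise selection normally used with LDA is not reproduced here.

```python
from statistics import fmean


def wilks_lambda(values, labels):
    """Univariate Wilks' lambda: within-group SS divided by total SS."""
    grand_mean = fmean(values)
    total_ss = sum((v - grand_mean) ** 2 for v in values)
    if total_ss == 0:
        # A constant feature cannot separate the groups at all.
        return 1.0

    # Collect the values belonging to each group (here: severity level).
    groups = {}
    for value, label in zip(values, labels):
        groups.setdefault(label, []).append(value)

    # Sum the squared deviations of each value from its own group mean.
    within_ss = 0.0
    for members in groups.values():
        group_mean = fmean(members)
        within_ss += sum((v - group_mean) ** 2 for v in members)

    return within_ss / total_ss


def rank_features(table, labels):
    """Return (feature name, lambda) pairs, most discriminative first."""
    scores = {name: wilks_lambda(column, labels) for name, column in table.items()}
    return sorted(scores.items(), key=lambda item: item[1])


# Hypothetical toy data: four severity levels, two utterances each.
labels = [0, 0, 1, 1, 2, 2, 3, 3]
features = {
    "mean_f0": [1.0, 1.2, 2.0, 2.2, 3.0, 3.2, 4.0, 4.2],  # tracks the level
    "jitter": [0.5, 0.9, 0.6, 0.8, 0.5, 0.9, 0.6, 0.8],   # unrelated to the level
}

ranking = rank_features(features, labels)
for name, score in ranking:
    print(f"{name}: lambda = {score:.4f}")
```

In this toy example the invented `mean_f0` column follows the severity level closely, so its lambda is close to 0, while the invented `jitter` column has identical group means, so its lambda is close to 1. A selection step would keep the low-lambda features and pass them to a classifier such as an SVM or a GMM.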

Acknowledgments

This work was supported in part by the Natural Sciences and Engineering Research Council of Canada (NSERC).

Author information

Authors and Affiliations

Faculty of Electronics and Computer Science, University of Sciences and Technology Houari Boumediene, 32 El Alia, 16111, Bab Ezzouar Algiers, Algeria
Kamil Lahcene Kadi, Bachir Boudraa & Malika Boudraa
Department of Information Management, University of Moncton, Campus of Shippagan, 218 boulevard J.-D.-Gauthier Shippagan NB, E8S 1P6, Moncton, Canada
Sid Ahmed Selouani

Corresponding author

Correspondence to Kamil Lahcene Kadi.

Editor information

Editors and Affiliations

Department of Multimedia Engineering, College of Engineering, Mokpo National University, Mokpo, Jeonnam, Korea, Republic of (South Korea)
Gi-Chul Yang
Unit 1, 1/F IAENG Secretariat, International Association of Engine, Hong Kong, Hong Kong SAR
Sio-Iong Ao
Department of Applied Mathematics and Computing, Cranfield University, Cranfield, Bedfordshire, United Kingdom
Len Gelman

About this paper

Cite this paper

Kadi, K.L., Selouani, S.A., Boudraa, B., Boudraa, M. (2014). Automated Diagnosis and Assessment of Dysarthric Speech Using Relevant Prosodic Features. In: Yang, GC., Ao, SI., Gelman, L. (eds) Transactions on Engineering Technologies. Springer, Dordrecht. https://doi.org/10.1007/978-94-017-8832-8_38

DOI: https://doi.org/10.1007/978-94-017-8832-8_38
Published: 27 April 2014
Publisher Name: Springer, Dordrecht
Print ISBN: 978-94-017-8831-1
Online ISBN: 978-94-017-8832-8
eBook Packages: Engineering, Engineering (R0)