Skip to main content
Log in

Evaluation of Supervised Learning Algorithms Based on Speech Features as Predictors to the Diagnosis of Mild to Moderate Intellectual Disability

  • 3DR Express
  • Published:
3D Research

Abstract

Due to age-bound onset of symptoms used for diagnosis of mild to moderate intellectual disability, early diagnosis of these problems has long been a difficult issue. The diagnosis includes tests pertaining to intellectual functioning and adaptive behaviours including communication skills etc. In this paper, it is proposed to use speech features as an early indicator of the disorder which can be used to train machine learning algorithms for differentiating between speech of normally developing children and children with intellectual disability. In this paper, speech abnormalities are quantified using acoustic parameters including Linear Predictive Cepstral Coefficients, Mel Frequency Cepstral Coefficients and spectral features in speech samples of 48 participants (24 with intellectual disability and 24 age-matched controls). A training dataset was created by extracting these features which was used for learning by various classifiers. The experiments show promising results where Support Vector Machine gives an accuracy of 98%. Consequently, a well-trained classification algorithm can be used as an aid in early detection of mild to moderate intellectual disability.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3

Similar content being viewed by others

References

  1. Abbeduto, L., & Nuccio, J. B. (1991). Relation between receptive language and cognitive maturity in persons with mental retardation. American Journal of Mental Retardation, 96, 143–149.

    Google Scholar 

  2. Abbeduto, L., Furman, L., & Davies, B. (1989). Relation between the receptive language and mental age of persons with mental retardation. American Journal of Mental Retardation, 93, 535–543.

    Google Scholar 

  3. Aggarwal, G., & Singh, L. (2018). Classification of intellectual disability using LPC, LPCC, and WLPCC parameterization techniques. International Journal of Computers and Applications. https://doi.org/10.1080/1206212X.2018.1475330.

    Article  Google Scholar 

  4. Ai, O. C., Hariharan, M., Yaacob, S., & Chee, L. S. (2012). Classification of speech dysfluencies with MFCC and LPCC features. Expert Systems with Applications, 39(2), 2157–2165.

    Article  Google Scholar 

  5. American Psychiatric Association. (2013). Diagnostic and statistical manual of mental disorders (DSM-5 ® ). Arlington: American Psychiatric Publishing.

    Book  Google Scholar 

  6. Antoniol, G., Rollo, V. F., & Venturi, G. (2005, May). Linear predictive coding and cepstrum coefficients for mining time variant information from software repositories. In ACM SIGSOFT software engineering notes (Vol. 30, No. 4, pp. 1–5). ACM.

  7. Batshaw, M. L. (2002). Children with disabilities (5th ed.). Baltimore: Brookes.

    Google Scholar 

  8. Belva, B. C., Matson, J. L., Sipes, M., & Bamburg, J. W. (2012). An examination of specific communication deficits in adults with profound intellectual disabilities. Research in Developmental Disabilities, 33, 525–529.

    Article  Google Scholar 

  9. Berry, P. B. (1972). Comprehension of possessive and present continuous sentences by nonretarded, mildly retarded, and severely retarded children. American Journal of Mental Deficiency, 76, 540–544.

    Google Scholar 

  10. Biagetti, G., Crippa, P., Curzi, A., Orcioni, S., & Turchetti, C. (2015, January). Speaker identification with short sequences of speech frames. In Proceedings of the international conference on pattern recognition applications and methods (Vol. 2, pp. 178–185). SciTePress, Setúbal.

  11. Biagetti, G., Crippa, P., Falaschetti, L., Orcioni, S., & Turchetti, C. (2017). An investigation on the accuracy of truncated DKLT representation for speaker identification with short sequences of speech frames. IEEE transactions on cybernetics, 47(12), 4235–4249.

    Article  Google Scholar 

  12. Bourlard, H., & Wellekens, C. J. (1989). Speech pattern discrimination and multilayer perceptrons. Computer Speech & Language, 3(1), 1–19.

    Article  Google Scholar 

  13. Cabanas, R. (1954). Some findings in speech and voice therapy among mentally deficient children. Folia Phoniatrica et Logopaedica, 6(1), 34–37.

    Article  Google Scholar 

  14. Cheslock, M. A., Barton-Hulsey, A., Romski, M., & Sevcik, R. A. (2008). Using a speech-generating device to enhance communicative abilities for an adult with moderate intellectual disability. Journal of Intellectual and Developmental Disability, 46, 376–386.

    Article  Google Scholar 

  15. Clegg, J., Hollis, C., Mawhood, L., & Rutter, M. (2005). Developmental language disorders—A follow-up in later adult life. Cognitive, language and psychosocial outcomes. Journal of Child Psychology and Psychiatry, 46(2), 128–149.

    Article  Google Scholar 

  16. Davis, S. B., & Mermelstein, P. (1990). Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences. In Readings in speech recognition (pp. 65–74).

    Chapter  Google Scholar 

  17. Griffiths, R. (1970). The ability of young children. A study in mental measurement. London: University of London Press.

    Google Scholar 

  18. Harel, S., Greenstein, Y., Kramer, U., Yifat, R., Samuel, E., Nevo, Y., et al. (1996). Clinical characteristics of children referred to a child development center for evaluation of speech, language, and communication disorders. Pediatric Neurology, 15(4), 305–311.

    Article  Google Scholar 

  19. Hariharan, M., Chee, L. S., Ai, O. C., & Yaacob, S. (2012). Classification of speech dysfluencies using LPC based parameterization techniques. Journal of Medical Systems, 36(3), 1821–1830.

    Article  Google Scholar 

  20. Harrison, P., & Oakland, T. (2003). Adaptive behavior assessment system (ABAS-II). San Antonio, TX: The Psychological Corporation.

    Google Scholar 

  21. Hau, C. C. (Ed.). (2015). Handbook of pattern recognition and computer vision. Singapore: World Scientific.

    Google Scholar 

  22. Jothilakshmi, S., Ramalingam, V., & Palanivel, S. (2009). Unsupervised speaker segmentation with residual phase and MFCC features. Expert Systems with Applications, 36(6), 9799–9804.

    Article  Google Scholar 

  23. Kail, R. (1992). General slowing of information-processing by persons with mental retardation. American Journal of Mental Retardation, 97, 333–341.

    Google Scholar 

  24. Koul, R., & Clapsaddle, K. C. (2006). Effects of repeated listening experiences on the perception of synthetic speech by individuals with mild-to-moderate intellectual disabilities. Augmentative & Alternative Communication, 22, 112–122.

    Article  Google Scholar 

  25. Lehiste, I., & Lass, N. J. (1976). Suprasegmental features of speech. Contemporary Issues in Experimental Phonetics, 225, 239.

    Google Scholar 

  26. Lesser, R., & Hassip, S. (1986). Knowledge and opinions of speech therapy in teachers, doctors and nurses. Child: Care, Health and Development, 12(4), 235–249.

    Google Scholar 

  27. Lynch, M. P., Oller, D. K., Eilers, R. E., & Basinger, D. (1990). Vocal development of infants with Down syndrome. In University of Wisconsin 11th annual symposium for research on child language disorders, Madison, WI.

  28. Maber-Aleksandrowicz, S., Avent, C., & Hassiotis, A. (2016). A systematic review of animal-assisted therapy on psychosocial outcomes in people with intellectual disability. Research in Developmental Disabilities, 49, 322–338.

    Article  Google Scholar 

  29. McLean, L. K., Brady, N. C., McLean, J. E., & Behrens, G. A. (1999). Communication forms and functions of children and adults with severe mental retardation in community and institutional settings. Journal of Speech, Language, and Hearing Research, 42, 231–240.

    Article  Google Scholar 

  30. Memisevic, H., & Hadzic, S. (2013). Speech and language disorders in children with intellectual disability in Bosnia and Herzegovina. Disability, CBR & Inclusive Development, 24(2), 92–99.

    Article  Google Scholar 

  31. Merrill, E. C., & Jackson, T. S. (1992). Degree of associative relatedness and sentence processing by adolescents with and without mental retardation. American Journal of Mental Retardation, 97, 173–185.

    Google Scholar 

  32. Merrill, E. C., & Jackson, T. S. (1992). Sentence processing by adolescents with and without mental retardation. American Journal of Mental Retardation, 97, 342–350.

    Google Scholar 

  33. Merrill, E. C., & Mar, H. H. (1987). Differences between mentally retarded and nonretarded persons’ efficiency of auditory sentence processing. American Journal of Mental Deficiency, 91, 406–414.

    Google Scholar 

  34. Mervis, C. B., & Becerra, A. M. (2007). Language and communicative development in Williams syndrome. Mental Retardation and Developmental Disabilities Research Reviews, 13(1), 3–15.

    Article  Google Scholar 

  35. Mervis, C. B., & Cicchetti, D. (1990). Early conceptual development of children with Down syndrome. In Children with Down syndrome: A developmental perspective (pp. 252–301).

  36. Rabiner, L. R., & Juang, B. H. (1993). Fundamentals of speech recognition (Vol. 14). Englewood Cliffs: PTR Prentice Hall.

    Google Scholar 

  37. Räsänen, O., & Pohjalainen, J. (2013). Random subset feature selection in automatic recognition of developmental disorders, affective states, and level of conflict from speech. In INTERSPEECH (pp. 210–214).

  38. Rumelhart, D. E., Hinton, G. E., & Williams, R. J. (1986). Learning representations by back-propagating errors. Nature, 323(6088), 533.

    Article  MATH  Google Scholar 

  39. Schalock, R. L., Borthwick-Duffy, S. A., Bradley, V. J., Buntinx, W. H., Coulter, D. L., Craig, E. M., et al. (2010). Intellectual disability: Definition, classification, and systems of supports. Washington, DC: American Association on Intellectual and Developmental Disabilities.

    Google Scholar 

  40. Tager-Flusberg, H. E. L. E. N., & Sullivan, K. (1998). Children with mental retardation. In Handbook of mental retardation and development (p. 208–239).

  41. Vapnik, V. N. (1999). An overview of statistical learning theory. IEEE Transactions on Neural Networks, 10(5), 988–999.

    Article  Google Scholar 

  42. Wechsler, D. (2012). WPPSI-IV: Wechsler Preschool and Primary Scale of Intelligence. London: Pearson, Psychological Corporation.

    Google Scholar 

  43. Xiao, X., Chng, E. S., & Li, H. (2008). Normalization of the speech modulation spectra for robust speech recognition. IEEE Transactions on Audio, Speech and Language Processing, 16(8), 1662–1674.

    Article  Google Scholar 

  44. Zhong, J., Hu, W., Soong, F., & Meng, H. (2017). DNN i-vector speaker verification with short, text-constrained test utterances. In Proceedings of Interspeech 2017 (pp. 1507–1511).

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Gaurav Aggarwal.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Aggarwal, G., Singh, L. Evaluation of Supervised Learning Algorithms Based on Speech Features as Predictors to the Diagnosis of Mild to Moderate Intellectual Disability. 3D Res 9, 55 (2018). https://doi.org/10.1007/s13319-018-0207-6

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s13319-018-0207-6

Keywords

Navigation