Skip to main content

Recognition of Repetition and Prolongation in Stuttered Speech Using ANN

  • Conference paper
  • First Online:

Part of the book series: Smart Innovation, Systems and Technologies ((SIST,volume 43))

Abstract

This paper mainly focuses on repetition and prolongation detection in stuttered speech signal. The acoustic and pitch related features like Mel-frequency cepstral coefficients (MFCCs), formants, pitch, zero crossing rate (ZCR) and Energy are used to test the effectiveness in recognizing repetitions and prolongations in stammered speech. Artificial Neural Networks (ANN) are used as classifier. The results are evaluated using combination of different features. The results show that the ANN classifier trained using MFCC features achieves an average accuracy of 87.39 % for repetition and prolongation recognition.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD   219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Van Riper, C.: The Nature of Stuttering. Prentice Hall, New Jersey (1971)

    Google Scholar 

  2. Czyzewski, Andrzej, Kaczmarek, Andrzej, Kostek, Bozena: Intelligent processing of stuttered speech. J. Intell. Inf. Syst. 21, 143–171 (2003)

    Article  Google Scholar 

  3. Kully, D., Boerg, E.: An investigation of inter-clinic agreement in the identification of fluent and stuttered syllables. J. Fluency disord. 13, 309–318 (1988)

    Article  Google Scholar 

  4. Conture, E.: International Conference on Intelligent and Advanced Systems, 2nd edn. Prentice-Hall, Englewood Cliffs (1990)

    Google Scholar 

  5. Lyons, J.: Mel frequency cepstral coefficient (MFCC) tutorial. http://practicalcryptography.com/miscellaneous/machine-learning/guide-mel-frequency-cepstral-coefficients-mfccs/

  6. Zhang, J., Dong, B., Yan, Y.: A computer-assist algorithm to detect repetitive stuttering automatically. In: International Conference on Asian Language Processing, pp. 249–252 (2013)

    Google Scholar 

  7. Sin Chee, L., Chia Ai, O., Hariharan, M.: MFCC based recognition of repetition and prolongations in stuttered speech using artificial k-nn and lda. In: IEEE Student Conference on Research and Development, pp. 146–149 (2009)

    Google Scholar 

  8. Ravikumar, K.M., Rajagopal, R., Nagaraj, H.C.: An approach for objective assessment of stuttered speech using MFCC features. ICGST Int. J. Digital Signal Process. 9, 19–24 (2009)

    Google Scholar 

  9. Chia Ai, O., Hariharan, M., Yaacob, S., Sin Chee, L.: Classification of speech dysfluencies with MFCC and LPCC features. J. Med. Syst. 39, 2157–2165 (2012)

    Google Scholar 

  10. Wisniewski, M., Kuniszyk, J.W., Smolka, E., Suszynski, W.: Automatic detection of disorders in a continuous speech with the hidden markov models approach. Comput. Recogn. Syst. 2(45), 445–453 (2007)

    Article  Google Scholar 

  11. Sin Chee, L., Chia Ai, O., Hariharan, M., Yaacob, S.: Automatic detection of prolongations and repetitions using LPCC. In; 2009 International Conference Technical Postgraduates (TECHPOS) (2009)

    Google Scholar 

  12. Tan, T.S., Liboh, H., Ariff, A.K., Ting, C.M., Salleh, H.: Application of malay speech technology in malay speech therapy assistance tools. Int. Conf. Intell. Adv. Syst. 48, 330–334 (2007)

    Google Scholar 

  13. Ravikumar, K.M., Balakrishna Reddy, Rajagopal, R., Nagaraj, H.C.: Automatic detection of syllable repetition in read speech for objective assessment of stuttered disfluencies. Proce. World Acad. Sci. 2, 220–223 (2008)

    Google Scholar 

  14. Rabiner, L., Juang, B., Yegnanarayana, B.: Fundamentals of Speech Recognition. Pearson, India (2010)

    Google Scholar 

  15. Welling, L., Ney, H.: Formant estimation for speech recognition. IEEE Trans. Speech Audio Process. 6, 36–48 (1998)

    Article  Google Scholar 

  16. IIT Guwahati.: Estimation of pitch from speech signal. http://iitg.vlab.co.in/

  17. Gevaert, W., Tsenov, G., Mladenov, V.: Neural networks used for speech recognition. J. Autom. Control 2, 732–735 (2010)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to P. S. Savin .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer India

About this paper

Cite this paper

Savin, P.S., Ramteke, P.B., Koolagudi, S.G. (2016). Recognition of Repetition and Prolongation in Stuttered Speech Using ANN. In: Nagar, A., Mohapatra, D., Chaki, N. (eds) Proceedings of 3rd International Conference on Advanced Computing, Networking and Informatics. Smart Innovation, Systems and Technologies, vol 43. Springer, New Delhi. https://doi.org/10.1007/978-81-322-2538-6_8

Download citation

  • DOI: https://doi.org/10.1007/978-81-322-2538-6_8

  • Published:

  • Publisher Name: Springer, New Delhi

  • Print ISBN: 978-81-322-2537-9

  • Online ISBN: 978-81-322-2538-6

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics