Recognition of Repetition and Prolongation in Stuttered Speech Using ANN

Savin, P. S.; Ramteke, Pravin B.; Koolagudi, Shashidhar G.

doi:10.1007/978-81-322-2538-6_8

Conference paper
First Online: 01 January 2015

Part of the book series: Smart Innovation, Systems and Technologies (SIST, volume 43)

Abstract

This paper focuses on detecting repetitions and prolongations in stuttered speech signals. Acoustic and pitch-related features, namely Mel-frequency cepstral coefficients (MFCCs), formants, pitch, zero crossing rate (ZCR) and energy, are tested for their effectiveness in recognizing repetitions and prolongations in stuttered speech. An artificial neural network (ANN) is used as the classifier. Results are evaluated for different combinations of features. The ANN classifier trained on MFCC features achieves an average accuracy of 87.39% for repetition and prolongation recognition.
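Two of the features named in the abstract, short-time energy and zero crossing rate, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the frame length of 400 samples, and the hop of 160 samples are assumptions chosen only for illustration (they correspond to 25 ms frames with a 10 ms hop at an assumed 16 kHz sampling rate).

```python
import math


def frame_signal(samples, frame_len, hop):
    """Split a sequence of samples into overlapping frames (no padding)."""
    return [samples[i:i + frame_len]
            for i in range(0, len(samples) - frame_len + 1, hop)]


def short_time_energy(frame):
    """Mean squared amplitude of one frame."""
    return sum(x * x for x in frame) / len(frame)


def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ."""
    crossings = sum(1 for a, b in zip(frame, frame[1:])
                    if (a >= 0) != (b >= 0))
    return crossings / (len(frame) - 1)


def frame_features(samples, frame_len=400, hop=160):
    """Return one (energy, ZCR) pair per frame; defaults are assumed values."""
    return [(short_time_energy(f), zero_crossing_rate(f))
            for f in frame_signal(samples, frame_len, hop)]


if __name__ == "__main__":
    # Synthetic 100 Hz tone, 0.1 s long, at an assumed 16 kHz sampling rate.
    sample_rate = 16000
    tone = [math.sin(2 * math.pi * 100 * n / sample_rate)
            for n in range(sample_rate // 10)]
    for energy, zcr in frame_features(tone)[:3]:
        print(f"energy={energy:.4f} zcr={zcr:.4f}")
```

Per-frame values such as these would typically be combined with MFCC, formant, and pitch features into a single vector before being passed to a classifier; how the paper combines them is not stated in the abstract.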

Author information

Authors and Affiliations

National Institute of Technology Karnataka, Surathkal, 575025, Karnataka, India
P. S. Savin, Pravin B. Ramteke & Shashidhar G. Koolagudi

Corresponding author

Correspondence to P. S. Savin.

Editor information

Editors and Affiliations

Department of Computer Science, Liverpool Hope University, Liverpool, United Kingdom
Atulya Nagar
Dept. of Computer Science and Engineering, National Institute of Technology Rourkela, Rourkela, Odisha, India
Durga Prasad Mohapatra
Computer Science & Engineering, University of Calcutta, Kolkata, West Bengal, India
Nabendu Chaki

About this paper

Cite this paper

Savin, P.S., Ramteke, P.B., Koolagudi, S.G. (2016). Recognition of Repetition and Prolongation in Stuttered Speech Using ANN. In: Nagar, A., Mohapatra, D., Chaki, N. (eds) Proceedings of 3rd International Conference on Advanced Computing, Networking and Informatics. Smart Innovation, Systems and Technologies, vol 43. Springer, New Delhi. https://doi.org/10.1007/978-81-322-2538-6_8

DOI: https://doi.org/10.1007/978-81-322-2538-6_8
Published: 08 October 2015
Publisher Name: Springer, New Delhi
Print ISBN: 978-81-322-2537-9
Online ISBN: 978-81-322-2538-6
eBook Packages: Engineering, Engineering (R0)
