Repetition Detection in Stuttered Speech

Ramteke, Pravin B.; Koolagudi, Shashidhar G.; Afroz, Fathima

doi:10.1007/978-81-322-2538-6_63

Part of the book series: Smart Innovation, Systems and Technologies (SIST, volume 43)

Abstract

This paper focuses on the detection of repetitions in stuttered speech. The stuttered speech signal is divided into isolated units based on energy. Mel-frequency cepstral coefficients (MFCCs), formants and shimmer are extracted from each isolated unit and used as features for repetition recognition. Using Dynamic Time Warping (DTW), the features of each isolated unit are compared with those of the subsequent units within a one-second interval of speech. A threshold is set based on an analysis of the DTW scores; if the score for a pair of units falls below this threshold, the units are identified as repeated events. The speech data used in this work, twenty-seven seconds in total, contains 50 repetition events. The results show that the combination of MFCCs, formants and shimmer can be used to recognise repetitions in stuttered speech: out of 50 repetitions, 47 are correctly identified.
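
The comparison step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes that each isolated unit is already available as a (frames x dimensions) feature matrix, that the one-second interval is measured between unit start times, and that the DTW score is a length-normalised cost with Euclidean frame distances. The function names, the synthetic features and the threshold value are all hypothetical.

```python
import numpy as np


def dtw_distance(a, b):
    """Length-normalised DTW cost between two feature sequences.

    a, b: arrays of shape (frames, dims). The Euclidean frame distance and
    the (n + m) normalisation are assumptions, not taken from the paper.
    """
    a = np.asarray(a, dtype=float)
    b = np.asarray(b, dtype=float)
    n, m = len(a), len(b)
    # Distance between every frame of a and every frame of b.
    local = np.linalg.norm(a[:, None, :] - b[None, :, :], axis=2)
    # acc[i, j] = cheapest cost of aligning a[:i] with b[:j].
    acc = np.full((n + 1, m + 1), np.inf)
    acc[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            acc[i, j] = local[i - 1, j - 1] + min(
                acc[i - 1, j], acc[i, j - 1], acc[i - 1, j - 1]
            )
    return acc[n, m] / (n + m)


def find_repetitions(units, threshold, window=1.0):
    """Return index pairs (i, j) flagged as repeated events.

    units: list of (start_time_in_seconds, features), sorted by start time.
    Unit i is compared only with later units that start within `window`
    seconds of it; a pair is flagged when its DTW score is below `threshold`.
    """
    pairs = []
    for i, (t_i, f_i) in enumerate(units):
        for j in range(i + 1, len(units)):
            t_j, f_j = units[j]
            if t_j - t_i > window:
                break  # units are sorted, so later ones are further away
            if dtw_distance(f_i, f_j) < threshold:
                pairs.append((i, j))
    return pairs


# Synthetic stand-ins for per-unit features (13 dimensions, as for MFCCs).
rng = np.random.default_rng(0)
syllable = rng.normal(size=(8, 13))
other = rng.normal(size=(10, 13)) + 5.0
units = [
    (0.00, syllable),
    (0.30, syllable + 0.01 * rng.normal(size=(8, 13))),  # near-copy of unit 0
    (0.60, other),                                        # unrelated unit
    (2.00, syllable),                                     # outside the window
]
print(find_repetitions(units, threshold=0.5))
```

In this toy input only units 0 and 1 are flagged: unit 2 is acoustically different, and unit 3, although identical to unit 0, starts more than one second later. In practice the threshold would have to be chosen from the observed DTW scores, as the abstract describes.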

Author information

Authors and Affiliations

National Institute of Technology Karnataka, Surathkal, 575 025, Karnataka, India
Pravin B. Ramteke, Shashidhar G. Koolagudi & Fathima Afroz

Corresponding author

Correspondence to Pravin B. Ramteke.

Editor information

Editors and Affiliations

Department of Computer Science, Liverpool Hope University, Liverpool, United Kingdom
Atulya Nagar
Dept. of Computer Science and Engineering, National Institute of Technology Rourkela, Rourkela, Odisha, India
Durga Prasad Mohapatra
Computer Science & Engineering, University of Calcutta, Kolkata, West Bengal, India
Nabendu Chaki

About this paper

Cite this paper

Ramteke, P.B., Koolagudi, S.G., Afroz, F. (2016). Repetition Detection in Stuttered Speech. In: Nagar, A., Mohapatra, D., Chaki, N. (eds) Proceedings of 3rd International Conference on Advanced Computing, Networking and Informatics. Smart Innovation, Systems and Technologies, vol 43. Springer, New Delhi. https://doi.org/10.1007/978-81-322-2538-6_63

DOI: https://doi.org/10.1007/978-81-322-2538-6_63
Published: 08 October 2015
Publisher Name: Springer, New Delhi
Print ISBN: 978-81-322-2537-9
Online ISBN: 978-81-322-2538-6
eBook Packages: Engineering, Engineering (R0)
