Skip to main content

Correlation Normalization of Syllables and Comparative Evaluation of Pronunciation Quality in Speech Rehabilitation

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 10458))

Abstract

The paper considers the solution of aligning syllables in time problem. This kind of normalization allows to compare different implementations of the same syllable. This allows us to talk about a comparative evaluation of the syllables pronunciation quality in the event that one of the syllables is a reference implementation. If a patient’s record before the operative treatment of oral cancer is used as such a syllable, a comparative assessment of the quality of pronunciation of syllables in the process of speech rehabilitation can be made. In the process of normalization, an approach aimed at maximizing the correlation between individual fragments of the syllable is applied. Then, as a measure of similarity between the reference and the estimated syllable, the correlation coefficient is used. The work demonstrates the validity of such a decision based on the processing of records from healthy people and patients before and after surgical treatment. The results of this work allow us to approach the implementation of an automated software system for assessing the quality of pronunciation of syllables and proceed to implement its working prototype.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Kaprin, A.D., Starinskiy, V.V., Petrova, G.V.: Status of cancer care the population of Russia in 2014. MNIOI name of P.A. Herzen, Moscow, 236 p. (2015)

    Google Scholar 

  2. Kaprin, A.D., Starinskiy, V.V., Petrova, G.V.: Malignancies in Russia in 2014 (Morbidity and mortality). MNIOI name of P.A. Herzen, Moscow, 250 p. (2015)

    Google Scholar 

  3. Standard GOST R 50840–95 Voice over paths of communication. Methods for assessing the quality, legibility and recognition. Publishing Standards, Moscow, 234 p. (1995)

    Google Scholar 

  4. Kostyuchenko, E., Roman, M., Ignatieva, D., Pyatkov, A., Choynzonov, E., Balatskaya, L.: Evaluation of the speech quality during rehabilitation after surgical treatment of the cancer of oral cavity and oropharynx based on a comparison of the fourier spectra. In: Ronzhin, A., Potapova, R., Németh, G. (eds.) SPECOM 2016. LNCS, vol. 9811, pp. 287–295. Springer, Cham (2016). doi:10.1007/978-3-319-43958-7_34

    Chapter  Google Scholar 

  5. Balatskaya, L.N., Choinzonov, E.L., Chizevskaya, S.Y., Kostyuchenko, E.U., Meshcheryakov, R.V.: Software for assessing voice quality in rehabilitation of patients after surgical treatment of cancer of oral cavity, oropharynx and upper jaw. In: Železný, M., Habernal, I., Ronzhin, A. (eds.) SPECOM 2013. LNCS, vol. 8113, pp. 294–301. Springer, Cham (2013). doi:10.1007/978-3-319-01931-4_39

    Chapter  Google Scholar 

  6. Kostyuchenko, E.Y., Mescheryakov, R.V., Balatskaya, L.N., Choynzonov, E.L.: Structure and database of software for speech quality and intelligibility assessment in the process of rehabilitation after surgery in the treatment of cancers of the oral cavity and oropharynx, maxillofacial area. In: Proceedings of SPIIRAN, vol. 32, pp. 116–124 (2014)

    Google Scholar 

  7. MedFind. Oncology. Plastic surgery in the surgical treatment of tumors of the face, jaws, http://medfind.ru/modules/sections/index.php?op=viewarticle&artid=324

  8. Sergienko, A.B.: Digital Signal Processing. Peter, St. Petersburg, 751 p. (2006)

    Google Scholar 

  9. Benesty, J., Sondhi, M.M., Huang, Y. (eds.): Springer Handbook of Speech Processing, 1176 p. Springer, Heidelberg (2008)

    Google Scholar 

  10. Rabiner, L.R., Schafer, R.W.: Introduction to Digital Speech Processing. Foundations and Trends in Signal Processing, 194 p. (2007)

    Google Scholar 

  11. Shuyin, Z., Ying, G., Buhong, W.: Auto-correlation property of speech and its application in voice activity detection. In: First International Workshop on Education Technology and Computer Science, ETCS 2009, pp. 265–268 (2009)

    Google Scholar 

  12. ITU-T Recommendation G.729. Coding of speech at 8 kbit/s using conjugate-structure algebraic-code-excited linear prediction (CS-ACELP) (2012)

    Google Scholar 

  13. Atal, B., Rabiner, L.: A pattern recognition approach to voiced-unvoiced-silence classification with applications to speech recognition. IEEE Trans. ASSP ASSP-24, 201–212 (1976)

    Google Scholar 

  14. Ronzhin, A.L., Karpov, A.A.: Russian voice interface. Pattern Recognit. Image Anal. 17(2), 321–336 (2007)

    Article  Google Scholar 

  15. Chu, W., Alwan, A.: A correlation-maximization denoising filter used as an enhancement frontend for noise robust bird call classification. In: Proceedings of Interspeech 2009, pp. 2831–2834 (2009)

    Google Scholar 

Download references

Acknowledgments

The study was performed by a grant from the Russian Science Foundation (project 16-15-00038).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Evgeny Kostyuchenko .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Kostyuchenko, E., Meshcheryakov, R., Ignatieva, D., Pyatkov, A., Choynzonov, E., Balatskaya, L. (2017). Correlation Normalization of Syllables and Comparative Evaluation of Pronunciation Quality in Speech Rehabilitation. In: Karpov, A., Potapova, R., Mporas, I. (eds) Speech and Computer. SPECOM 2017. Lecture Notes in Computer Science(), vol 10458. Springer, Cham. https://doi.org/10.1007/978-3-319-66429-3_25

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-66429-3_25

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-66428-6

  • Online ISBN: 978-3-319-66429-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics