Skip to main content

Automatic Prediction of Word Form Reduction in Russian Spontaneous Speech

  • Conference paper
  • First Online:
Speech and Computer (SPECOM 2020)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 12335))

Included in the following conference series:

Abstract

The aim of this paper is to determine the main features of word forms that undergo reduction in Russian spontaneous speech using machine learning algorithms. We examined the factors proposed in the previous corpus-based studies, namely, the number of syllables, word frequency, part of speech, reduction of the preceding word. We used the texts from the Corpus of Russian Oral Speech as the data. The following machine learning algorithms were applied: extremely randomized tree, random forest and logistic regression. The results show that the higher the frequency of a word form is, the higher the chances are that it will be reduced; the more syllables a word form has, the higher probability of its reduction is. But we did not find the influence of the factor whether the previous word is reduced or not. Regarding the part-of-speech feature, the results are not that straightforward since we received different lists using different algorithms, but the adjective and parenthetical word were in both of them. Thus, we can conclude that adjectives and parenthetical words are likely to be reduced more often than other parts of speech.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Kohler, K.J.: Segmental reduction in connected speech in German: phonological facts and phonetic explanations. In: Hardcastle, W.J., Marchal, A. (eds.) Speech Production and Speech. NATO ASI Series, pp. 69–92. Springer, Dordrecht (1990). https://doi.org/10.1007/978-94-009-2037-8_4

    Chapter  Google Scholar 

  2. Brand, S., Ernestus, M.: Listeners’ processing of a given reduced word pronunciation variant directly reflects their exposure to this variant: evidence from native listeners and learners of French. Q. J. Exp. Psychol. 71, 1240–1259 (2018). https://doi.org/10.1080/17470218.2017.1313282

    Article  Google Scholar 

  3. Ernestus, M., Baayen, R.H., Schreuder, R.: The recognition of reduced word forms. Brain Lang. 81, 162–173 (2002). https://doi.org/10.1006/brln.2001.2514

    Article  Google Scholar 

  4. Riekhakaynen, E.: Reduction in spontaneous speech: How to survive. In: Heegart, J., Henrichsen, P.J. (eds.) New Perspectives on Speech in Action: Proceedings of the 2nd SJUSK Conference on Contemporary Speech Habits (Copenhagen Studies in Language 43), Samfundslitteratur, Frederiksberg, pp. 153–167 (2013)

    Google Scholar 

  5. Stoyka, D.A.: Reduced forms of Russian speech: linguistic and extralinguistic aspects. PhD thesis, Saint Petersburg (2016). (In Rus.)

    Google Scholar 

  6. Zemskaya, E.A. (ed.): Conversational Russian Speech. Nauka, Moscow (1973). (In Rus.)

    Google Scholar 

  7. Jurafski, D., Bell, A., Gregory, M., Raymond, W.D.: Probabilistic relations between words: evidence from reduction in lexical production. In: Bybee, J., Hopper, P. (eds.) Frequency and the Emergence of Linguistic, pp. 229–254. John Benjamins, Philadelphia (2000)

    Google Scholar 

  8. Jurafsky, D., Martin, J.H.: Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition. Prentice Hall, Upper Saddle River (2000)

    Google Scholar 

  9. Schiel, F., Stevens, M., Reichel, U., Cutugno F.: Machine learning of probabilistic phonological pronunciation rules from the Italian CLIPS corpus. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp. 1414–1418 (2013). https://doi.org/10.5282/ubm/epub.18046

  10. Vainio, M.: Phonetics and machine learning: hierarchical modelling of prosody in statistical speech synthesis. In: Besacier, L., Dediu, A.-H., Martín-Vide, C. (eds.) SLSP 2014. LNCS (LNAI), vol. 8791, pp. 37–54. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-11397-5_3

    Chapter  Google Scholar 

  11. Seide, F., Li, G., Yu, D.: Conversational speech transcription using context-dependent deep neural networks. In: Proceedings of Interspeech, pp. 437–440 (2011)

    Google Scholar 

  12. Ellis, D.P.W., Singh, R., Sivadas, S.: Tandem acoustic modeling in large-vocabulary recognition. In: IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings, Salt Lake City, USA, vol. 1, pp. 517–520 (2001). https://doi.org/10.1109/ICASSP.2001.940881

  13. Dahl, G., Yu, D., Deng, L., Acero, A.: Context-dependent pre-trained deep neural networks for large vocabulary speech recognition. IEEE Trans. Audio Speech Lang. Process. 20(1), 30–42 (2012). https://doi.org/10.1109/TASL.2011.2134090

    Article  Google Scholar 

  14. Zhang, Y., et al.: Towards end-to-end speech recognition with deep convolutional neural networks. In: Proceedings of Interspeech, pp. 410–414 (2016). https://doi.org/10.21437/Interspeech.2016-1446

  15. Ravanelli, M., Serdyuk, D., Bengio, Y.: Twin regularization for online speech recognition. In: Karpov, A., Potapova, R., Mporas, I. (eds.) Proceedings of Interspeech (2018). https://arxiv.org/pdf/1804.05374.pdf. Accessed 28 May 2020. https://doi.org/10.21437/Interspeech.2018-1407

  16. Maas, L., Xie, Z., Jurafsky, D., Ng, A.Y.: Lexicon-free conversational speech recognition with neural networks. In: Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 345–354. Curran Associates, New York (2015). https://doi.org/10.3115/v1/N15-1038

  17. Mizera, P., Pollak, P.: Improving of LVCSR for causal czech using publicly available language resources. In: Karpov, A., Potapova, R., Mporas, I. (eds.) SPECOM 2017. LNCS (LNAI), vol. 10458, pp. 427–437. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-66429-3_42

    Chapter  Google Scholar 

  18. Kipyatkova, I.: Improving Russian LVCSR using deep neural networks for acoustic and language modeling. In: Karpov, A., Jokisch, O., Potapova, R. (eds.) SPECOM 2018. LNCS (LNAI), vol. 11096, pp. 291–300. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-99579-3_31

    Chapter  Google Scholar 

  19. Markovnikov, N.M., Kipyatkova, I.S.: Researching methods of building encoder-decoder models for Russian speech. Data Manage. Syst. 4, 45–53 (2019). (in Russian)

    Google Scholar 

  20. Riekhakaynen, E.: Corpora of Russian spontaneous speech as a tool for modelling natural speech production and recognition. In: 10th Annual Computing and Communication Workshop and Conference, CCWC 2020, January 2020, pp. 406–411. IEEE, Las Vegas (2020). https://doi.org/10.1109/CCWC47524.2020.9031251

  21. Ventsov, A.V., Grudeva, E.V.: A Frequency Dictionary of Russian. CHSU Publishing House, Cherepovets (2008). (in Russian)

    Google Scholar 

  22. Shcherba, L.V.: About parts of speech in the Russian language. In: Language System and Speech Behaviour, pp. 77–100. Nauka, Leningrad (1974). (in Russian)

    Google Scholar 

  23. Harrington, P.: Machine Learning in Action. Manning Publications, New York (2012)

    Google Scholar 

Download references

Acknowledgements

The research is supported by the grant #19-012-00629 from the Russian Foundation for Basic Research.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Elena Riekhakaynen .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Dayter, M., Riekhakaynen, E. (2020). Automatic Prediction of Word Form Reduction in Russian Spontaneous Speech. In: Karpov, A., Potapova, R. (eds) Speech and Computer. SPECOM 2020. Lecture Notes in Computer Science(), vol 12335. Springer, Cham. https://doi.org/10.1007/978-3-030-60276-5_12

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-60276-5_12

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-60275-8

  • Online ISBN: 978-3-030-60276-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics