Automatic Prediction of Word Form Reduction in Russian Spontaneous Speech

Dayter, Maria; Riekhakaynen, Elena

doi:10.1007/978-3-030-60276-5_12

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 12335))

Included in the following conference series:

International Conference on Speech and Computer

1577 Accesses
1 Citations

Abstract

The aim of this paper is to determine the main features of word forms that undergo reduction in Russian spontaneous speech using machine learning algorithms. We examined the factors proposed in the previous corpus-based studies, namely, the number of syllables, word frequency, part of speech, reduction of the preceding word. We used the texts from the Corpus of Russian Oral Speech as the data. The following machine learning algorithms were applied: extremely randomized tree, random forest and logistic regression. The results show that the higher the frequency of a word form is, the higher the chances are that it will be reduced; the more syllables a word form has, the higher probability of its reduction is. But we did not find the influence of the factor whether the previous word is reduced or not. Regarding the part-of-speech feature, the results are not that straightforward since we received different lists using different algorithms, but the adjective and parenthetical word were in both of them. Thus, we can conclude that adjectives and parenthetical words are likely to be reduced more often than other parts of speech.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

What Causes Phonetic Reduction in Russian Speech: New Evidence from Machine Learning Algorithms

Disfluency Insertion for Spontaneous TTS: Formalization and Proof of Concept

Linguistic Features and Sociolinguistic Variability in Everyday Spoken Russian

References

Kohler, K.J.: Segmental reduction in connected speech in German: phonological facts and phonetic explanations. In: Hardcastle, W.J., Marchal, A. (eds.) Speech Production and Speech. NATO ASI Series, pp. 69–92. Springer, Dordrecht (1990). https://doi.org/10.1007/978-94-009-2037-8_4
Chapter Google Scholar
Brand, S., Ernestus, M.: Listeners’ processing of a given reduced word pronunciation variant directly reflects their exposure to this variant: evidence from native listeners and learners of French. Q. J. Exp. Psychol. 71, 1240–1259 (2018). https://doi.org/10.1080/17470218.2017.1313282
Article Google Scholar
Ernestus, M., Baayen, R.H., Schreuder, R.: The recognition of reduced word forms. Brain Lang. 81, 162–173 (2002). https://doi.org/10.1006/brln.2001.2514
Article Google Scholar
Riekhakaynen, E.: Reduction in spontaneous speech: How to survive. In: Heegart, J., Henrichsen, P.J. (eds.) New Perspectives on Speech in Action: Proceedings of the 2nd SJUSK Conference on Contemporary Speech Habits (Copenhagen Studies in Language 43), Samfundslitteratur, Frederiksberg, pp. 153–167 (2013)
Google Scholar
Stoyka, D.A.: Reduced forms of Russian speech: linguistic and extralinguistic aspects. PhD thesis, Saint Petersburg (2016). (In Rus.)
Google Scholar
Zemskaya, E.A. (ed.): Conversational Russian Speech. Nauka, Moscow (1973). (In Rus.)
Google Scholar
Jurafski, D., Bell, A., Gregory, M., Raymond, W.D.: Probabilistic relations between words: evidence from reduction in lexical production. In: Bybee, J., Hopper, P. (eds.) Frequency and the Emergence of Linguistic, pp. 229–254. John Benjamins, Philadelphia (2000)
Google Scholar
Jurafsky, D., Martin, J.H.: Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition. Prentice Hall, Upper Saddle River (2000)
Google Scholar
Schiel, F., Stevens, M., Reichel, U., Cutugno F.: Machine learning of probabilistic phonological pronunciation rules from the Italian CLIPS corpus. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp. 1414–1418 (2013). https://doi.org/10.5282/ubm/epub.18046
Vainio, M.: Phonetics and machine learning: hierarchical modelling of prosody in statistical speech synthesis. In: Besacier, L., Dediu, A.-H., Martín-Vide, C. (eds.) SLSP 2014. LNCS (LNAI), vol. 8791, pp. 37–54. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-11397-5_3
Chapter Google Scholar
Seide, F., Li, G., Yu, D.: Conversational speech transcription using context-dependent deep neural networks. In: Proceedings of Interspeech, pp. 437–440 (2011)
Google Scholar
Ellis, D.P.W., Singh, R., Sivadas, S.: Tandem acoustic modeling in large-vocabulary recognition. In: IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings, Salt Lake City, USA, vol. 1, pp. 517–520 (2001). https://doi.org/10.1109/ICASSP.2001.940881
Dahl, G., Yu, D., Deng, L., Acero, A.: Context-dependent pre-trained deep neural networks for large vocabulary speech recognition. IEEE Trans. Audio Speech Lang. Process. 20(1), 30–42 (2012). https://doi.org/10.1109/TASL.2011.2134090
Article Google Scholar
Zhang, Y., et al.: Towards end-to-end speech recognition with deep convolutional neural networks. In: Proceedings of Interspeech, pp. 410–414 (2016). https://doi.org/10.21437/Interspeech.2016-1446
Ravanelli, M., Serdyuk, D., Bengio, Y.: Twin regularization for online speech recognition. In: Karpov, A., Potapova, R., Mporas, I. (eds.) Proceedings of Interspeech (2018). https://arxiv.org/pdf/1804.05374.pdf. Accessed 28 May 2020. https://doi.org/10.21437/Interspeech.2018-1407
Maas, L., Xie, Z., Jurafsky, D., Ng, A.Y.: Lexicon-free conversational speech recognition with neural networks. In: Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 345–354. Curran Associates, New York (2015). https://doi.org/10.3115/v1/N15-1038
Mizera, P., Pollak, P.: Improving of LVCSR for causal czech using publicly available language resources. In: Karpov, A., Potapova, R., Mporas, I. (eds.) SPECOM 2017. LNCS (LNAI), vol. 10458, pp. 427–437. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-66429-3_42
Chapter Google Scholar
Kipyatkova, I.: Improving Russian LVCSR using deep neural networks for acoustic and language modeling. In: Karpov, A., Jokisch, O., Potapova, R. (eds.) SPECOM 2018. LNCS (LNAI), vol. 11096, pp. 291–300. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-99579-3_31
Chapter Google Scholar
Markovnikov, N.M., Kipyatkova, I.S.: Researching methods of building encoder-decoder models for Russian speech. Data Manage. Syst. 4, 45–53 (2019). (in Russian)
Google Scholar
Riekhakaynen, E.: Corpora of Russian spontaneous speech as a tool for modelling natural speech production and recognition. In: 10th Annual Computing and Communication Workshop and Conference, CCWC 2020, January 2020, pp. 406–411. IEEE, Las Vegas (2020). https://doi.org/10.1109/CCWC47524.2020.9031251
Ventsov, A.V., Grudeva, E.V.: A Frequency Dictionary of Russian. CHSU Publishing House, Cherepovets (2008). (in Russian)
Google Scholar
Shcherba, L.V.: About parts of speech in the Russian language. In: Language System and Speech Behaviour, pp. 77–100. Nauka, Leningrad (1974). (in Russian)
Google Scholar
Harrington, P.: Machine Learning in Action. Manning Publications, New York (2012)
Google Scholar

Download references

Acknowledgements

The research is supported by the grant #19-012-00629 from the Russian Foundation for Basic Research.

Author information

Authors and Affiliations

Saint-Petersburg State University, Universitetskaya emb. 7/9, 199034, St. Petersburg, Russia
Maria Dayter & Elena Riekhakaynen

Authors

Maria Dayter
View author publications
You can also search for this author in PubMed Google Scholar
Elena Riekhakaynen
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Elena Riekhakaynen .

Editor information

Editors and Affiliations

St. Petersburg Institute for Informatics and Automation, Russian Academy of Sciences, St. Petersburg, Russia
Alexey Karpov
Institute for Applied and Mathematical Linguistics, Moscow State Linguistic University, Moscow, Russia
Rodmonga Potapova

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Dayter, M., Riekhakaynen, E. (2020). Automatic Prediction of Word Form Reduction in Russian Spontaneous Speech. In: Karpov, A., Potapova, R. (eds) Speech and Computer. SPECOM 2020. Lecture Notes in Computer Science(), vol 12335. Springer, Cham. https://doi.org/10.1007/978-3-030-60276-5_12

Download citation

DOI: https://doi.org/10.1007/978-3-030-60276-5_12
Published: 29 September 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-60275-8
Online ISBN: 978-3-030-60276-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Automatic Prediction of Word Form Reduction in Russian Spontaneous Speech

Abstract

Access this chapter

Similar content being viewed by others

What Causes Phonetic Reduction in Russian Speech: New Evidence from Machine Learning Algorithms

Disfluency Insertion for Spontaneous TTS: Formalization and Proof of Concept

Linguistic Features and Sociolinguistic Variability in Everyday Spoken Russian

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Automatic Prediction of Word Form Reduction in Russian Spontaneous Speech

Abstract

Access this chapter

Similar content being viewed by others

What Causes Phonetic Reduction in Russian Speech: New Evidence from Machine Learning Algorithms

Disfluency Insertion for Spontaneous TTS: Formalization and Proof of Concept

Linguistic Features and Sociolinguistic Variability in Everyday Spoken Russian

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation