Abstract
The aim of this paper is to determine the main features of word forms that undergo reduction in Russian spontaneous speech using machine learning algorithms. We examined the factors proposed in the previous corpus-based studies, namely, the number of syllables, word frequency, part of speech, reduction of the preceding word. We used the texts from the Corpus of Russian Oral Speech as the data. The following machine learning algorithms were applied: extremely randomized tree, random forest and logistic regression. The results show that the higher the frequency of a word form is, the higher the chances are that it will be reduced; the more syllables a word form has, the higher probability of its reduction is. But we did not find the influence of the factor whether the previous word is reduced or not. Regarding the part-of-speech feature, the results are not that straightforward since we received different lists using different algorithms, but the adjective and parenthetical word were in both of them. Thus, we can conclude that adjectives and parenthetical words are likely to be reduced more often than other parts of speech.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Kohler, K.J.: Segmental reduction in connected speech in German: phonological facts and phonetic explanations. In: Hardcastle, W.J., Marchal, A. (eds.) Speech Production and Speech. NATO ASI Series, pp. 69–92. Springer, Dordrecht (1990). https://doi.org/10.1007/978-94-009-2037-8_4
Brand, S., Ernestus, M.: Listeners’ processing of a given reduced word pronunciation variant directly reflects their exposure to this variant: evidence from native listeners and learners of French. Q. J. Exp. Psychol. 71, 1240–1259 (2018). https://doi.org/10.1080/17470218.2017.1313282
Ernestus, M., Baayen, R.H., Schreuder, R.: The recognition of reduced word forms. Brain Lang. 81, 162–173 (2002). https://doi.org/10.1006/brln.2001.2514
Riekhakaynen, E.: Reduction in spontaneous speech: How to survive. In: Heegart, J., Henrichsen, P.J. (eds.) New Perspectives on Speech in Action: Proceedings of the 2nd SJUSK Conference on Contemporary Speech Habits (Copenhagen Studies in Language 43), Samfundslitteratur, Frederiksberg, pp. 153–167 (2013)
Stoyka, D.A.: Reduced forms of Russian speech: linguistic and extralinguistic aspects. PhD thesis, Saint Petersburg (2016). (In Rus.)
Zemskaya, E.A. (ed.): Conversational Russian Speech. Nauka, Moscow (1973). (In Rus.)
Jurafski, D., Bell, A., Gregory, M., Raymond, W.D.: Probabilistic relations between words: evidence from reduction in lexical production. In: Bybee, J., Hopper, P. (eds.) Frequency and the Emergence of Linguistic, pp. 229–254. John Benjamins, Philadelphia (2000)
Jurafsky, D., Martin, J.H.: Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition. Prentice Hall, Upper Saddle River (2000)
Schiel, F., Stevens, M., Reichel, U., Cutugno F.: Machine learning of probabilistic phonological pronunciation rules from the Italian CLIPS corpus. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp. 1414–1418 (2013). https://doi.org/10.5282/ubm/epub.18046
Vainio, M.: Phonetics and machine learning: hierarchical modelling of prosody in statistical speech synthesis. In: Besacier, L., Dediu, A.-H., MartÃn-Vide, C. (eds.) SLSP 2014. LNCS (LNAI), vol. 8791, pp. 37–54. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-11397-5_3
Seide, F., Li, G., Yu, D.: Conversational speech transcription using context-dependent deep neural networks. In: Proceedings of Interspeech, pp. 437–440 (2011)
Ellis, D.P.W., Singh, R., Sivadas, S.: Tandem acoustic modeling in large-vocabulary recognition. In: IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings, Salt Lake City, USA, vol. 1, pp. 517–520 (2001). https://doi.org/10.1109/ICASSP.2001.940881
Dahl, G., Yu, D., Deng, L., Acero, A.: Context-dependent pre-trained deep neural networks for large vocabulary speech recognition. IEEE Trans. Audio Speech Lang. Process. 20(1), 30–42 (2012). https://doi.org/10.1109/TASL.2011.2134090
Zhang, Y., et al.: Towards end-to-end speech recognition with deep convolutional neural networks. In: Proceedings of Interspeech, pp. 410–414 (2016). https://doi.org/10.21437/Interspeech.2016-1446
Ravanelli, M., Serdyuk, D., Bengio, Y.: Twin regularization for online speech recognition. In: Karpov, A., Potapova, R., Mporas, I. (eds.) Proceedings of Interspeech (2018). https://arxiv.org/pdf/1804.05374.pdf. Accessed 28 May 2020. https://doi.org/10.21437/Interspeech.2018-1407
Maas, L., Xie, Z., Jurafsky, D., Ng, A.Y.: Lexicon-free conversational speech recognition with neural networks. In: Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 345–354. Curran Associates, New York (2015). https://doi.org/10.3115/v1/N15-1038
Mizera, P., Pollak, P.: Improving of LVCSR for causal czech using publicly available language resources. In: Karpov, A., Potapova, R., Mporas, I. (eds.) SPECOM 2017. LNCS (LNAI), vol. 10458, pp. 427–437. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-66429-3_42
Kipyatkova, I.: Improving Russian LVCSR using deep neural networks for acoustic and language modeling. In: Karpov, A., Jokisch, O., Potapova, R. (eds.) SPECOM 2018. LNCS (LNAI), vol. 11096, pp. 291–300. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-99579-3_31
Markovnikov, N.M., Kipyatkova, I.S.: Researching methods of building encoder-decoder models for Russian speech. Data Manage. Syst. 4, 45–53 (2019). (in Russian)
Riekhakaynen, E.: Corpora of Russian spontaneous speech as a tool for modelling natural speech production and recognition. In: 10th Annual Computing and Communication Workshop and Conference, CCWC 2020, January 2020, pp. 406–411. IEEE, Las Vegas (2020). https://doi.org/10.1109/CCWC47524.2020.9031251
Ventsov, A.V., Grudeva, E.V.: A Frequency Dictionary of Russian. CHSU Publishing House, Cherepovets (2008). (in Russian)
Shcherba, L.V.: About parts of speech in the Russian language. In: Language System and Speech Behaviour, pp. 77–100. Nauka, Leningrad (1974). (in Russian)
Harrington, P.: Machine Learning in Action. Manning Publications, New York (2012)
Acknowledgements
The research is supported by the grant #19-012-00629 from the Russian Foundation for Basic Research.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Dayter, M., Riekhakaynen, E. (2020). Automatic Prediction of Word Form Reduction in Russian Spontaneous Speech. In: Karpov, A., Potapova, R. (eds) Speech and Computer. SPECOM 2020. Lecture Notes in Computer Science(), vol 12335. Springer, Cham. https://doi.org/10.1007/978-3-030-60276-5_12
Download citation
DOI: https://doi.org/10.1007/978-3-030-60276-5_12
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-60275-8
Online ISBN: 978-3-030-60276-5
eBook Packages: Computer ScienceComputer Science (R0)