The “One Day of Speech” Corpus: Phonetic and Syntactic Studies of Everyday Spoken Russian

  • Conference paper
Speech and Computer (SPECOM 2015)

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 9319)

Abstract

The studies described in this paper are based on the ORD (“One Day of Speech”) corpus of everyday Russian speech, which contains long-term audio recordings of daily communication. The ORD corpus provides rich authentic material for research on the phonetics and syntax of spoken Russian and may be used to tune and improve speech synthesis and recognition systems. Current phonetic investigations of the ORD corpus cover temporal studies, speech reduction, the phonetic realization of words and affixes, phonetic errors and mondegreens, rhythm structures, and hesitation phenomena. Syntactic studies deal primarily with the linear word order of syntactic groups, the syntactic complexity of spoken utterances, and specific syntactic phenomena of spontaneous speech. In this paper, we summarize the main achievements of the phonetic and syntactic studies based on the ORD corpus and outline some directions for further investigation.

Acknowledgements

This research was carried out within the framework of the project “Everyday Russian Language in Different Social Groups”, supported by the Russian Scientific Foundation (project no. 14-18-02070).

Author information

Corresponding author

Correspondence to Tatiana Sherstinova.

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Bogdanova-Beglarian, N., Martynenko, G., Sherstinova, T. (2015). The “One Day of Speech” Corpus: Phonetic and Syntactic Studies of Everyday Spoken Russian. In: Ronzhin, A., Potapova, R., Fakotakis, N. (eds) Speech and Computer. SPECOM 2015. Lecture Notes in Computer Science, vol 9319. Springer, Cham. https://doi.org/10.1007/978-3-319-23132-7_53

  • DOI: https://doi.org/10.1007/978-3-319-23132-7_53

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-23131-0

  • Online ISBN: 978-3-319-23132-7

  • eBook Packages: Computer Science, Computer Science (R0)
