Skip to main content

Macro Episodes of Russian Everyday Oral Communication: Towards Pragmatic Annotation of the ORD Speech Corpus

  • Conference paper
  • First Online:
Speech and Computer (SPECOM 2015)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9319))

Included in the following conference series:

Abstract

The ORD corpus is a representative resource of everyday spoken Russian that contains about 1000 h of long-term audio recordings of daily communication made in real settings by research volunteers. ORD macro episodes are the large communication episodes united by setting/scene of communication, social roles of participants and their general activity. The paper describes annotation principles used for tagging of macro episodes, provides current statistics on communication situations presented in the corpus and reveals their most common types. Annotation of communication situations allows using these codes as filters for selection of audio data, therefore making it possible to study Russian everyday speech in different communication situations, to determine and describe various registers of spoken Russian. As an example, several high frequency word lists referring to different communication situations are compared. Annotation of macro episodes that is made for the ORD corpus is a prerequisite for its further pragmatic annotation.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    For more information on how to use frequency word list structure in study of language styles see [8].

References

  1. Asinovsky, A., Bogdanova, N., Rusakova, M., Ryko, A., Stepanova, S., Sherstinova, T.: The ORD speech corpus of Russian everyday communication “One Speaker’s Day”: creation principles and annotation. In: Matoušek, V., Mautner, P. (eds.) TSD 2009. LNCS, vol. 5729, pp. 250–257. Springer, Heidelberg (2009)

    Google Scholar 

  2. Sherstinova, T.: Communikativnyje macroepizody v korpuse povsednevnoj russkoj rechi “Odin rechevoj den\(\text{' }\,\)”: principy annotirovanija i rezul’taty statisticheskoj obrabotki. In: Zakharov, V., Mitrofanova, O., Khokhlova, M. (eds.) Proceeding of the International Conference “Corpus linguistics-2013”, pp. 449–456. St. Petersburg State University, St. Petersburg (2013)

    Google Scholar 

  3. Sherstinova, T.: Pragmaticheskoe annotirovanie konnunicativnykh jedinic v korpuse ORD: mikroepisody i rechevye akty. In: Proceeding of the International Conference “Corpus linguistics-2015”, pp. 436–446 (2015) (in Russian)

    Google Scholar 

  4. Potapova, R.K.: Rech: kommunikacija, informatika, kibernetika. URSS, Moscow (2003)

    Google Scholar 

  5. Sherstinova, T.: The structure of the ORD speech corpus of Russian everyday communication. In: Matoušek, V., Mautner, P. (eds.) TSD 2009. LNCS, vol. 5729, pp. 258–265. Springer, Heidelberg (2009)

    Google Scholar 

  6. Chebanov, S., Martynenko, G.: Semiotika opisatel’nykh tekstov: tipologicheskij aspekt. St. Peterburg State University, St. Petersburg (1999)

    Google Scholar 

  7. Ottenheimer, H.J.: The Anthropology of Language: An Introduction to Linguistic Anthropology. Wadsworth Cenage Learning, Belmont, CA (2006)

    Google Scholar 

  8. Martynenko, G.: Osnovy stilemetrii. Leningrad State University, Leningrad (1988)

    Google Scholar 

Download references

Acknowledgements

The annotation principles for macro episodes tagging have been developed with support of the Russian Foundation for Humanities (project # 12-04-12017, “Information System of Communication Scenarios of Russian Spontaneous Speech”). The presented statistics were obtained within the framework of the project “Everyday Russian Language in Different Social Groups” supported by the Russian Scientific Foundation, project # 14-18-02070.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Tatiana Sherstinova .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Sherstinova, T. (2015). Macro Episodes of Russian Everyday Oral Communication: Towards Pragmatic Annotation of the ORD Speech Corpus. In: Ronzhin, A., Potapova, R., Fakotakis, N. (eds) Speech and Computer. SPECOM 2015. Lecture Notes in Computer Science(), vol 9319. Springer, Cham. https://doi.org/10.1007/978-3-319-23132-7_33

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-23132-7_33

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-23131-0

  • Online ISBN: 978-3-319-23132-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics