Statistical Word Sense Disambiguation in Contexts for Russian Nouns Denoting Physical Objects

  • Olga Mitrofanova
  • Olga Lashevskaya
  • Polina Panicheva
Part of the Lecture Notes in Computer Science book series (LNCS, volume 5246)

Abstract

The paper presents experimental results on automatic word sense disambiguation (WSD). Contexts for polysemous and/or homonymic Russian nouns denoting physical objects serve as an empirical basis of the study. Sets of contexts were extracted from the Russian National Corpus (RNC). Machine learning software for WSD was developed within the framework of the project. WSD tool used in experiments is aimed at statistical processing and classification of noun contexts. WSD procedure was performed taking into account lexical markers of word meanings in contexts and semantic annotation of contexts. Sets of experiments allowed to define optimal conditions for WSD in Russian texts.

Keywords

WSD Russian corpora 

References

  1. 1.
    Agirre, E., Edmonds, Ph. (eds.): Word Sense Disambiguation: Algorithms and Applications. Text, Speech and Language Technology, vol. 33. Springer, Berlin (2007)Google Scholar
  2. 2.
    Lukaševič, N.V., Čujko, D.S.: Avtomatičeskoje razrešenije leksičeskoj mnogoznačnosti na baze tezaurusnyh znanij. In: Internet-matematika 2007, pp. 108–117. Ekaterinburg (2007)Google Scholar
  3. 3.
    Rahilina, E.V., Kobricov, B.P., Kustova, G.I., L’aševskaja, O.N., Šemanajeva Ju, O.: Mnogoznačnost’ kak prikladnaja problema: leksiko-semantičeskaja razmetka v Nacional’nom korpuse russkogo jazyka. In: Kompjuternaja lingvistika i intellektual’nyje tehnologii: Trudy meždunarodnoj konferencii Dialog 2006, Moscow, pp. 445–450 (2006)Google Scholar
  4. 4.
    Azarova, I.V., Marina, A.S.: Avtomatizirovannaja klassifikacija kontekstov pri podgotovke dannyh dl’a kompjuternogo tezaurusa RussNet. In: Kompjuternaja lingvistika i intellektual’nyje tehnologii: Trudy meždunarodnoj konferencii Dialog 2006, Moscow, pp. 13–17 (2006)Google Scholar
  5. 5.
    Kobricov, B.P., L’aševskaja, O.N., Šemanajeva, O., Ju, O.: Sn’atije leksiko-semantičeskoj omonimii v novostnyh i gazteno-žurnal’nyh tekstah: poverhnostnyje fil’try i statističeskaja ocenka. In: Internet-matematika 2005: Avtomatičeskaja obrabotka web-dannyh, Moscow, pp. 38–57 (2005)Google Scholar
  6. 6.
    Toldova, S.J., Kustova, G.I., L’aševskaja, O.N.: Semantičeskije fil’try dl’a razrešenija mnogoznačnosti v nacional’nom korpuse russkogo jazyka: glagoly. In: Kompjuternaja lingvistika i intellektual’nyje tehnologii: Trudy meždunarodnoj konferencii Dialog 2008, Moscow, pp. 522–529 (2008)Google Scholar
  7. 7.
    Mitrofanova, O., Mukhin, A., Panicheva, P., Savitsky, V.: Automatic Word Clustering in Russian Texts. In: Matoušek, V., Mautner, P. (eds.) TSD 2007. LNCS (LNAI), vol. 4629, pp. 85–91. Springer, Heidelberg (2007)CrossRefGoogle Scholar
  8. 8.
    L’aševskaja, O.N., Sharoff, S.A.: Častotnyj slovar’ nacional’nogo korpusa russkogo jazyka: koncepcija i tehnologija sozdanija. In: Kompjuternaja lingvistika i intellektual’nyje tehnologii: Trudy meždunarodnoj konferencii Dialog 2008, Moscow, pp. 345–351 (2008)Google Scholar
  9. 9.
    Čermák, F., Křen, M.: Large Corpora, Lexical Frequencies and Coverage of Texts. In: Proceedings of the Corpus Linguistics Conference, Birmingham, July 14–17 (2005), http://www.corpus.bham.ac.uk/PCLC/CermakKren05.doc
  10. 10.
    Pala, K.: Word Sketches and Semantic Roles // Trudy meždunarodnoj konferencii Korpusnaja Lingvistika – 2006, pp. 307–317. St. Petersburg (2006)Google Scholar
  11. 11.
    Mitrofanova, O., Belik, V., Kadina, V.: Corpus Analysis of Selectional Preferences in Russian. In: Levická, J., Garabík, R. (eds.) Computer Treatment of Slavic and East European Languages: Proceedings of the Fourth International Seminar SLOVKO 2007, Bratislava, Slovakia, October 25–27, 2007, pp. 176–182 (2007)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2008

Authors and Affiliations

  • Olga Mitrofanova
    • 1
  • Olga Lashevskaya
    • 2
  • Polina Panicheva
    • 1
  1. 1.Department of Mathematical Linguistics Faculty of Philology and ArtsSt. Petersburg State UniversitySt. PetersburgRussia
  2. 2.Institute of the Russian Language MoscowRussia

Personalised recommendations