Grammatical Agreement and Automatic Morphological Disambiguation of Inflectional Languages
This paper describes a part of one of the most important syntactic subsystems present in many inflectional languages — grammatical agreement — from the viewpoint of automatic morphological disambiguation of such languages. One of the languages on which the main ideas will be demonstrated is Czech which — due to its morphological and syntactic complexity — can be regarded as a representative of the inflectional subgroup of the Slavic language family. It will be shown that notwithstanding the intricacies of the syntax of Czech a deeper understanding of the nature of grammatical agreement can result in the development of surface syntax rules which can considerably contribute to solving the problem of automatic morphological disambiguation of texts stored in Czech corpora. Although the language being studied is only Czech the ideas presented seem to be applicable, mutatis mutandis, also to the morphological disambiguation of a si-milar type of languages, especially the Slavic ones.
KeywordsNominal Group Prepositional Phrase Nominative Case Slavic Language Nominative Subject
Unable to display preview. Download preview PDF.
- 1.Czech National Corpus. Faculty of Arts, Charles University. http://ucnk.ff.cuni.cz.
- 2.Hajič, J., Hladká, B.: Probabilistic and Rule-Based Tagger of an Inflective Language — a Comparison. Proceedings of the Fifth Conference on Applied Natural Language Processing. Washington D.C. (1997).Google Scholar
- 3.Hladká, B.: Czech Language Tagging. PhD Thesis. MFF UK (2000).Google Scholar
- 4.Oliva, K., Hnátková, M., Petkevič, V., Květoň, P.: The Linguistic Basis of a Rule-Based Tagger of Czech. Text, Speech and Dialogue. Proceedings of the Third InternationalWorkshop, TSD 2000, LNAI 1902. Brno 2000, 3–8.Google Scholar
- 5.Hajič, J.: Disambiguation of Rich Inflection (Computational Morphology of Czech), Vol. 1. Karolinum Charles University Press, Prague. In press.Google Scholar