Abstract
A dictionary-free morphological classifier of nouns is developed for Russian, a highly inflective language. The classifier serves as a front-end utility for building a very large database of Russian collocations and WordNet-like semantic links. For its main functions, it relies on the final letters of standard noun forms together with extensive morphological and lexical data. In standalone mode it currently classifies 99.65% of nouns correctly. Completely error-free performance is impossible in principle for context-free methods, primarily because of homonymy: homonymous nouns with different senses may decline in different ways. The classifier’s results are therefore additionally tested against more than 200,000 collocations stored in the database and, when necessary, automatically corrected.
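To make the idea of classifying by final letters concrete, the following is a minimal sketch in Python, not the authors' implementation. It assumes a tiny hand-written table of word endings and picks the longest ending that matches the nominative singular form. The rule table, the class labels, and the function name `classify_noun` are all hypothetical and chosen only for illustration. The real classifier draws on far more extensive morphological and lexical data.

```python
# Toy longest-suffix classifier for Russian nouns in the nominative singular.
# ASSUMPTION: the endings and class labels below are illustrative only and
# are not the rule set or the class inventory used in the paper.
SUFFIX_RULES = {
    "ость": "fem-3rd-declension",   # e.g. радость
    "ие": "neuter-ie",              # e.g. здание
    "ия": "fem-iya",                # e.g. армия
    "мя": "neuter-mya",             # e.g. время
    "а": "1st-declension-hard",     # e.g. книга
    "я": "1st-declension-soft",     # e.g. неделя
    "о": "neuter-hard",             # e.g. окно
    "е": "neuter-soft",             # e.g. море
    "й": "masc-j",                  # e.g. музей
    "ь": "ambiguous-soft-sign",     # masculine soft or feminine 3rd declension
}
DEFAULT_CLASS = "masc-hard"         # consonant-final nouns, e.g. стол
MAX_SUFFIX_LEN = max(len(suffix) for suffix in SUFFIX_RULES)


def classify_noun(noun: str) -> str:
    """Return the label of the longest matching ending, or the default."""
    word = noun.strip().lower()
    # Try the longest candidate ending first so that specific rules
    # (such as "ость") take priority over general ones (such as "ь").
    for length in range(min(MAX_SUFFIX_LEN, len(word)), 0, -1):
        label = SUFFIX_RULES.get(word[-length:])
        if label is not None:
            return label
    return DEFAULT_CLASS


if __name__ == "__main__":
    for example in ["радость", "здание", "время", "день", "стол"]:
        print(example, classify_noun(example))
```

A table this small misclassifies many words; for instance, it would assign the masculine noun гость to the feminine class because of the ending "ость". Cases like this suggest why ending-based rules need to be backed by additional lexical data, and why context-free results benefit from a later check against collocations.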
Work done under partial support of the Mexican Government (CONACyT, SNI, SIP-IPN) and the Russian Foundation of Fundamental Research (RFFI, grant 06-01-00571). Many thanks to Steve Legrand for good advice and proofreading.
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Bolshakov, I.A., Bolshakova, E.I. (2006). Dictionary-Free Morphological Classifier of Russian Nouns. In: Salakoski, T., Ginter, F., Pyysalo, S., Pahikkala, T. (eds) Advances in Natural Language Processing. FinTAL 2006. Lecture Notes in Computer Science, vol 4139. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11816508_25
DOI: https://doi.org/10.1007/11816508_25
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-37334-6
Online ISBN: 978-3-540-37336-0
eBook Packages: Computer Science (R0)