Morphological Analysis Using Linguistically Motivated Decomposition of Unknown Words

  • Stephan Bopp
  • Sandro Pedrazzini
Part of the Communications in Computer and Information Science book series (CCIS, volume 41)

Abstract

Integrating the decomposition of unknown morphologically complex words can enhance the recognition rates of morphological analyzers. Using linguisti cally motivated strategies for this decomposition leads to even more expressive re sults. The approach described here uses word formation rules and filtering tech niques to analyze and decompose words that are not contained in the underlying dictionary database. The average recognition rate of our German analyzers, applied to our test corpus, increased from 91% to 95,4%. Together with the current implementation, further future decomposition strategies will be presented.

Keywords

Morphological analysis controlled word decomposition finite state tools 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Abel, A.: ELDIT (Elektronisches Lernerwörterbuch Deutsch-Italienisch) und elexiko: Ein Vergleich. In: Klosa, A. (ed.) Lexikografische Portale im Internet (= OPAL-Sonderheft 1/2008, hrsg. vom Institut für Deutsche Sprache Mannheim), pp. 175–189. Mannheim (2008)Google Scholar
  2. 2.
    Canoo.net: German dictionaries and grammar, http://www.canoo.net
  3. 3.
    DIX: Deutsch-Spanisch Wörterbuch, http://dix.osola.com/
  4. 4.
    Domenig, M., ten Hacken, P.: Word Manager: A System for Morphological Dictionaries. Georg Olms Verlag, Hildesheim (1992)Google Scholar
  5. 5.
  6. 6.
    ten Hacken, P., Domenig, M.: Reusable Dictionaries for NLP: The Word Manager Approach. Lexicology 2, 232–255 (1996)Google Scholar
  7. 7.
  8. 8.
    Lüdeling, A., Fitschen, A.: An Integrated Lexicon for the Analysis of Complex Words. In: Proceedings of EURALEX 2002, Copenhagen (2002)Google Scholar
  9. 9.
  10. 10.
    Pedrazzini, S.: Periphrastic Inflection Clustering for Term Extraction. In: Proceedings of the Seventh International Symposium on Communication and Applied Linguistics, Editorial Oriente, Santiago de Cuba (2001)Google Scholar
  11. 11.
    Pons: Das Online-Wörterbuch in fünf Sprachen, http://www.pons.eu

Copyright information

© Springer-Verlag Berlin Heidelberg 2009

Authors and Affiliations

  • Stephan Bopp
    • 1
  • Sandro Pedrazzini
    • 1
  1. 1.Canoo Engineering AGBaselSwitzerland

Personalised recommendations