Affisix: Tool for Prefix Recognition

  • Jaroslava Hlaváčová
  • Michal Hrušecký
Part of the Lecture Notes in Computer Science book series (LNCS, volume 5246)

Abstract

We present Affisix, a software tool for the automatic recognition of prefixes. Given an extensive list of words in a language, it identifies segments that are candidates for prefixes. Two recognition methods are implemented: the entropy method and the squares method. We briefly describe both methods, propose improvements to them, and present the results of experiments with Czech.
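
The abstract names the entropy method but does not define it. As a rough illustration only, the sketch below uses one common formulation of the idea: for every word-initial segment in a word list, compute the entropy of the letter that follows it, and treat segments with high entropy as prefix candidates. The function names, the `max_len` limit and the `threshold` value are assumptions made for this example; they are not taken from the paper or from the Affisix code.

```python
import math
from collections import Counter, defaultdict


def successor_entropy(words, max_len=4):
    """Map each word-initial segment to the entropy (in bits) of the next letter.

    Hypothetical helper: illustrates a generic successor-entropy measure,
    not the exact definition used by Affisix.
    """
    followers = defaultdict(Counter)
    for word in set(words):  # count word types, not tokens
        # A segment of length i needs at least one letter after it.
        for i in range(1, min(max_len, len(word) - 1) + 1):
            followers[word[:i]][word[i]] += 1

    entropy = {}
    for segment, counts in followers.items():
        total = sum(counts.values())
        entropy[segment] = -sum(
            (c / total) * math.log2(c / total) for c in counts.values()
        )
    return entropy


def prefix_candidates(words, threshold=1.5, max_len=4):
    """Return segments whose successor entropy reaches the (assumed) threshold."""
    scores = successor_entropy(words, max_len)
    selected = [s for s, h in scores.items() if h >= threshold]
    # Highest entropy first; ties broken alphabetically.
    return sorted(selected, key=lambda s: (-scores[s], s))


if __name__ == "__main__":
    # Toy word list with diacritics dropped for simplicity.
    sample = ["nedobry", "nemaly", "nevelky", "nehezky"]
    print(prefix_candidates(sample))
```

In the toy list, the segment "ne" is followed by four different letters, so its entropy is high, while "n" is always followed by "e" and scores zero. A real run would need a large word list, as the abstract notes, and a threshold tuned to the language.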

Copyright information

© Springer-Verlag Berlin Heidelberg 2008

Authors and Affiliations

  • Jaroslava Hlaváčová (1)
  • Michal Hrušecký (1)
  1. Charles University in Prague, ÚFAL MFF, Czech Republic
