Prefix Recognition Experiments

  • Jaroslava Hlaváčová
  • Michal Hrušecký
Part of the Lecture Notes in Computer Science book series (LNCS, volume 6836)


This paper compares automatic methods for prefix extraction. We present experiments on Czech and English and compare the results with respect to the size and type (word forms vs. lemmas) of the input data.


Keywords: Initial Segment, Rank Score, Entropy Method, Naive Approach, Recognition Experiment
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.





Copyright information

© Springer-Verlag Berlin Heidelberg 2011

Authors and Affiliations

  • Jaroslava Hlaváčová (1)
  • Michal Hrušecký (1)
  1. ÚFAL MFF, Charles University, Prague, Czech Republic
