Skip to main content

Derivancze — Derivational Analyzer of Czech

  • Conference paper
  • First Online:
Text, Speech, and Dialogue (TSD 2015)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9302))

Included in the following conference series:

Abstract

The paper describes a new tool Derivancze, which provides an information on derivational relations between Czech words. After a summary of linguistic descriptions of Czech derivation we present a structure of our data and types of derivational relations we use. We compare our approach and results with Czech lexical network DeriNet, in particular, we discuss many differences between the two approaches. Our tool presently works with Czech data only, but the solution is general and can be used also for other languages.

This work was supported by the Ministry of Education of CR within the Lindat Clarin Center.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Šojat, K., Srebačić, M., Tadić, M., Pavelić, T.: CroDeriV: a new resource for processing croatian morphology. In: Calzolari, N., et al. (eds.) Proceedings of LREC 2014. ELRA, Reykjavik (2014)

    Google Scholar 

  2. Pala, K.: Derivational relations in slavonic languages. In: Proceedings of the FASSBL 2008, pp. 21–28. Croatian Language Technologies Society, Zagreb (2008)

    Google Scholar 

  3. Dokulil, M., et al.: Mluvnice češtiny I (Grammar of Czech I). Academia, Praha (1986)

    Google Scholar 

  4. Ševčíková, M., Žabokrtský, Z.: Word-Formation network for czech. In: Calzolari, N., et al. (eds.) Proceedings of LREC 2014. ELRA, Reykjavik (2014)

    Google Scholar 

  5. Dokulil, M.: Teorie odvozování slov (Theory of the Word Derivation). Academia, Praha (1962)

    Google Scholar 

  6. Karlík, P., et al.: Příruční mluvnice češtiny (Reference Grammar of Czech). Nakladatelství Lidové noviny, Praha (1995)

    Google Scholar 

  7. Čechová, M., et al.: Čeština – řeč a jazyk (Czech – Speech and Language). ISV nakladatelství, Praha (2002)

    Google Scholar 

  8. Hlaváčková, D., Osolsobě, K., Pala, K., Šmerk, P.: Exploring derivational relations in czech with the deriv tool. In: NLP, Corpus Linguistics, Corpus Based Grammar Research, Bratislava, Slovakia, Tribun, pp. 152–161 (2009)

    Google Scholar 

  9. Cvrček, V., Vondřička, P.: Nástroj pro slovotvornou analýzu jazykového korpusu (A Tool for Word-Formation Analysis of a Language Corpus). In: Gramatika a korpus, Hradec Králové, Gaudeamus (2013)

    Google Scholar 

  10. Zeller, B., Padó, S., Šnajder, J.: Towards semantic validation of a derivational lexicon. In: Proceedings of COLING 2014: Technical Papers, Dublin City University and ACL, pp. 1728–1739 (2014)

    Google Scholar 

  11. Ústav Českého národního korpusu FF UK: Český národní korpus - SYN (Czech National Corpus - SYN), Praha (2014). http://www.korpus.cz (cited April 1, 2015)

  12. Šmerk, P.: Tools for fast morphological analysis based on finite state automata. In: Recent Advances in Slavonic Natural Language Processing 2014, Brno, Tribun EU, pp. 147–150 (2014)

    Google Scholar 

  13. Jakubíček, M., Kovář, V., Šmerk, P.: Czech morphological tagset revisited. In: Recent Advances in Slavonic Natural Language Processing 2011, Brno, Tribun EU, pp. 29–42 (2011)

    Google Scholar 

  14. Nevěřilová, Z.: Paraphrase and Textual Entailment Generation in Czech. Computación y Sistemas 18 (2014)

    Google Scholar 

  15. Veber, M., Sedláček, R., Pala, K., Osolsobě, K.: A procedure for word derivational processes concerning lexicon extension in highly inflected languages. In: Proceedings of LREC 2002, Las Palmas de Gran Canaria, pp. 1254–1259. ELRA (2002)

    Google Scholar 

  16. Pala, K., Hlaváčková, D.: Derivational relations in czech WordNet. In: Proceedings of the Workshop on Balto-Slavonic Natural Language Processing, pp. 75–81. ACL, Praha (2007)

    Google Scholar 

  17. Horák, A., Smrž, P.: VisDic - Wordnet browsing and editing tool. In: Proceedings of GWC 2004, Brno, Czech Republic, Masaryk University, pp. 136–141 (2003)

    Google Scholar 

  18. Filipec, J., et al.: Slovník spisovné češtiny. Academia, Praha (1994)

    Google Scholar 

  19. Suchomel, V.: Recent czech web corpora. In: Recent Advances in Slavonic Natural Language Processing 2012, Brno, Tribun EU, pp. 77–83 (2012)

    Google Scholar 

  20. Šmerk, P.: Towards Morphological Disambiguation of Czech. PhD thesis proposals, Faculty of Informatics, Masaryk University, Brno (2007) (in Czech)

    Google Scholar 

  21. Hajič, J.: Disambiguation of Rich Inflection (Computational Morphology of Czech). Charles Univeristy Press, Prague, Czech Republic (2004)

    Google Scholar 

  22. Spoustová, D., Hajič, J., Votrubec, J., Krbec, P., Květoň, P.: The best of two worlds: Cooperation of statistical and rule-based taggers for czech. In: Proceedings of the Workshop on Balto-Slavonic Natural Language Processing, Prague, pp. 67–74. ACL (2007)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Pavel Šmerk .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Pala, K., Šmerk, P. (2015). Derivancze — Derivational Analyzer of Czech. In: Král, P., Matoušek, V. (eds) Text, Speech, and Dialogue. TSD 2015. Lecture Notes in Computer Science(), vol 9302. Springer, Cham. https://doi.org/10.1007/978-3-319-24033-6_58

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-24033-6_58

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-24032-9

  • Online ISBN: 978-3-319-24033-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics