Abstract
The paper describes a new tool Derivancze, which provides an information on derivational relations between Czech words. After a summary of linguistic descriptions of Czech derivation we present a structure of our data and types of derivational relations we use. We compare our approach and results with Czech lexical network DeriNet, in particular, we discuss many differences between the two approaches. Our tool presently works with Czech data only, but the solution is general and can be used also for other languages.
This work was supported by the Ministry of Education of CR within the Lindat Clarin Center.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Šojat, K., Srebačić, M., Tadić, M., Pavelić, T.: CroDeriV: a new resource for processing croatian morphology. In: Calzolari, N., et al. (eds.) Proceedings of LREC 2014. ELRA, Reykjavik (2014)
Pala, K.: Derivational relations in slavonic languages. In: Proceedings of the FASSBL 2008, pp. 21–28. Croatian Language Technologies Society, Zagreb (2008)
Dokulil, M., et al.: Mluvnice češtiny I (Grammar of Czech I). Academia, Praha (1986)
Ševčíková, M., Žabokrtský, Z.: Word-Formation network for czech. In: Calzolari, N., et al. (eds.) Proceedings of LREC 2014. ELRA, Reykjavik (2014)
Dokulil, M.: Teorie odvozování slov (Theory of the Word Derivation). Academia, Praha (1962)
Karlík, P., et al.: Příruční mluvnice češtiny (Reference Grammar of Czech). Nakladatelství Lidové noviny, Praha (1995)
Čechová, M., et al.: Čeština – řeč a jazyk (Czech – Speech and Language). ISV nakladatelství, Praha (2002)
Hlaváčková, D., Osolsobě, K., Pala, K., Šmerk, P.: Exploring derivational relations in czech with the deriv tool. In: NLP, Corpus Linguistics, Corpus Based Grammar Research, Bratislava, Slovakia, Tribun, pp. 152–161 (2009)
Cvrček, V., Vondřička, P.: Nástroj pro slovotvornou analýzu jazykového korpusu (A Tool for Word-Formation Analysis of a Language Corpus). In: Gramatika a korpus, Hradec Králové, Gaudeamus (2013)
Zeller, B., Padó, S., Šnajder, J.: Towards semantic validation of a derivational lexicon. In: Proceedings of COLING 2014: Technical Papers, Dublin City University and ACL, pp. 1728–1739 (2014)
Ústav Českého národního korpusu FF UK: Český národní korpus - SYN (Czech National Corpus - SYN), Praha (2014). http://www.korpus.cz (cited April 1, 2015)
Šmerk, P.: Tools for fast morphological analysis based on finite state automata. In: Recent Advances in Slavonic Natural Language Processing 2014, Brno, Tribun EU, pp. 147–150 (2014)
Jakubíček, M., Kovář, V., Šmerk, P.: Czech morphological tagset revisited. In: Recent Advances in Slavonic Natural Language Processing 2011, Brno, Tribun EU, pp. 29–42 (2011)
Nevěřilová, Z.: Paraphrase and Textual Entailment Generation in Czech. Computación y Sistemas 18 (2014)
Veber, M., Sedláček, R., Pala, K., Osolsobě, K.: A procedure for word derivational processes concerning lexicon extension in highly inflected languages. In: Proceedings of LREC 2002, Las Palmas de Gran Canaria, pp. 1254–1259. ELRA (2002)
Pala, K., Hlaváčková, D.: Derivational relations in czech WordNet. In: Proceedings of the Workshop on Balto-Slavonic Natural Language Processing, pp. 75–81. ACL, Praha (2007)
Horák, A., Smrž, P.: VisDic - Wordnet browsing and editing tool. In: Proceedings of GWC 2004, Brno, Czech Republic, Masaryk University, pp. 136–141 (2003)
Filipec, J., et al.: Slovník spisovné češtiny. Academia, Praha (1994)
Suchomel, V.: Recent czech web corpora. In: Recent Advances in Slavonic Natural Language Processing 2012, Brno, Tribun EU, pp. 77–83 (2012)
Šmerk, P.: Towards Morphological Disambiguation of Czech. PhD thesis proposals, Faculty of Informatics, Masaryk University, Brno (2007) (in Czech)
Hajič, J.: Disambiguation of Rich Inflection (Computational Morphology of Czech). Charles Univeristy Press, Prague, Czech Republic (2004)
Spoustová, D., Hajič, J., Votrubec, J., Krbec, P., Květoň, P.: The best of two worlds: Cooperation of statistical and rule-based taggers for czech. In: Proceedings of the Workshop on Balto-Slavonic Natural Language Processing, Prague, pp. 67–74. ACL (2007)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Pala, K., Šmerk, P. (2015). Derivancze — Derivational Analyzer of Czech. In: Král, P., Matoušek, V. (eds) Text, Speech, and Dialogue. TSD 2015. Lecture Notes in Computer Science(), vol 9302. Springer, Cham. https://doi.org/10.1007/978-3-319-24033-6_58
Download citation
DOI: https://doi.org/10.1007/978-3-319-24033-6_58
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-24032-9
Online ISBN: 978-3-319-24033-6
eBook Packages: Computer ScienceComputer Science (R0)