Next Step in Online Querying and Visualization of Word-Formation Networks
- 1 Citations
- 377 Downloads
Abstract
In this paper, we introduce a new and improved version of DeriSearch, a search engine and visualizer for word-formation networks.
Word-formation networks are datasets that express derivational, compounding and other word-formation relations between words. They are usually expressed as directed graphs, in which nodes correspond to words and edges to the relations between them. Some networks also add other linguistic information, such as morphological segmentation of the words or identification of the processes expressed by the relations.
Networks for morphologically rich languages with productive derivation or compounding have large connected components, which are difficult to visualize. For example, in the network for Czech, DeriNet 2.0, connected components over 500 words large contain
Open image in new window
of the vocabulary, including its most common parts. In the network for Latin, Word Formation Latin, over 10 000 words (
Open image in new window
of the vocabulary) are in a single connected component.
With the recent release of the Universal Derivations collection of word-formation networks for several languages, there is a need for a searching and visualization tool that would allow browsing such complex data.
Keywords
Derivational morphology Word formation Graph visualization Search engineNotes
Acknowledgments
This work was supported by the Grant No. GA19-14534S of the Czech Science Foundation, by the Charles University Grant Agency project No. 1176219 and by the SVV project No. 260 575. It uses language resources developed, stored, and distributed by the LINDAT/CLARIAH CZ project (LM2015071, LM2018101).
References
- 1.Christ, O., Schulze, B.M., Hofmann, A., König, E.: The IMS Corpus Workbench: Corpus Query Processor (CQP) User’s Manual. University of Stuttgart, Germany (1999)Google Scholar
- 2.Culy, C., Litta, E., Passarotti, M.: Visual exploration of Latin derivational morphology. In: Proceedings of FLAIRS 2017, pp. 601–606 (2017)Google Scholar
- 3.Horák, A., Pala, K., Rambousek, A., Povolný, M.: DEBVisDic - first version of new client-server Wordnet browsing and editing tool. In: Proceedings of the Third International WordNet Conference (GWC 2006), pp. 325–328 (2005)Google Scholar
- 4.Kyjánek, L.: Morphological resources of derivational word-formation relations. Technical report ÚFAL TR-2018-61, ÚFAL MFF UK, Prague, Czechia (2018)Google Scholar
- 5.Kyjánek, L., Žabokrtský, Z., Ševčíková, M., Vidra, J.: Universal derivations kickoff: a collection of harmonized derivational resources for eleven languages. In: Proceedings of DeriMo 2019, Prague, Czechia, pp. 101–110 (2019)Google Scholar
- 6.Litta, E., Passarotti, M., Culy, C.: Formatio formosa est. Building a word formation lexicon for Latin. In: Proceedings of CLiC-IT 2016, pp. 185–189 (2016)Google Scholar
- 7.Pala, K., Šmerk, P.: Derivancze — derivational analyzer of Czech. In: Král, P., Matoušek, V. (eds.) TSD 2015. LNCS (LNAI), vol. 9302, pp. 515–523. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-24033-6_58CrossRefGoogle Scholar
- 8.Panocová, R.: Internationalisms with the suffix -ácia and their adaptation in Slovak. In: Proceedings of DeriMo 2017, Milano, Italy, pp. 129–139 (2017)Google Scholar
- 9.Rambousek, A., Horák, A., Klement, D., Kletečka, J.: New features in DEBVisDic for WordNet visualization and user feedback. In: Proceedings of RASLAN 2017 (2017)Google Scholar
- 10.Šojat, K., Srebačić, M., Tadić, M., Pavelić, T.: CroDeriV: a new resource for processing Croatian morphology. In: Proceedings of LREC 2014 (2014)Google Scholar
- 11.Talamo, L., Celata, C., Bertinetto, P.M.: DerIvaTario: an annotated lexicon of Italian derivatives. Word Struct. 9(1), 72–102 (2016)CrossRefGoogle Scholar
- 12.Vidra, J.: Implementation of a search engine for DeriNet. In: Proceedings of ITAT 2015, Prague, Czechia, pp. 100–106 (2015)Google Scholar
- 13.Vidra, J., Žabokrtský, Z.: Online software components for accessing derivational networks. In: Proceedings of DeriMo 2017, Milano, Italy, pp. 129–139 (2017)Google Scholar
- 14.Vidra, J., Žabokrtský, Z., Ševčíková, M., Kyjánek, L.: Derinet 2.0: towards an all-in-one word-formation resource. In: Proceedings of DeriMo 2019, Prague, Czechia (2019)Google Scholar