Abstract
This article describe a set of modules and functions added to the TEITOK corpus environment that turn TEITOK into a full environment for working with dependency parsed corpora, allow parsing document, correcting parsing errors, visualizing parse results, and searching the corpus with a modified version of the CQL query language that can exploit dependency relations.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Janssen, M.: TEITOK: text-faithful annotated corpora. In: Proceedings of the 10th International Conference on Language Resources and Evaluation, LREC 2016, pp. 4037–4043 (2016)
Straka, M., Straková, J.: Tokenizing, POS tagging, lemmatizing and parsing UD 2.0 with UDPipe. In: Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies, Vancouver, Canada, pp. 88–99. Association for Computational Linguistics, August 2017
Mendes, A., Antunes, S., Janssen, M., Gonçalves, A.: The COPLE2 corpus: a learner corpus for Portuguese. In: Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016), Paris, France. European Language Resources Association (ELRA), May 2016
Hardie, A.: CQPweb - combining power, flexibility and usability in a corpus analysis tool. Int. J. Corpus Linguist. 17(3), 380–409 (2012)
Krause, T., Zeldes, A.: ANNIS3: a new architecture for generic corpus query and visualization. Digit. Scholarsh. Humanit. 31(1), 118–139 (2016)
Zeldes, A., Schroeder, C.T.: An NLP pipeline for coptic. In: Proceedings of the 1st SIGHUM Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities (LaTeCH) (2016)
Tyers, F.M., Sheyanova, M., Washington, J.N.: UD annotatrix: an annotation tool for universal dependencies. In: Proceedings of the 16th International Workshop on Treebanks and Linguistic Theories (TLT16), pp. 10–17 (2018)
Nivre, J., et al.: Universal dependencies v1: a multilingual treebank collection. In: Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, Portorož, Slovenia, 23–28 May 2016
Evert, S., Hardie, A.: Twenty-first century corpus workbench: updating a query architecture for the new millennium. In: Corpus Linguistics 2011 (2011)
Knig, E., Lezius, W.: The TIGER language - a description language for syntax graphs - formal definition, May 2003
Kilgarriff, A., Tugwell, D.: Sketching words. In: Corréard, M.H. (ed.) Lexicography and Natural Language Processing: A Festschrift in Honour of B. T. S. Atkins. EURALEX, pp. 125–137 (2002)
Kilgarriff, A., et al.: The sketch engine: ten years on. Lexicography 1, 7–36 (2014)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Switzerland AG
About this paper
Cite this paper
Janssen, M. (2018). Dependency Graphs and TEITOK: Exploiting Dependency Parsing. In: Villavicencio, A., et al. Computational Processing of the Portuguese Language. PROPOR 2018. Lecture Notes in Computer Science(), vol 11122. Springer, Cham. https://doi.org/10.1007/978-3-319-99722-3_47
Download citation
DOI: https://doi.org/10.1007/978-3-319-99722-3_47
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-99721-6
Online ISBN: 978-3-319-99722-3
eBook Packages: Computer ScienceComputer Science (R0)