Dependency Graphs and TEITOK: Exploiting Dependency Parsing

Janssen, Maarten

doi:10.1007/978-3-319-99722-3_47

Maarten Janssen

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 11122)

Included in the following conference series:

International Conference on Computational Processing of the Portuguese Language

Abstract

This article describes a set of modules and functions added to the TEITOK corpus environment that turn TEITOK into a full environment for working with dependency-parsed corpora. They allow users to parse documents, correct parsing errors, visualize parse results, and search the corpus with a modified version of the CQL query language that can exploit dependency relations.
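
To make the idea of a search that exploits dependency relations concrete, here is a minimal Python sketch. It is not TEITOK's query syntax or implementation, which this page does not show; the five-column token layout (a simplified stand-in for CoNLL-U), the sample sentence, and the names `Token`, `parse_rows`, and `find_dependents` are all assumptions made for illustration.

```python
from dataclasses import dataclass

# Hypothetical toy sentence in a simplified layout: ID FORM UPOS HEAD DEPREL.
# Real CoNLL-U files have ten tab-separated columns; this is a reduced stand-in.
SAMPLE = """\
1 O DET 2 det
2 gato NOUN 3 nsubj
3 come VERB 0 root
4 o DET 5 det
5 peixe NOUN 3 obj
"""


@dataclass
class Token:
    idx: int      # position in the sentence, starting at 1
    form: str     # surface form
    upos: str     # part-of-speech tag
    head: int     # idx of the head token, 0 for the root
    deprel: str   # dependency relation to the head


def parse_rows(text):
    """Turn the whitespace-separated rows into Token objects."""
    tokens = []
    for line in text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue  # skip blank lines and comment lines
        idx, form, upos, head, deprel = line.split()
        tokens.append(Token(int(idx), form, upos, int(head), deprel))
    return tokens


def find_dependents(tokens, head_upos, deprel):
    """Return (dependent form, head form) pairs where the dependent stands
    in relation `deprel` to a head whose POS tag is `head_upos`."""
    by_id = {t.idx: t for t in tokens}
    pairs = []
    for t in tokens:
        head = by_id.get(t.head)  # None for the root, whose head is 0
        if head is not None and head.upos == head_upos and t.deprel == deprel:
            pairs.append((t.form, head.form))
    return pairs


tokens = parse_rows(SAMPLE)
subject_pairs = find_dependents(tokens, "VERB", "nsubj")
print(subject_pairs)
```

The point of the sketch is that the condition spans two tokens linked by a head pointer rather than two adjacent positions, which is the kind of constraint a purely sequence-based token query cannot state directly.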

Author information

Authors and Affiliations

CELGA-ILTEC, Coimbra, Portugal
Maarten Janssen

Corresponding author

Correspondence to Maarten Janssen.

Editor information

Editors and Affiliations

Institute of Informatics, Federal University of Rio Grande do Sul, Porto Alegre, Brazil
Aline Villavicencio
Instituto de Informática - UFRGS, Porto Alegre, Brazil
Viviane Moreira
INESC-ID, Lisbon, Portugal
Alberto Abad
UFSCAR, Sao Carlos, Brazil
Helena Caseli
Centro Singular de Investigación en Tecnoloxías, Universidade de Santiago de Compostela, Santiago de Compostela, La Coruña, Spain
Pablo Gamallo
Université de Toulon, Parc Scientifique Technologique Luminy, Marseille, France
Carlos Ramisch
Centro de Informática e Sistemas, Universidade de Coimbra, Coimbra, Portugal
Hugo Gonçalo Oliveira
Federal University of Technology, Dois Vizinhos, Paraná, Brazil
Gustavo Henrique Paetzold

About this paper

Cite this paper

Janssen, M. (2018). Dependency Graphs and TEITOK: Exploiting Dependency Parsing. In: Villavicencio, A., et al. Computational Processing of the Portuguese Language. PROPOR 2018. Lecture Notes in Computer Science, vol 11122. Springer, Cham. https://doi.org/10.1007/978-3-319-99722-3_47

DOI: https://doi.org/10.1007/978-3-319-99722-3_47
Published: 26 August 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-99721-6
Online ISBN: 978-3-319-99722-3
eBook Packages: Computer Science, Computer Science (R0)
