Dependency Graphs and TEITOK: Exploiting Dependency Parsing

  • Conference paper
Computational Processing of the Portuguese Language (PROPOR 2018)

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 11122)

Abstract

This article describes a set of modules and functions added to the TEITOK corpus environment that turn TEITOK into a full environment for working with dependency-parsed corpora. They allow users to parse documents, correct parsing errors, visualize parse results, and search the corpus with a modified version of the CQL query language that can exploit dependency relations.

Author information

Corresponding author

Correspondence to Maarten Janssen.

Copyright information

© 2018 Springer Nature Switzerland AG

About this paper

Cite this paper

Janssen, M. (2018). Dependency Graphs and TEITOK: Exploiting Dependency Parsing. In: Villavicencio, A., et al. Computational Processing of the Portuguese Language. PROPOR 2018. Lecture Notes in Computer Science, vol 11122. Springer, Cham. https://doi.org/10.1007/978-3-319-99722-3_47

  • DOI: https://doi.org/10.1007/978-3-319-99722-3_47

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-99721-6

  • Online ISBN: 978-3-319-99722-3

  • eBook Packages: Computer Science, Computer Science (R0)
