Advertisement

Cat3LB and Cast3LB: From Constituents to Dependencies

  • Montserrat Civit
  • Ma. Antònia Martí
  • Núria Bufí
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4139)

Abstract

In this paper we present the conversion of two treebanks (Cat3LB for Catalan, and Cast3LB for Spanish) from its original constituent format into dependencies. The process has been done automatically but by manually writing the head and the function table. The process has also been used to improve the quality of the first annotation and to modifiy the annotation for further extensions of the treebanks. Treebanks in both formats are freely available for research purposes.

Keywords

Noun Phrase Relative Clause Linguistic Theory Function Table Nominal Group 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Beil, F., Prescher, D., Schmid, H., Shulte im Walde, S.: Evaluation of the Gramotron parser for German. In: Beyond Parseval, a LREC 2002 Workshop (2002)Google Scholar
  2. 2.
    Brants, S., Dipper, S., Hansen, S., Lezius, W., Smith, G.: The TIGER Treebank. In: Proccedings of the Workshop on Treebanks and Linguistic Theories (2002)Google Scholar
  3. 3.
    Civit, M., Martí, M.A.: Building Cast3LB: a Spanish Treebank. Research on Language & Computation 2(4) (2005)Google Scholar
  4. 4.
    Civit, M., Bufí, N., Valverde, M.P.: CAT3LB: a Treebank for Catalan with Word Sense Annotation. In: 3rd Workshop on Treebanks and Linguistic Theories (TLT 2004), Tuebingen, Germany (2004)Google Scholar
  5. 5.
    Civit, M.: Guía para la anotación sintáctica de Cast3LB: un corpus del español con anotación sintáctica, semántica y pragmática (2003), Available at: http://clic.fil.ub.es/
  6. 6.
    Civit, M.: Guía para la anotación de las funciones sintácticas de Cast3LB: un corpus del español con anotación sintáctica, semántica y pragmática (2003), Available at: http://clic.fil.ub.es/
  7. 7.
    Civit, M., Bufí, N., Valverde, M.P.: Guia per a la anotació de les funcions sintàctiques de Cat3LB: un corpus del català amb anotació sintàctica, semàntica i pragmàtica (2004), Available at: http://clic.fil.ub.es/
  8. 8.
    Hajic, J.: Building a syntactically annotated corpus: the Prague Dependency Treebank. Issues in Valency and Meaning. Studies in honour of Jarmila Panevova (1999)Google Scholar
  9. 9.
    Kromann, M.: The Danish Dependency Treebank and the underlying linguistic theory. In: Proceedings of the Second Workshop on Treebanks and Linguistic Theories (2003)Google Scholar
  10. 10.
    Lin, D.: A dependency-based method for evaluating broad-coverage parsers. In: Proceedings of IJCAI 1995, pp. 1420–1425 (1995)Google Scholar
  11. 11.
    Lin, D.: A dependency-based method for evaluating broad-coverage parsers. Natural Language Engineering 4(2), 1420–1425 (1998)CrossRefGoogle Scholar
  12. 12.
    Valverde, M.P., Civit, M., Bufí, N.: Guia per a la anotació sintàctica de Cat3LB: un corpus del català amb anotació sintàctica, semàntica i pragmàtica (2004), Available at: http://clic.fil.ub.es/

Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

  • Montserrat Civit
    • 1
  • Ma. Antònia Martí
    • 1
  • Núria Bufí
    • 1
  1. 1.CLiC Centre de Llenguatge i ComputacióUniversitat de Barcelona 

Personalised recommendations