Skip to main content

Building the Italian Syntactic-Semantic Treebank

  • Chapter
Treebanks

Abstract

The paper reports on the design and construction of a multi-layered corpus of Italian, annotated at the syntactic and lexico-semantic levels, whose development is supported by dedicated software augmented with an intelligent interface. The issue of evaluating this type of resource is also addressed.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  • Abeillé A., Clément L., Toussenel F., (2003). Building a treebank for French. This volume.

    Google Scholar 

  • Alonge A., Calzolari N., Vossen P., Bloksma L., Castellon L., Martì T., Peters W., (1998). The Linguistic Design of the EuroWordNet Database. Computers and the Humanities 32, 2–3, p. 91–115.

    Article  Google Scholar 

  • Bates E., (1976). Language and Context: Studies in the Acquisition of Pragmatics, New York, Academic Press.

    Google Scholar 

  • Brants T., Skut W., Uszkoreit H., (2003). Syntactic annotation of a German newspaper corpus. This volume.

    Google Scholar 

  • Böhmová A., Hajic J., Hladká B., Panenová J., (2003). The Prague Dependency Treebank: a three-level annotation scenario. This volume.

    Google Scholar 

  • Calzolari N., Corazzari O., (2000). Senseval/Romanseval: the framework for Italian. Computers and the Humanities, vol. 34, n. 1–2, p. 61–78.

    Article  Google Scholar 

  • Carroll J., Briscoe T., Calzolari N., Federici S., Montemagni S., Pirrelli V., Grefenstette G., Sanfilippo A., Carroll G., Rooth M., (1996). Specification of Phrasal Parsing, SPARKLE Deliverable 1.1.

    Google Scholar 

  • Carroll J., Briscoe E., Sanfilippo A., (1998). Parser Evaluation: a Survey and a New Proposal. Proceedings of LREC, Granada, p. 447–454.

    Google Scholar 

  • Caroll J., Minnen G., Briscoe T., (2003). Parser evaluation using a grammatical relation annotation scheme. This volume.

    Google Scholar 

  • Chen K., Luo C, Gao Z., Chang M., Chen F., Chen C., (2003). Sinica Tree-bank. This volume.

    Google Scholar 

  • Corazzari O., Calzolari N., Zampolli A., (2000). An Experiment of Lexical-Semantic Tagging of an Italian Corpus. LREC-2000 Proceedings, Athens.

    Google Scholar 

  • Corazzari O., Monachini M., (1995). ELSNET: Italian Corpus Sample, ILC, Pisa.

    Google Scholar 

  • CPR, ITC-IRST, Quinary, (2000). ItalWordNet: Rete semantico-lessicale per l’italiano. SI-TAL, Specifiche Tecniche di SI-TAL, Manuale Operativo, Capitolo 2.

    Google Scholar 

  • Delmonte R., (1999). From Shallow Parsing to Functional Structure. Atti del Workshop AI*IA “Elaborazione del Linguaggio e Riconoscimento del Par-lato”, IRST Trento, p. 8–19.

    Google Scholar 

  • Delmonte R., (2000). Shallow Parsing and Functional Structure in Italian Corpora. LREC-2000 Proceedings, Athens.

    Google Scholar 

  • Goggi S., Biagini L., Picchi E., Bindi R., Rossi S., Marinelli R., (1997). Italian Corpus Documentation, LE-PAROLEWP2.il, ILC, Pisa.

    Google Scholar 

  • Greenbaum S. (ed.), (1996). English Worldwide: The International Corpus of English, Oxford, Clarendon Press.

    Google Scholar 

  • Karlsson F., Voutilainen A., Heikkila J., Anttila A. (eds.), (1995). Constraint Grammar, a language-independent system for parsing unconstrained text. Berlin and New York, Mouton de Gruyter.

    Google Scholar 

  • Lenci A., Montemagni S., Pirrelli V., Soria C., (1999). FAME: a Functional Annotation Meta-scheme for Multimodal and Multi-lingual Parsing Evaluation. Proceeding of the ACL99 Workshop on Computer-Mediated Language Assessment and Evaluation in NLP, University of Maryland.

    Google Scholar 

  • Lenci A., Montemagni S., Pirrelli V., Soria C, (2000). Where opposites meet. A Syntactic Meta-scheme for Corpus Annotation and Parsing Evaluation. LREC-2000 Proceedings, Athens.

    Google Scholar 

  • Lin D., (1998). A dependency-based method for evaluating broad-coverage parsers. Natural Language Engineering, 4(2), p. 97–114.

    Article  Google Scholar 

  • Lin D., (2003). Dependency-based evaluation of MINIPAR. This volume.

    Google Scholar 

  • Marcus M., Marcinkiewicz M.A., Santorini B., (1993). Building a Large Annotated Corpus of English: The Penn Treebank. Computational Linguistics, 19(2), p. 313–330.

    Google Scholar 

  • McCord M.C., (1980). Slot Grammars. Computational Linguistics, vol. 6, p. 31–43.

    Google Scholar 

  • McCord M.C., (1989). Design of LMT: A Prolog-based Machine Translation System. Computational Linguistics, vol 15, p. 33–52.

    Google Scholar 

  • Monachini M., Calzolari N., (1996). Synopsis and Comparison of Morphosyn-tactic Phenomena Encoded in Lexicons and Corpora. A Common Proposal and Application to European Languages. EAGLES Recommendations. Pisa, ILC.

    Google Scholar 

  • Montemagni S., Barsotti F., Battista M., Calzolari N., Corazzari O., Zampolli A., Fanciulli F., Massetani M., Raffaelli R., Basili R., Pazienza M.T., Sara-cino D., Zanzotto F., Mana N., Pianesi F., Delmonte R., (2000). The Italian Syntactic-Semantic Treebank: Architecture, Annotation, Tools and Evaluation. Proceedings of the COLING Workshop on “Linguistically Interpreted Corpora (LINC-2000)”, Luxembourg, 6 August 2000, p. 18–27.

    Google Scholar 

  • Montemagni S., and Pazienza M.T. (eds), (2001) Atti del Workshop su “La Treebank sintattico-semantica delVitaliano in SI-TAL”. 7° Congresso della Associazione Italiana per l’Intelligenza Artificiale (AI*IA 2001), Bari, 26 settembre 2001.

    Google Scholar 

  • Moreno A., López A., Sánchez F., (1999). Spanish Tree Bank: Specifications, Version 4, Manuscript.

    Google Scholar 

  • Moreno A., López A., Sánchez F., Grishman R., (2003). Developing a syntactically annotation scheme and tools for a Spanish treebank. This volume.

    Google Scholar 

  • Picchi E., (1994). Pi-Tagger: A tagger and lemmatizer for Italian. EURALEX-94 Proceedings, Amsterdam.

    Google Scholar 

  • Sampson G., (1995). English for the Computer, Oxford, Clarendon Press.

    Google Scholar 

  • Sampson G., (2000). A proposal for improving the measurement of parse accuracy. International Journal of Corpus Linguistics, vol. 5, n. 1, p. 53–68.

    Article  Google Scholar 

  • Sampson G., (2003). Thoughts on two decades of drawing trees. This volume.

    Google Scholar 

  • Sinclair J., (1996). The Empty Lexicon. International Journal of Corpus Linguistics, 1, p. 99–119.

    Article  Google Scholar 

  • SI-TAL, (2000). Specifiche Tecniche di SI-TAL. Manuale Operativo. ILC, Pisa.

    Google Scholar 

  • Taylor A., et al. (2003). The Penn Treebank: an overview. This volume.

    Google Scholar 

  • Wallis S., (2003). Completing parsed corpora from correction to evolution. This volume.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2003 Springer Science+Business Media Dordrecht

About this chapter

Cite this chapter

Montemagni, S. et al. (2003). Building the Italian Syntactic-Semantic Treebank. In: Abeillé, A. (eds) Treebanks. Text, Speech and Language Technology, vol 20. Springer, Dordrecht. https://doi.org/10.1007/978-94-010-0201-1_11

Download citation

  • DOI: https://doi.org/10.1007/978-94-010-0201-1_11

  • Publisher Name: Springer, Dordrecht

  • Print ISBN: 978-1-4020-1335-5

  • Online ISBN: 978-94-010-0201-1

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics