Skip to main content

From Italian Text to TimeML Document via Dependency Parsing

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 6609))

Abstract

This paper describes the first prototype for building TimeML xml documents starting from raw text for Italian. First, the text is parsed with the TULE parser, a dependency parser developed at the University of Turin. The parsed text is then used as input to the TimeML rule-based module we have implemented, henceforth called as ‘The converter’. So far, the converter identifies and classifies events in the sentence. The results are rather satisfatory, and this leads us to support the use of dependency syntactic relations for the development of higher level semantic tools.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Baroni, M., Bernardini, S., Comastri, F., Piccioni, L., Volpi, A., Aston, G., Mazzoleni, M.: Introducing the “la Repubblica” corpus: A large, annotated, TEI(XML)-compliant corpus of newspaper italian. In: Proceedings of the Fourth International conference on Language Resources and Evaluation, LREC 2004 (2004)

    Google Scholar 

  2. Bentivogli, L., Pianta, E.: Exploiting parallel texts in the creation of multilingual semantically annotated resources: the multisemcor corpus. Natural Language Engineering 11(3), 247–261 (2005)

    Article  Google Scholar 

  3. Bosco, C., Montemagni, A., Mazzei, A., Lombardo, V., Dell’Orletta, F., Lenci, A., Lesmo, L., Attardi, G., Simi, M., Lavelli, A., Hall, J., Nilsson, J., Nivre, J.: Comparing italian parsers on a common treebank: the evalita experience. In: Proc. of the 6th Int. Conf. on Language Resources and Evaluation (LREC 2010) (2010)

    Google Scholar 

  4. Bosco, C.: A grammatical relation system for treebank annotation. PhD thesis, University of Turin, Italy (2004)

    Google Scholar 

  5. Caselli, T.: TimeML Annotation Scheme for Italian - Version 1.3.1 (2010)

    Google Scholar 

  6. Grover, C., Tobin, R., Alex, B., Byrne, K.: Edinburgh-ltg: Tempeval-2 system description. In: Proc. of the 5th International Workshop on Semantic Evaluation, Uppsala, Sweden, pp. 333–336. Association for Computational Linguistics (July 2010)

    Google Scholar 

  7. Kumar Kolya, A., Ekbal, A., Bandyopadhyay, S.: Ju_cse_temp: A first step towards evaluating events, time expressions and temporal relations. In: Proc. of the 5th International Workshop on Semantic Evaluation, Uppsala, Sweden. ACL (2010)

    Google Scholar 

  8. Lesmo, L., Lombardo, V.: Transformed subcategorization frames in chunk parsing. In: Proc. of the 3rd Int. Conf. on Language Resources and Evaluation (LREC 2002), Las Palmas, pp. 512–519 (2002)

    Google Scholar 

  9. Llorens, H., Saquete, E., Navarro-Colorado, B.: Timeml events recognition and classification: Learning crf models with semantic roles. In: Proceedings of the 23rd International Conference on Computational Linguistics (Coling 2010), Beijing, China, pp. 725–733. Coling 2010 Organizing Committee (2010)

    Google Scholar 

  10. Llorens, H., Saquete, E., Navarro, B.: Tipsem (english and spanish): Evaluating crfs and semantic roles in tempeval-2. In: Proc. of the 5th International Workshop on Semantic Evaluation, Uppsala, Sweden, pp. 284–291. ACL (2010)

    Google Scholar 

  11. Mel’cuk, I.: Actants in semantics and syntax. Linguistics (42), 247–291 (2004)

    Google Scholar 

  12. Pustejovsky, J., Castao, J., Saurì, R., Ingria, R., Gaizauskas, R., Setzer, A., Katz, G.: TimeML: Robust specification of event and temporal expressions in text. In: Fifth International Workshop on Computational Semantics (IWCS-5) (2005)

    Google Scholar 

  13. Pustejovsky, J., Hanks, P., Saurì, R., See, A., Gaizauskas, R., Setzer, A., Radev, D., Sundheim, B., Day, D., Ferro, L., Lazo, M.: The TIMEBANK corpus. In: Corpus Linguistics 2003 (2003)

    Google Scholar 

  14. Pustejovsky, J., Verhagen, M.: Semeval-2010 task 13: Evaluating events, time expressions, and temporal relations (tempeval-2). In: Proceedings of the Workshop on Semantic Evaluations: Recent Achievements and Future Directions (SEW 2009), Boulder, Colorado, pp. 112–116. Association for Computational Linguistics (June 2009)

    Google Scholar 

  15. Ruimy, N., Monachini, M., Gola, E., Calzolari, N., Fiorentino, M.D., Ulivieri, M., Rossi, S.: A computational semantic lexicon of italian: SIMPLE. In: Linguistica Computazionale XVIII-XIX, Pisa, pp. 821–864 (2003)

    Google Scholar 

  16. Saquete, E., Gonzàlez, J.L.V., Martìnez-Barco, P., Munoz, R., Llorens, H.: Enhancing qa systems with complex temporal question processing capabilities. J. Artif. Intell. Res (JAIR) 35, 775–811 (2009)

    MATH  Google Scholar 

  17. Saurì, R., Knippen, R., Verhagen, M., Pustejovsky, J.: Evita: A robust event recognizer for qa systems. In: Proceedings of Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing (HLT/EMNLP 2005), pp. 700–707 (2005)

    Google Scholar 

  18. UzZaman, N., Allen, J.: Trips and trios system for tempeval-2: Extracting temporal information from text. In: Proc. of the 5th International Workshop on Semantic Evaluation, Uppsala, Sweden, pp. 276–283. ACL (2010)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Robaldo, L., Caselli, T., Russo, I., Grella, M. (2011). From Italian Text to TimeML Document via Dependency Parsing. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2011. Lecture Notes in Computer Science, vol 6609. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-19437-5_14

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-19437-5_14

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-19436-8

  • Online ISBN: 978-3-642-19437-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics