Language Resources and Evaluation

, Volume 41, Issue 1, pp 91–115

TimeBank evolution as a community resource for TimeML parsing

Authors

    • IBM T.J. Watson Research Center
  • James Pustejovsky
    • Brandeis University
  • Rie Ando
    • IBM T.J. Watson Research Center
  • Marc Verhagen
    • Brandeis University
Original Paper

DOI: 10.1007/s10579-007-9018-8

Cite this article as:
Boguraev, B., Pustejovsky, J., Ando, R. et al. Lang Resources & Evaluation (2007) 41: 91. doi:10.1007/s10579-007-9018-8

Abstract

TimeBank is the only reference corpus for TimeML, an expressive language for annotating complex temporal information. It is a rich resource for a broad range of research into various aspects of the expression of time and temporally related events. This paper traces the development of TimeBank from its initial—and somewhat noisy—version (1.1) to a substantially revised release (1.2), now available via the Linguistic Data Consortium. The development path is motivated by the encouraging empirical results of TimeML-compliant annotators developed on the basis of TimeBank 1.1, and is informed by a detailed study of the characteristics of that initial release, which guides a clean-up process turning TimeBank 1.2 into a consistent and robust community resource.

Keywords

TimeMLTimeBankCorpus analysisTemporal information extraction

Abbreviations

TimeML

A Markup Language for Time

timex

Time Expression

LDC

Linguistic Data Consortium

IE

Information Extraction

IAA

Inter-Annotator Agreement

Copyright information

© Springer Science+Business Media B.V. 2007