Abstract
An important aspect of discourse understanding and generation is the recognition and processing of discourse relations. These relations are conveyed by discourse connectives: either explicit lexical items such as "because" and "as a result", or implicit connectives that express an inferred discourse relation. The Penn Discourse TreeBank (PDTB) provides annotations of the argument structure, attribution, and semantics of discourse connectives. In this paper, we present the rationale for the tagset, detailed descriptions of the senses with corpus examples, simple semantic definitions of each type of sense tag, and informal descriptions of the inferences allowed at each level.
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Miltsakaki, E., Robaldo, L., Lee, A., Joshi, A. (2008). Sense Annotation in the Penn Discourse Treebank. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2008. Lecture Notes in Computer Science, vol 4919. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-78135-6_23
DOI: https://doi.org/10.1007/978-3-540-78135-6_23
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-78134-9
Online ISBN: 978-3-540-78135-6
eBook Packages: Computer Science (R0)