Abstract
In this paper we report on the efforts of three projects to annotate texts and dialogues with discourse structure. We provide a theoretical discussion of various alternatives and then present our approach to discourse structure annotation, along with some applications of the resources that we have developed.
Nicholas Asher—Part of this research was supported by European Research Council, Grant n. 269427.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
Not all annotation campaigns of course have this as a goal, the PDTB being one prominent example.
- 2.
- 3.
But see [63] for an investigation of some of these cases.
- 4.
see chapter “Crowdsourcing”, this volume, for a discussion on this point.
- 5.
Taking into account the gold annotations rather than the annotations produced during the two first phases.
- 6.
The project was CASOAR, http://projetcasoar.wordpress.com, a two year DGA-RAPID project (2010–2012).
References
Afantenos, S.D., Asher, N.: Testing SDRT’s right frontier. In: Proceedings of the 23rd International Conference on Computational Linguistics (COLING 2010), pp. 1–9 (2010)
Afantenos, S.D., Denis, P., Muller, P., Danlos, L.: Learning recursive segments for discourse parsing. In: Proceedings of LREC 2010 (2010)
Afantenos, S., Asher, N., Benamara, F., Bras, M., Fabre, C., Ho-Dac, L.M., Le Draoulec, A., Muller, P., Péry-Woodley, M. P., Prévot, L., Rebeyrolles, J., Tanguy, L., Vergez-Couret, M., Vieu, L.: An empirical resource for discovering cognitive principles of discourse organisation: the ANNODIS corpus. In: Calzolari, N., Choukri, K., Declerck, T., Doǧan, M.U., Maegaard, B., Mariani, J., Odijk, J., Piperidis, S. (eds.) Proceedings of the Eight International Conference on Language Resources and Evaluation (LREC’12), European Language Resources Association (ELRA). Istanbul, Turkey (2012)
Asher, N.: Reference to Abstract Objects in Discourse. Kluwer, The Netherlands (1993)
Asher, N.: Lexical Meaning in Context: A Web of Words. Cambridge University Press, Cambridge (2011)
Asher, N., Lascarides, A.: Logics of conversation. In: Studies in Natural Language Processing. Cambridge University Press, Cambridge (2003)
Asher, N., Hardt, D., Busquets, J.: Discourse parallelism, ellipsis and ambiguity. J. Semant. 18(1), (2001)
Asher, N., Benamara, F., Mathieu, Y.Y.: Distilling opinion in discourse: a preliminary study. In: Proceedings of Computational Linguistics (CoLing), pp. 7–10 (2008)
Atallah, C.: Analyse de relations de discours causales en corpus: étude empirique et caractérisation théorique. Ph.D. thesis, Université de Toulouse, Toulouse (2014)
Baldridge, J., Asher, N., Hunter, J.: Annotation for and Robust Parsing of Discourse Structure on Unrestricted Texts. Zeitschrift fur. Sprachwissenschaft 26, 213–239 (2007)
Benamara, F, Asher, N, Mathieu, Y, Popescu, V., Chardon, B.: Evaluation in discourse: a corpus-based study. In: Dialogue and Discourse (2015). (in press)
Biber, D.: Variation Across Speech and Writing. Cambridge University Press, Cambridge (1988)
Bourigault, D.: Un analyseur syntaxique opérationnel : SYNTEX. Université de Toulouse, Mémoire d’HDR (2007)
Bras, M.: French adverb d’abord and discourse structure. In: Aurnague, M., Larrazabal, J-M., Korta, K. (eds.) Language, Representation and Reasoning. Memorial Volume to Isabel Gomez Txurruka, pp. 77–102. Presses Universitaires du Pays Basque, Bilbao (2007)
Bras, M., Le Draoulec, A., Vieu, L.: French adverbial Puis between temporal structure and discourse structure. In: Bras, M., Vieu, L. (eds.) Semantic and Pragmatic Issues in Dialogue: Experimenting with Current Theories. CRISPI, vol. 9, pp. 109–146. Elsevier, Amsterdam (2001)
Bras, M., Le Draoulec, A., Asher, N.: A formal analysis of the French temporal connective alors. Oslo Stud Lang 1, 149–170 (2009)
Carletta, J., Isard, S., Doherty-Sneddon, G.: HCRC Dialogue Structure Coding Manual. HCRC Publications, The University of Edinburgh (1996)
Chafe, W.L.: Discourse Consciousness and Time: The Flow and Displacement of Conscious Experience in Speaking and Writing. University of Chicago Press, Chicago (1994)
Chardon, B., Benamara, F., Mathieu, Y.Y., Popescu, V., Asher, N.: Measuring the effect of discourse structure on sentiment analysis. In: CICLing, pp. 25–37 (2013a)
Chardon, B., Benamara, F., Mathieu, Y. Y., Popescu, V., Asher, N.: Sentiment composition using a parabolic model. In: Proceedings of the 10th International Conference on Computational Semantics (IWCS 2013), pp. 47–58 (2013b)
Charolles, M.: L’encadrement du discours - Univers, champs, domaines et espace. Cahiers de recherche linguistique 6, 1–73 (1997)
Charolles, M., Le Draoulec, A., Péry-Woodley, M.-P., Sarda, L.: Temporal and spatial dimensions of discourse organisation. J. Fr. Lang. Stud. 15(2), 203–218 (2005)
Cohen, J.: A coefficient of agreement for nominal scales. Educ. Psychol. Meas. 20(1), 37–46 (1960)
Colléter, M., Fabre, C., Ho-Dac, L.-M., Péry-Woodley, M.-P., Rebeyrolle, J., Tanguy, L.: La ressource ANNODIS multi-échelle : guide d’annotation et bonus. Technical report 20. Carnets de grammaires, CLLE-ERSS (2012)
Cornish, F.: Anaphora. Discourse and Understanding. Evidence from English and French. Clarendon Press, Oxford (1999)
Danlos, L.: Strong generative capacity of RST, SDRT and discourse dependency DAGSs. Pages 69–95 of: Benz, A., Kuhnlein, P. (eds.) Constraints in Discourse. John Benjamins, Amsterdam (2008)
Egg, M., Redeker, G.: How complex is discourse structure? In: Calzolari, N., Choucri, K., Maegaard, B., Mariani, J., Odijk, J., Piperidis, S., Rosner, M., Tapias, D. (eds.) Proceedings of LREC’10. ELRA (2010)
Enkvist, N.E.: Connexity, interpretability, universes of discourse, and text worlds. In: Allén, S. (ed.) Possible Worlds in Humanities, Arts and Sciences, pp. 162–186. Walter de Gruyter, Berlin (1989)
Feng, V.W., Hirst, G.: Text-level discourse parsing with rich linguistic features. IN: Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, vol. 1: Long Papers), pp. 60–68. Association for Computational Linguistics, Jeju Island, Korea (2012)
Forbes, K., Miltsakaki, E., Prasad, R., Sarkar, A., Joshi, A.K., Webber, B.L.: D-LTAG system: discourse parsing with a lexicalized tree-adjoining grammar. J. Logic Lang. Inf. 12(3), 261–279 (2003)
Francis, G.: Labelling discourse: an aspect of nominal-group lexical cohesion. In: Coulthard, M. (ed.) Advances in Written Text Analysis, pp. 83–101. Routledge, London (1994)
Fries, P.: Themes method of development and texts. In: Hasan, R., Fries, P. (eds.) On Subject and Theme: A Discourse Functional Perspective, pp. 317–359. John Benjamins, Amsterdam (1995)
Goutsos, D.: A model of sequential relations in expository test. Text 16(4), 501–533 (1996)
Grosz, B., Sidner, C.: Attention, intentions and the structure of discourse. Comput. Linguist. 12, 175–204 (1986)
Halliday, M.A.K.: Text as semantic choice in social contexts. In: van Dijk, T., Petöfi, J.S. (eds.) Grammars and Descriptions, pp. 176–226. Walter de Gruyter, Berlin (1977)
Halliday, M.A.K.: An Introduction to Functional Grammar, 2nd edn. Arnold, London (1985)
Halliday, M.A.K., Hasan, R.: Cohesion in English. Longman, London (1976)
Hempel, S., Degand, L.: sequencers in different text genres: academic writing, journalese and fiction. J. Pragmat. 40, 676–693 (2008)
Hernault, H., Prendinger, H., duVerle, D.A., Ishizuka, M.: HILDA: a discourse parser using support vector machine classification. Dialogue Discourse 1(3), 1–33 (2010)
Hitzeman, J., Moens, M., Grover, C.: Algorithms for analyzing the temporal structure of discourse. In: Proceedings of the 7th Meeting of the European Chapter of the Association for Computational Linguistics, pp. 253–260 (1995)
Ho-Dac, L-M., Péry-Woodley, M-P.: A data-driven study of temporal adverbials as discourse segmentation markers. Discours 4 (2009)
Ho-Dac, L.-M., Péry-Woodley, M.-P, Tanguy, L.: Anatomie des structures énumératives. In: Actes de TALN, (ed.) 2010. Université de Montréal, for ATALA, Montréal (2010)
Ho-Dac, L.-M., Fabre, Cécile, Péry-Woodley, M.-P., Rebeyrolle, J., Tanguy, L.: On the signalling of multi-level discourse structures. Discours 10 (2012)
Hobbs, J.R.: Coherence and coreference. Cognit. Sci. 3(1), 67–90 (1979)
Hovy, E.H.: Parsimonious and profligate approaches to the question of discourse structure relations. In: Proceedings of the Fifth International Workshop on Natural Language Generation, pp. 128–136 (1990)
Joty, S., Carenini, G., Ng, R.: A novel discriminative framework for sentence-level discourse analysis. In: Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning. Association for Computational Linguistics, Jeju Island, Korea (2012)
Kamp, H., Reyle, U.: From Discourse to Logic: Introduction to Modeltheoretic Semantics of Natural Language. Formal Logic and Discourse Representation Theory. Kluwer Academic Publishers, The Netherlands (1993)
Landis, J.R., Koch, G.G.: The measurement of observer agreement for categorical data. Biometrics 33(1), 159–174 (1977)
Lascarides, A., Asher, N.: Temporal interpretation, discourse relations and commonsense entailment. Linguist. Philos. 16(5), 437–493 (1993)
Mann, W., Thompson, S.: Rhetorical structure theory: a theory of text organization. Technical report, Information Science Institute (1987)
Marcu, D.: Building up rhetorical structure trees. Proceedings of the thirteenth national conference on Artificial intelligence. AAAI’96, vol. 2, pp. 1069–1074. AAAI press, California (1996)
Mathet, Y., Widlöcher, A.: La plate-forme GLOZZ : environnement d’annotation et d’exploration de corpus. In: Actes de TALN, (ed.) 2009. LIPN, for ATALA, Senlis (2009)
Maudet, N., Muller, P., Prévot, L.: Social constraints on rhetorical relations in dialogue. In: Sidner, C., Harpur, J., Benz, A., Kühnlein, P. (eds.) Proceedings of the Workshop on Constraints in Discourse, pp. 133–139 (2006)
Muller, P., Prévot, L.: An empirical study of acknowledgment structures. In: Proceedings of Diabruck 2003, 7th Workshop on the Semantics and Pragmatics of Dialogue, (Sept 4th–6th) (2003)
Muller, P., Prévot, L.: The rhetorical attachment of questions and answers. In: Korta, K., Garmendia, J. (eds.) Meaning, Intentions, and Argumentation. (CSLI-LN) Center for the Study of Language and Information - Lecture Notes, vol. 186. University of Chicago press, Chicago (2008). http://www.journals.uchicago.edu/
Muller, P., Afantenos, S., Denis, P., Asher, N.: Constrained decoding for text-level discourse parsing. In: Proceedings of COLING (2012a)
Muller, P., Vergez, M., Prévot, L, Asher, N, Benamara, F., Bras, M., Le Draoulec, A., Vieu, L.: Manuel d’annotation en relations de discours du projet Annodis. Technical report 21. CLLE (2012b)
Polanyi, L.: A formal model of the structure of discourse. J. Pragmat. 12, 601–638 (1988)
Polanyi, L., Culy, C., van den Berg, M., Thione, G.L., Ahn, D.: A rule based approach to discourse parsing. In: Strube, M., Sidner, C. (eds.) Proceedings of the 5th SIGdial Workshop on Discourse and Dialogue, pp. 108–117. Association for Computational Linguistics, Cambridge (2004)
Power, R., Scott, D., Bouayad-Agha, N.: Document structure. Comput. Linguist. 2(29), 211–260 (2003)
Prasad, R., Dinesh, N., Lee, A., Miltsakaki, E., Robaldo, L., Joshi, A., Webber, B.: The penn discourse TreeBank 2.0. In: Calzolari, N., Choukri, K., Maegaard, B., Mariani, J., Odjik, J., Piperidis, S., Tapias, D. (eds.) Proceedings of the Sixth International Language Resources and Evaluation (LREC’08). European Language Resources Association (ELRA), Marrakech, Morocco (2008). http://www.lrec-conf.org/proceedings/lrec2008/
Prévot, L., Vieu, L., Asher, N.: Une formalisation plus précise pour une annotation moins confuse: la relation d’Élaboration d’entité. J. Fr. Lang. Stud. 19(2), 207–228 (2009)
Roze, C.: Vers une algèbre des relations de discours. Ph.D. thesis, Université Paris 7 (2013)
Sagae, K.: Analysis of discourse structure with syntactic dependencies and data-driven shift-reduce parsing. Proceedings of the 11th International Conference on Parsing Technologies. IWPT ’09, pp. 81–84. Association for Computational Linguistics, Stroudsburg (2009)
Somasundaran, S.: Discourse-level relations for Opinion Analysis. Ph.D. thesis, University of Pittsburgh (2010)
Subba, R., Di Eugenio, B.: An effective discourse parser that uses rich linguistic information. In: Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, pp. 566–574. Association for Computational Linguistics, Boulder, Colorado (2009)
Trnavac, R., Taboada, M.: The contribution of nonveridical rhetorical relations to evaluation in discourse. Lang. Sci. 34(3), 301–318 (2010)
Turney, P.D.: Thumbs up or thumbs down?: semantic orientation applied to unsupervised classification of reviews. In: Proceedings of Annual Meeting of the Association for Computational Linguistics (2002)
Venant, A., Asher, N., Muller, P., Denis, P., Afantenos, S.: Expressivity and comparison of models of discourse structure. In: Proceedings of the SIGDIAL 2013 Conference, pp. 2–11. Association for Computational Linguistics (2013a)
Vergez-Couret, M.: Etude en corpus des réalisations linguistiques de la relation d’Elaboration. Ph.D. thesis, Université de Toulouse, Toulouse (2010)
Webber, B., Egg, M., Kordoni, V.: Discourse structure and language technology. Nat. Lang. Eng. 18(4), 437–490 (2012)
Wiebe, J., Riloff, E.: Creating subjective and objective sentence classifiers from unannotated texts. In: Proceedings of the International Conference on Intelligent Text Processing and Computational Linguistics (CICLing). Lecture Notes in Computer Science, vol. 3406, pp. 486–497 (2005)
Wolf, F., Gibson, E.: Representing discourse coherence: a corpus based study. Comput. Linguist. 31(2), 249–287 (2005)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer Science+Business Media Dordrecht
About this chapter
Cite this chapter
Asher, N. et al. (2017). ANNODIS and Related Projects: Case Studies on the Annotation of Discourse Structure. In: Ide, N., Pustejovsky, J. (eds) Handbook of Linguistic Annotation. Springer, Dordrecht. https://doi.org/10.1007/978-94-024-0881-2_47
Download citation
DOI: https://doi.org/10.1007/978-94-024-0881-2_47
Published:
Publisher Name: Springer, Dordrecht
Print ISBN: 978-94-024-0879-9
Online ISBN: 978-94-024-0881-2
eBook Packages: Social SciencesSocial Sciences (R0)