Skip to main content

A Study to Improve the Efficiency of a Discourse Parsing System

  • Conference paper
  • First Online:
Computational Linguistics and Intelligent Text Processing (CICLing 2003)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2588))

Abstract

This paper presents a study of the implementation of a discourse parsing system, where only significant features are considered. Rhetorical relations are recognized based on three types of cue phrases (the normal cue phrases, Noun-Phrase cues and Verb-Phrase cues), and different textual coherence devices. The parsing algorithm and its rule set are developed in order to create a system with high accuracy and low complexity. The data used in this system are taken from the RST Discourse Treebank of the Linguistic Data Consortium (LDC).

The biggest discourse corpus nowadays is the RST Discourse Treebank from LDC, with 385 Wall Street Journal articles.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Bouchachia, A., Mittermeir, R., Pozewaunig, H.: Document Identification by Shallow Semantic Analysis. NLDB (2000) 190–202

    Google Scholar 

  2. Carlson, L. and Marcu, D.: Discourse Tagging Manual. ISI Tech Report, ISI-TR-545 (2001)

    Google Scholar 

  3. Corston-Oliver, S.: Computing Representations of the Structure of Written Discourse. PhD Thesis. University of California, Santa Barbara, CA, U.S.A (1998a)

    Google Scholar 

  4. Corston-Oliver, S.: Beyond string matching and cue phrases: Improving efficiency and coverage in discourse analysis. In: Eduard Hovy and Dragomir Radev: The Spring Symposium. AAAI Technical Report SS-98-06, AAAI Press (1998b) 9–15

    Google Scholar 

  5. Gundel, J., Hegarty, M., Borthen, K.: Information structure and pronominal reference to clausally introduced entities. In: ESSLLI Workshop on Information Structure: Discourse Structure and Discourse Semantics. Helsinki (2001)

    Google Scholar 

  6. Hobbs, J.: On the Coherence and Structure of Discourse. Technical Report CSLI-85-37, Center for the Study of Language and Information (1985)

    Google Scholar 

  7. Hovy, E. H.: Parsimonious and profligate approaches to the question of discourse structure relation. In: Proceedings of the 5th International Workshop on Natural Language Generation. Pittsburgh (1990) 128–136

    Google Scholar 

  8. Knott, A., Dale, R.: Using linguistic phenomena to motivate a set of coherence relations. Discourse Processes 18 (1995) 35–62

    Article  Google Scholar 

  9. Komagata, N.: Entangled Information Structure: Analysis of Complex Sentence Structures. In: ESSLLI 2001 Workshop on Information Structure, Discourse Structure and Discourse Semantics. Helsinki (2001) 53–66

    Google Scholar 

  10. Mann, W. C. and Thompson, S. A.: Rhetorical Structure Theory: Toward a Functional Theory of Text Organization. Text, vol. 8 (1988) 243–281

    Google Scholar 

  11. Marcu, D.: Building Up Rhetorical Structure Trees. In: Proceedings of the Thirteenth National Conference on Artificial Intelligence (AAAI), volume 2 (1996) 1069–1074

    Google Scholar 

  12. Marcu, D.: The Rhetorical Parsing, Summarization, and Generation of Natural Language Texts. PhD Thesis, Department of Computer Science, University of Toronto (1997)

    Google Scholar 

  13. Marcu, D.: A decision-based approach to rhetorical parsing. The 37th Annual Meeting of the Association for Computational Linguistics (ACL). Maryland (1999) 365–372

    Google Scholar 

  14. Marcu, D., Echihabi, A.: An Unsupervised Approach to Recognizing Discourse Relations. In: Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics (ACL). Philadelphia, PA (2002)

    Google Scholar 

  15. Polanyi, L.: The Linguistic Structure of Discourse (1995)

    Google Scholar 

  16. Poesio, M., Di Eugenio, D.: Discourse Structure and Anaphoric Accessibility. In: ESSLLI Workshop on Information Structure, Discourse Structure and Discourse Semantics. Helsinki (2001)

    Google Scholar 

  17. Redeker, G.: Ideational and pragmatic markers of discourse structure. Journal of Pragmatics (1990) 367–381

    Google Scholar 

  18. Salkie, R.: Text and discourse analysis. London, Routledge (1995)

    Google Scholar 

  19. Webber, B. et al.: D-LTAG System-Discourse Parsing with a Lexicalized Tree Adjoining Grammar. In: ESSLLI Workshop on Information structure, Discourse structure and Discourse Semantics (2001)

    Google Scholar 

  20. Webber, B., Knott, A., Stone, M., Joshi, A.: Discourse Relations: A Structural and Presuppositional Account Using Lexicalised TAG. Meeting of the Association for Computational Linguistics, College Park MD (1999)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2003 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Le, H.T., Abeysinghe, G. (2003). A Study to Improve the Efficiency of a Discourse Parsing System. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2003. Lecture Notes in Computer Science, vol 2588. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-36456-0_11

Download citation

  • DOI: https://doi.org/10.1007/3-540-36456-0_11

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-00532-2

  • Online ISBN: 978-3-540-36456-6

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics