Advertisement

Thai Text Coherence Structuring with Coordinating and Subordinating Relations for Text Summarization

  • Thana Sukvaree
  • Asanee Kawtrakul
  • Jean Caelen
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4635)

Abstract

Text summarization with the consideration of coherence can be achieved by using discourse processing with the Rhetorical Structure Theory (RST). Additional problems on relational ambiguity may arise, especially in Thai. For example, the use of cue words, i.e. “tae/ ” (meaning “but”), can be identified as a contrast relation or an elaboration relation. Therefore, we propose the reduction of the ambiguity level by reducing the relation types to two, namely Coordinating and Subordinating relation. Our framework is to concentrate on coherence structuring which requires the following 3 steps: (1) identify an attachment point for an incoming discourse unit by using our Adaptive Right-frontier algorithm; (2) extract Coordinating and Subordinating relations through the identification of linguistic coherence features in the lexical and phrasal level, using Bayesian techniques; (3) construct coherence tree structures, The accuracy is 70.45% for the first step, 77.47% and 79.89% for COR and SUBR extraction respectively in the second step and 64.94% in constructing coherent tree of the third.

Keywords

Attachment Point Discourse Relation Blast Disease Text Summarization Discourse Marker 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Edmundson, H.P.: New Method in Automatic Extracting. ACM 16(2), 264–285 (1969)zbMATHCrossRefGoogle Scholar
  2. 2.
    Hovy, E., Lin, C.: Automated text summarization in summarist. In: Proceedings of the Workshop on Intelligent Scalable Text Summarization, pp. 18–24 (1977)Google Scholar
  3. 3.
    Marcu, D.: The rhetorical parsing of natural language texts. In: Meeting of the Association for Computational Linguistics, pp. 96–103 (1997)Google Scholar
  4. 4.
    Cristea, D., Postolache, O., Pistol, I.: Summarisation through discourse structure [15], pp. 632–644Google Scholar
  5. 5.
    Mann, W.C., Thompson, S.A.: Rhetorical structure theory: Toward a functional theory of text organization. Text 8(3), 243–281 (1998)Google Scholar
  6. 6.
    Moore, J.D., Pollack, M.E.: A problem for RST: The need for multi-level discourse analysis. Computational Linguistics 18(4), 537–544 (1992)Google Scholar
  7. 7.
    Hovy, E., Maier, E.: Parsimonious or profligate: How many and which discourse structure relations. In: Discourse Processes, pp. 18–24 (1977)Google Scholar
  8. 8.
    Asher, N., Lascarides, A.: Logics of Conversation. Studies in Natural Language Processing. Cambridge University Press, Cambridge (2005)Google Scholar
  9. 9.
    Polanyi, L.: A formal model of the structure of discourse. Journal of Pragmatics 12, 601–638 (1988)CrossRefGoogle Scholar
  10. 10.
    Sassen, C., Kühnlein, P.: The right frontier constraint as conditional [15], pp. 222–225Google Scholar
  11. 11.
    Grosz, B.J., Joshi, A.K., Weinstein, S.: Centering: A framework for modeling the local coherence of discourse. Computational Linguistics 21(2), 203–225 (1995)Google Scholar
  12. 12.
    Salton, G., Wong, A., Yang, C.S.: A vector space model for automatic indexing. Commun. ACM 18(11), 613–620 (1975)zbMATHCrossRefGoogle Scholar
  13. 13.
    Kongwa, A., Kawtrakul, A.: Know-what: A development of object-property extraction from thai texts and query system. In: Proceeding of the Sixth Symposium on Natural Language Processing (2005)Google Scholar
  14. 14.
    Wattanamethanont, M., T.S., Kawtrakul, A.: Thai discourse relations recognition by using naive bayes classifier. In: The Proceedings of the Sixth Symposium on Natural Language Processing (2005)Google Scholar
  15. 15.
    Gelbukh, A. (ed.): CICLing 2005. LNCS, vol. 3406. Springer, Heidelberg (2005)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2007

Authors and Affiliations

  • Thana Sukvaree
    • 1
  • Asanee Kawtrakul
    • 1
  • Jean Caelen
    • 2
  1. 1.Department of Computer Engineering, Kasetsart University, BangkokThailand
  2. 2.Laboratory CLIPS, University of Joseph Fourier, Grenoble Cedex 9France

Personalised recommendations