Advertisement

Review and Evaluation of DiZer – An Automatic Discourse Analyzer for Brazilian Portuguese

  • Thiago Alexandre Salgueiro Pardo
  • Maria das Graças Volpe Nunes
Part of the Lecture Notes in Computer Science book series (LNCS, volume 3960)

Abstract

This paper presents the review and evaluation of DiZer – an automatic discourse analyzer for Brazilian Portuguese. Based on Rhetorical Structure Theory, DiZer is a symbolic analyzer that makes use of linguistic patterns learned from a corpus of scientific texts to identify and build the discourse structure of texts. DiZer evaluation shows satisfactory results for scientific texts. In order to test its portability, DiZer is also evaluated with news texts and presents acceptable performance.

Keywords

Baseline Method Discourse Structure Scientific Text Text Segment Order Satellite 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Aires, R.V.X., Aluísio, S.M., Kuhn, D.C.S., Andreeta, M.L.B., Oliveira Jr., O.N.: Combining Multiple Classifiers to Improve Part of Speech Tagging: A Case Study for Brazilian Portuguese. In: The Proceedings of the Brazilian AI Symposium – SBIA, pp. 20–22 (2000)Google Scholar
  2. Carlson, L., Marcu, D.: Discourse Tagging Reference Manual. ISI Technical Report ISI-TR-545 (2001)Google Scholar
  3. Corston-Oliver, S.: Computing Representations of the Structure of Written Discourse. PhD Thesis, University of California, Santa Barbara, CA, USA (1998)Google Scholar
  4. Cristea, D., Ide, N., Romary, L.: Veins Theory, An Approach to Global Cohesion and Coherence. In: The Proceedings of Coling/ACL (1998)Google Scholar
  5. Grosz, B., Sidner, C.: Attention, Intentions, and the Structure of Discourse. Computational Linguistics 12(3) (1986)Google Scholar
  6. Jordan, M.P.: An Integrated Three-Pronged Analysis of a Fund-Raising Letter. In: Mann, W.C., Thompson, S.A. (eds.) Discourse Description: Diverse Linguistic Analyses of a Fund-Raising Text, pp. 171–226 (1992)Google Scholar
  7. Kehler, A.: Coherence, Reference and the Theory of Grammar. CSLI Publications (2002)Google Scholar
  8. Mann, W.C. and Thompson, S.A, Rhetorical Structure Theory: A Theory of Text Organization. Technical Report ISI/RS-87-190 (1987)Google Scholar
  9. Marcu, D.: The Rhetorical Parsing, Summarization, and Generation of Natural Language Texts. PhD Thesis, Department of Computer Science, University of Toronto (1997)Google Scholar
  10. Marcu, D.: The Theory and Practice of Discourse Parsing and Summarization. The MIT Press, Cambridge (2000)Google Scholar
  11. O’Donnell, M.: Variable-Length On-Line Document Generation. In: The Proceedings of the 6th European Workshop on Natural Language Generation. Gerhard-Mercator University, Duisburg (1997)Google Scholar
  12. Pardo, T.A.S.: Métodos para Análise Discursiva Automática. PhD Thesis. Instituto de Ciências Matemáticas e de Computação, Universidade de São Paulo. São Carlos-SP, June 2005, 211p. (2005)Google Scholar
  13. Pardo, T.A.S., Nunes, M.G.V.: A Construção de um Corpus de Textos Científicos em Português do Brasil e sua Marcação Retórica. Technical Report N. 212. Instituto de Ciências Matemáticas e de Computação, Universidade de São Paulo. São Carlos-SP, September 2003, 26p. (2003)Google Scholar
  14. Pardo, T.A.S., Nunes, M.G.V.: Relações Retóricas e seus Marcadores Superficiais: Análise de um Corpus de Textos Científicos em Português do Brasil. Technical Report N. 231. Instituto de Ciências Matemáticas e de Computação, Universidade de São Paulo. São Carlos-SP, April 2004, 73p. (2004)Google Scholar
  15. Pardo, T.A.S., Nunes, M.G.V., Rino, L.H.M.: DiZer: An Automatic Discourse Analyzer for Brazilian Portuguese. In: Bazzan, A.L.C., Labidi, S. (eds.) SBIA 2004. LNCS (LNAI), vol. 3171, pp. 224–234. Springer, Heidelberg (2004)CrossRefGoogle Scholar
  16. Pardo, T.A.S., Seno, E.M.R.: Rhetalho: um corpus de referência anotado retoricamente. In: Anais do V Encontro de Corpora. São Carlos-SP, November 24-25 (2005)Google Scholar
  17. Pereira, F.C.N., Warren, D.H.D.: Definite Clause Grammars for Language Analysis – A Survey of the Formalism and Comparison with Augmented Transition Networks. In: Artificial Intelligence, vol. 13, pp. 231–278 (1980)Google Scholar
  18. Schauer, H.: Referential Structure and Coherence Structure. In: The Proceedings of TALN. Lausanne, Switzerland (2000)Google Scholar
  19. Soricut, R., Marcu, D.: Sentence Level Discourse Parsing using Syntactic and Lexical Information. In: The Proceedings of HLT/NAACL (2003)Google Scholar
  20. Sumita, K., Ono, K., Chino, T., Ukita, T., Amano, S.: A discourse structure analyzer for Japonese text. In: The Proceedings of the International Conference on Fifth Generation Computer Systems, Tokyo, Japan, vol. 2, pp. 1133–1140 (1992)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

  • Thiago Alexandre Salgueiro Pardo
    • 1
  • Maria das Graças Volpe Nunes
    • 1
  1. 1.Núcleo Interinstitucional de Lingüística Computacional (NILC)São CarlosBrasil

Personalised recommendations