Discourse Segmentation for Spanish Based on Shallow Parsing

  • Iria da Cunha
  • Eric SanJuan
  • Juan-Manuel Torres-Moreno
  • Marina Lloberes
  • Irene Castellón
Conference paper

DOI: 10.1007/978-3-642-16761-4_2

Part of the Lecture Notes in Computer Science book series (LNCS, volume 6437)
Cite this paper as:
da Cunha I., SanJuan E., Torres-Moreno JM., Lloberes M., Castellón I. (2010) Discourse Segmentation for Spanish Based on Shallow Parsing. In: Sidorov G., Hernández Aguirre A., Reyes García C.A. (eds) Advances in Artificial Intelligence. MICAI 2010. Lecture Notes in Computer Science, vol 6437. Springer, Berlin, Heidelberg

Abstract

Nowadays discourse parsing is a very prominent research topic. However, there is not a discourse parser for Spanish texts. The first stage in order to develop this tool is discourse segmentation. In this work, we present DiSeg, the first discourse segmenter for Spanish, which uses the framework of Rhetorical Structure Theory and is based on lexical and syntactic rules. We describe the system and we evaluate its performance against a gold standard corpus, obtaining promising results.

Keywords

Discourse Parsing Discourse Segmentation Rhetorical Structure Theory 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Copyright information

© Springer-Verlag Berlin Heidelberg 2010

Authors and Affiliations

  • Iria da Cunha
    • 1
    • 2
    • 3
  • Eric SanJuan
    • 2
  • Juan-Manuel Torres-Moreno
    • 2
    • 4
  • Marina Lloberes
    • 5
  • Irene Castellón
    • 5
  1. 1.Institute for Applied Linguistics (UPF)BarcelonaSpain
  2. 2.Laboratoire Informatique d’AvignonAvignon Cedex 9France
  3. 3.Instituto de Ingeniería (UNAM)Ciudad UniversitariaMexico
  4. 4.École Polytechnique de Montréal/DGIMontréalCanada
  5. 5.GRIALUniversitat de BarcelonaBarcelonaSpain

Personalised recommendations