A Simple Spanish Part of Speech Tagger for Detection and Correction of Accentuation Error

  • S. N. Galicia-Haro
  • I. A. Bolshakov
  • A. F. Gelbukh
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 1692)


One of the most frequent kinds of typographic error specific to Spanish concerns accentuation: the omission of an obligatory stress mark or the insertion of a superfluous one. If such an error transforms one word into another existing word, it cannot be detected by the usual spell-checkers, since some context analysis is necessary. A simple procedure is proposed for this task. It relies on (1) simple heuristics that determine the linear context and (2) a small list of word pairs that differ only in the accent mark. The idea is applied to the numerous nouns and adjectives, such as número, that turn into quasi-homonymous personal verb forms when they lose their stress marks.
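The two ingredients named in the abstract, a pair list and a linear-context heuristic, can be illustrated with a minimal Python sketch. This is not the authors' procedure: the pair list `ACCENT_PAIRS`, the cue set `NOUN_CUES`, and the function `suggest_accent_fixes` are hypothetical, and the single heuristic used here (a determiner or contracted preposition immediately before the word suggests that a noun or adjective was intended rather than a personal verb form) is an assumption made only for illustration.

```python
# Hypothetical pair list: unaccented personal verb form -> accented noun/adjective.
ACCENT_PAIRS = {
    "numero": "número",
    "publico": "público",
    "termino": "término",
}

# Assumed linear-context cue: words that normally precede a noun or adjective,
# not a personal verb form.
NOUN_CUES = {"el", "un", "este", "ese", "su", "del", "al"}

PUNCTUATION = ".,;:!?"


def suggest_accent_fixes(text):
    """Return (token index, token as written, suggested form) for suspicious words."""
    tokens = text.split()
    fixes = []
    for i in range(1, len(tokens)):
        word = tokens[i].lower().strip(PUNCTUATION)
        previous = tokens[i - 1].lower().strip(PUNCTUATION)
        # Flag the word only if it is in the pair list AND the left context
        # suggests that the accented noun/adjective was intended.
        if word in ACCENT_PAIRS and previous in NOUN_CUES:
            fixes.append((i, tokens[i], ACCENT_PAIRS[word]))
    return fixes


print(suggest_accent_fixes("El numero de casos"))
print(suggest_accent_fixes("Yo numero las páginas"))
```

In this sketch the first sentence is flagged because "numero" follows the determiner "El", while the second is left alone because the pronoun "Yo" is compatible with a verb form. A realistic tagger would need more context rules and a larger pair list than the three entries assumed here.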


Stress Mark, Word Form, Spanish Word, Speech Tagger, Spanish Text
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.





Copyright information

© Springer-Verlag Berlin Heidelberg 1999

Authors and Affiliations

  • S. N. Galicia-Haro (1)
  • I. A. Bolshakov (1)
  • A. F. Gelbukh (1)
  1. Laboratorio de Lenguaje Natural, C.I.C., I.P.N., Mexico City, Mexico
