Recognition of Named Entities in Spanish Texts

  • Sofía N. Galicia-Haro
  • Alexander Gelbukh
  • Igor A. Bolshakov
Part of the Lecture Notes in Computer Science book series (LNCS, volume 2972)


Proper name recognition is a subtask of Named Entity Recognition as defined in the Message Understanding Conference (MUC). For our corpus annotation, proper name recognition is a crucial task, since proper names appear in more than 50% of the sentences of the electronic texts that we collected for this purpose. Our work focuses on composite proper names: names with coordinated constituents, names with several prepositional phrases, and names of songs, books, movies, etc. We describe a method based on heterogeneous knowledge and simple resources, and we present the preliminary results obtained.


Quotation Mark, Entity Recognition, Prepositional Phrase, Punctuation Mark, Electronic Text
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.





Copyright information

© Springer-Verlag Berlin Heidelberg 2004

Authors and Affiliations

  • Sofía N. Galicia-Haro (1)
  • Alexander Gelbukh (2, 3)
  • Igor A. Bolshakov (2)

  1. Faculty of Sciences, UNAM, Ciudad Universitaria, Mexico City, Mexico
  2. Center for Computing Research, National Polytechnic Institute, Mexico City, Mexico
  3. Department of Computer Science and Engineering, Chung-Ang University, Seoul, Korea
