Identification of Composite Named Entities in a Spanish Textual Database

  • Sofía N. Galicia-Haro
  • Alexander Gelbukh
  • Igor A. Bolshakov
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 3136)


Named entities (NE) mentioned in textual databases constitute an important part of their semantics. Lists of those NE are an important knowledge source for diverse tasks. We present a method for NE identification focused on composite proper names (names with coordinated constituents and names with several prepositional phrases.) We describe a method based on heterogeneous knowledge and simple resources, and the preliminary obtained results.


Name Entity Recognition Entity Recognition Prepositional Phrase Punctuation Mark Name Entity 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Bolshakov, I.A., Gelbukh, A.F., Galicia-Haro, S.N.: Stable Coordinated Pairs in Text Processing. In: Matoušek, V., Mautner, P. (eds.) TSD 2003. LNCS (LNAI), vol. 2807, pp. 27–35. Springer, Heidelberg (2003)CrossRefGoogle Scholar
  2. 2.
    Borthwick et al.: Exploiting Diverse Knowledge Sources via Maximum Entropy in Named Entity Recognition. In: Proceedings of the Sixth Workshop on Very Large Corpora (1998)Google Scholar
  3. 3.
    Carreras, X., Márques, L., Padró, L.: Named Entity Extraction using AdaBoost. In: Proceedings of CoNLL 2002, Taipei, Taiwan, pp. 167–170 (2002)Google Scholar
  4. 4.
    Chinchor, N.: MUC-7 Named Entity Task Definition, version 3.5 (1997),
  5. 5.
    Friburger, N., Maurel, D.: Textual Similarity Based on Proper Names. In: Mathematical Formal Information Retrieval (MFIR 2002), pp. 155–167 (2002)Google Scholar
  6. 6.
    Krupka, G., Hausman, K.: Description of the NetOwl(TM) extractor system as used for MUC-7. In: Sixth Message Understanding Conference MUC-7 (1998)Google Scholar
  7. 7.
    Mikheev, A., Moens, M., Grover, C.: Named Entity Recognition without Gazetteers. In: Proceedings of the EACL (1999)Google Scholar
  8. 8.
    MUC: Proceedings of the Sixth Message Understanding Conference. (MUC-6). Morgan Kaufmann (1995) Google Scholar
  9. 9.
    Stevenson, M., Gaizauskas, R.: Using Corpus-derived Name List for name Entity Recognition. In: Proc. of ANLP, Seattle, pp. 290–295 (2000)Google Scholar
  10. 10.
    Tjong Kim Sang, E.F.: Introduction to the CoNLL-2002 Shared Task: Language- Independent Named Entity Recognition. In: Proceedings of CoNLL 2002, Taipei, Taiwan, pp. 155–158 (2002)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2004

Authors and Affiliations

  • Sofía N. Galicia-Haro
    • 1
  • Alexander Gelbukh
    • 2
    • 3
  • Igor A. Bolshakov
    • 2
  1. 1.Faculty of SciencesUNAM Ciudad Universitaria
  2. 2.Center for Computing ResearchNational Polytechnic InstituteMexico CityMexico
  3. 3.Department of Computer Science and EngineeringChung-Ang UniversitySeoulKorea

Personalised recommendations