Skip to main content

Identification of Composite Named Entities in a Spanish Textual Database

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 3136))

Abstract

Named entities (NE) mentioned in textual databases constitute an important part of their semantics. Lists of those NE are an important knowledge source for diverse tasks. We present a method for NE identification focused on composite proper names (names with coordinated constituents and names with several prepositional phrases.) We describe a method based on heterogeneous knowledge and simple resources, and the preliminary obtained results.

Work done under partial support of Mexican Government (CONACyT, SNI, COFAA-IPN), Korean Government (KIPA Professorship for Visiting Faculty Positions in Korea), and ITRI of CAU. The second author is currently on Sabbatical leave at Chung-Ang University.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Bolshakov, I.A., Gelbukh, A.F., Galicia-Haro, S.N.: Stable Coordinated Pairs in Text Processing. In: Matoušek, V., Mautner, P. (eds.) TSD 2003. LNCS (LNAI), vol. 2807, pp. 27–35. Springer, Heidelberg (2003)

    Chapter  Google Scholar 

  2. Borthwick et al.: Exploiting Diverse Knowledge Sources via Maximum Entropy in Named Entity Recognition. In: Proceedings of the Sixth Workshop on Very Large Corpora (1998)

    Google Scholar 

  3. Carreras, X., Márques, L., Padró, L.: Named Entity Extraction using AdaBoost. In: Proceedings of CoNLL 2002, Taipei, Taiwan, pp. 167–170 (2002)

    Google Scholar 

  4. Chinchor, N.: MUC-7 Named Entity Task Definition, version 3.5 (1997), http://www.itl.nist.gov/iaui/894.02/relatedprojects/muc/proceedings/muc7toc.html#appendices

  5. Friburger, N., Maurel, D.: Textual Similarity Based on Proper Names. In: Mathematical Formal Information Retrieval (MFIR 2002), pp. 155–167 (2002)

    Google Scholar 

  6. Krupka, G., Hausman, K.: Description of the NetOwl(TM) extractor system as used for MUC-7. In: Sixth Message Understanding Conference MUC-7 (1998)

    Google Scholar 

  7. Mikheev, A., Moens, M., Grover, C.: Named Entity Recognition without Gazetteers. In: Proceedings of the EACL (1999)

    Google Scholar 

  8. MUC: Proceedings of the Sixth Message Understanding Conference. (MUC-6). Morgan Kaufmann (1995)

    Google Scholar 

  9. Stevenson, M., Gaizauskas, R.: Using Corpus-derived Name List for name Entity Recognition. In: Proc. of ANLP, Seattle, pp. 290–295 (2000)

    Google Scholar 

  10. Tjong Kim Sang, E.F.: Introduction to the CoNLL-2002 Shared Task: Language- Independent Named Entity Recognition. In: Proceedings of CoNLL 2002, Taipei, Taiwan, pp. 155–158 (2002)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2004 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Galicia-Haro, S.N., Gelbukh, A., Bolshakov, I.A. (2004). Identification of Composite Named Entities in a Spanish Textual Database. In: Meziane, F., Métais, E. (eds) Natural Language Processing and Information Systems. NLDB 2004. Lecture Notes in Computer Science, vol 3136. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-27779-8_37

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-27779-8_37

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-22564-5

  • Online ISBN: 978-3-540-27779-8

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics