Abstract
Proper name recognition is a subtask of Name Entity Recognition in Message Understanding Conference. For our corpus annotation proper name recognition is a crucial task since proper names appear approximately in more than 50% of total sentences of the electronic texts that we collected for such purpose. Our work is focused on composite proper names (names with coordinated constituents, names with several prepositional phrases, and names of songs, books, movies, etc.) We describe a method based on heterogeneous knowledge and simple resources, and the preliminary obtained results.
Work done under partial support of Mexican Government (CONACyT, SNI, COFAA-IPN), Korean Government (KIPA Professorship for Visiting Faculty Positions in Korea), and ITRI of CAU. The second author is currently on Sabbatical leave at Chung-Ang University.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Bolshakov, I.A., Gelbukh, A.F., Galicia-Haro, S.N.: Stable Coordinated Pairs in Text Processing. In: Matoušek, V., Mautner, P. (eds.) TSD 2003. LNCS (LNAI), vol. 2807, pp. 27–35. Springer, Heidelberg (2003)
Borthwick, A., et al.: Exploiting Diverse Knowledge Sources via Maximum Entropy in Named Entity Recognition. In: Proceedings of the Sixth Workshop on Very Large Corpora (1998)
Carreras, X., Márques, L., Padró, L.: Named Entity Extraction using AdaBoost. In: Proceedings of CoNLL 2002, Taipei, Taiwan, pp. 167–170 (2002)
Chinchor N.: MUC-7 Named Entity Task Definition (version 3.5) (1997), http://www.itl.nist.gov/iaui/894.02/relatedprojects/muc/proceedings/muc7toc.html#appendices
Friburger, N., Maurel, D.: Textual Similarity Based on Proper Names. In: Mathematical Formal Information Retrieval (MFIR 2002), pp. 155–167 (2002)
Krupka, G., Hausman, K.: Description of the NetOwl (TM) extractor system as used for MUC-7. In: Sixth Message Understanding Conference MUC-7 (1998)
Mani, I., McMillian, R., Luperfoy, S., Lusher, E., Laskowski, S.: Identifying unknown proper names in newswire text. In: Pustejovsky, J., Boguraev, B. (eds.) Corpus processing for lexical acquisition, MIT Press, Cambridge (1996)
Mikheev, A.: A Knowledge-free Method for Capitalized Word Disambiguation. In: Proceedings of the 37th Annual Meeting of the Association for Computational Linguistics, pp. 159–166 (1999)
Mikheev, A.: Periods, Capitalized Words, etc. Computational Linguistics 28-3, 289–318 (2002)
Mikheev, A., Moens, M., Grover, C.: Named Entity Recognition without Gazetteers. In: Proceedings of the EACL (1999)
MUC: Proceedings of the Sixth Message Understanding Conference. (MUC-6). Morgan Kaufmann (1995)
Stevenson, M., Gaizauskas, R.: Using Corpus-derived Name List for name Entity Recognition In: Proc. of ANLP, Seattle, pp. 290–295 (2000)
Tjong Kim Sang, E.F.: Introduction to the CoNLL 2002 Shared Task: Language- Independent Named Entity Recognition. In: Proceedings of CoNLL 2002, Taipei, Taiwan, pp. 155–158 (2002)
Wakao, T., Gaizauskas, R., Wilks, Y.: Evaluation of an Algorithm for the Recognition and Classification of Proper Names. In: Proceedings of the 16th International Conference on Computational Linguistics (COLING 1996), Copenhagen, pp. 418–423 (1996)
Wilks, Y.: Information Extraction as a core language technology. In: Pazienza, M.T. (ed.) Information Extraction, Springer, Berlin (1997)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Galicia-Haro, S.N., Gelbukh, A., Bolshakov, I.A. (2004). Recognition of Named Entities in Spanish Texts. In: Monroy, R., Arroyo-Figueroa, G., Sucar, L.E., Sossa, H. (eds) MICAI 2004: Advances in Artificial Intelligence. MICAI 2004. Lecture Notes in Computer Science(), vol 2972. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24694-7_43
Download citation
DOI: https://doi.org/10.1007/978-3-540-24694-7_43
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-21459-5
Online ISBN: 978-3-540-24694-7
eBook Packages: Springer Book Archive