Advertisement

Person Name Discrimination in the Dossier–GPLSI at the University of Alicante

  • Isabel Moreno
  • Rubén Izquierdo
  • Paloma Moreda
Part of the Lecture Notes in Computer Science book series (LNCS, volume 6716)

Abstract

We present the Dossier–GPLSI, a system for the automatic generation of press dossiers for organizations. News are downloaded from online newspapers and are automatically classified. We describe specifically a module for the discrimination of person names. Three different approaches are analyzed and evaluated, each one using different kind of information, as semantic information, domain information and statistical evidence. We demonstrate that this module reaches a very good performance, and can be integrated in the Dossier–GPLSI system.

Keywords

Person Name Discrimination LSA Semantic Information WordNet Domains Statistical Evidence 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Fernádez, J., Gómez, J.M., Martinez-Barco, P.: Evaluación de sistemas de recuperación de información web sobre dominios restringidos. Journal Sociedad Española para el Procesamiento del Lenguaje Natural (SEPLN) 45, 273–276 (2010)Google Scholar
  2. 2.
    Gómez, J.M.: InTiMe: Plataforma de Integración de Recursos de PLN. Journal Sociedad Española para el Procesamiento del Lenguaje Natural (SEPLN) 40, 83–90 (2010)Google Scholar
  3. 3.
    Magnini, B., Cavaglia, G.: Integrating subject fields codes into WordNet. In: Proceedings of the Second International Conference on Language Resources and Evaluation (LREC 2000), Atenes (2000)Google Scholar
  4. 4.
    Kozareva, Z., Vázquez, S., Montoyo, A.: UA-ZSA: Web Page Clustering on the basis of Name Disambiguation. In: Proceedings of the 4th International Workshop on Semantic Evaluations (SemEval), Prague, Czech Republic, June 23-24, pp. 338–341 (2007)Google Scholar
  5. 5.
    Kozareva, Z., Vázquez, S., Montoyo, A.: The Influence of Context during the Categorization and Discrimination of Spanish and Portuguese Person Names. Journal Sociedad Española para el Procesamiento del Lenguaje Natural (SEPLN) 39, 81–88 (2007)Google Scholar
  6. 6.
    Fellbaum, C.: WordNet. An Electronic Lexical Database. MIT Press, Cambridge (1998)zbMATHGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2011

Authors and Affiliations

  • Isabel Moreno
    • 1
  • Rubén Izquierdo
    • 1
  • Paloma Moreda
    • 1
  1. 1.Natural Language Processing Research GroupUniversity of AlicanteAlicanteSpain

Personalised recommendations