IdentityRank: Named Entity Disambiguation in the Context of the NEWS Project

  • Norberto Fernández
  • José M. Blázquez
  • Luis Sánchez
  • Ansgar Bernardi
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4519)

Abstract

In this paper we introduce the IdentityRank algorithm, developed as part of the EU-funded project NEWS to address the problem of named entity disambiguation in the context of semantic annotation of news items. The algorithm provides a ranking of the candidate instances within an ontology which can be associated to a certain entity. In order to do so, it uses as context the metadata available in a certain news item. The algorithm has been evaluated with promising results.

References

  1. 1.
    Aswani, N., Bontcheva, K., Cunningham, H.: Mining Information for Instance Unification. In: Cruz, I., Decker, S., Allemang, D., Preist, C., Schwabe, D., Mika, P., Uschold, M., Aroyo, L.M. (eds.) ISWC 2006. LNCS, vol. 4273, pp. 329–342. Springer, Heidelberg (2006)CrossRefGoogle Scholar
  2. 2.
    Bagga, A., Baldwin, B.: Entity-Based Cross-Document Coreferencing Using the Vector Space Model. In: 17th International Conference on Computational Linguistics, Quebec, Canada (August 1998)Google Scholar
  3. 3.
    Fernández, N., Blázquez, J.M., Fisteus, J.A., Sánchez, L., Sintek, M., Bernardi, A., Fuentes, M., Marrara, A., Ben-Asher, Z.: NEWS: Bringing Semantic Web Technologies into News Agencies. In: Cruz, I., Decker, S., Allemang, D., Preist, C., Schwabe, D., Mika, P., Uschold, M., Aroyo, L.M. (eds.) ISWC 2006. LNCS, vol. 4273, pp. 778–791. Springer, Heidelberg (2006)CrossRefGoogle Scholar
  4. 4.
    Fernández, N., Sánchez, L., Blázquez, J.M., Villamor, J.: The NEWS Ontology for Professional Journalism Applications. In: Ontologies: A Handbook of Principles, Concepts and Applications in Information Systems. Integrated Series in Information Systems, vol. 14, Springer, Heidelberg (2007)Google Scholar
  5. 5.
    Ginter, F., Boberg, J., Ärvinen, J., Salakoski, T.: New Techniques for Disambiguation in Natural Language and their Applications to Biological Text. Journal of Machine Learning Research 5, 605–621 (2004)Google Scholar
  6. 6.
    Han, H., Giles, L., Zha, H., Li, C., Tsioutsiouliklis, K.: Two Supervised Learning Approaches for Name Disambiguation in Author Citations. In: Joint ACM/IEEE Conference on Digital Libraries, Tucson, USA (June 2004)Google Scholar
  7. 7.
    Hassell, J., Aleman-Meza, B., Arpinar, I.B.: Ontology-Driven Automatic Entity Disambiguation in Unstructured Text. In: Cruz, I., Decker, S., Allemang, D., Preist, C., Schwabe, D., Mika, P., Uschold, M., Aroyo, L.M. (eds.) ISWC 2006. LNCS, vol. 4273, pp. 44–57. Springer, Heidelberg (2006)CrossRefGoogle Scholar
  8. 8.
    Ide, N., Véronis, J.: Word Sense Disambiguation: The State of the Art. Computational Linguistics 24(1) (1998)Google Scholar
  9. 9.
    Mann, G.S., Yarowski, D.: Unsupervised Personal Name Disambiguation. In: 7th Conference on Natural Language Learning, Edmonton, Canada (June 2003)Google Scholar
  10. 10.
    Page, L., Brin., S., Motwani, R., Winograd, T.: The PageRank Citation Ranking: Bringing Order to the Web. Stanford Technical Report (1999), available online at http://dbpubs.stanford.edu/pub/1999-66
  11. 11.
    Pedersen, T., Purandare, A., Kulkarni, A.: Name Discrimination by Clustering Similar Contexts. In: Gelbukh, A. (ed.) CICLing 2005. LNCS, vol. 3406, pp. 226–237. Springer, Heidelberg (2005)Google Scholar

Copyright information

© Springer Berlin Heidelberg 2007

Authors and Affiliations

  • Norberto Fernández
    • 1
  • José M. Blázquez
    • 1
  • Luis Sánchez
    • 1
  • Ansgar Bernardi
    • 2
  1. 1.Carlos III University of Madrid, Leganés, MadridSpain
  2. 2.German Research Center for Artificial Intelligence, DFKI GmbH, KaiserslauternGermany

Personalised recommendations