Finding Entities in Wikipedia Using Links and Categories

  • Rianne Kaptein
  • Jaap Kamps
Part of the Lecture Notes in Computer Science book series (LNCS, volume 5631)


In this paper we describe our participation in the INEX Entity Ranking track, in which we explore the relations between Wikipedia pages, categories and links. Our approach exploits both category and link information. Category information is used by calculating distances between document categories and target categories. Link information is used for relevance propagation and as a document link prior. Both sources of information have value, but category information leads to the largest improvements.
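
As an illustration only, the following Python sketch shows one way a category distance and a document link prior could be combined with a text retrieval score. The abstract does not specify the distance measure, the form of the prior, or how the evidence is combined, so everything here is an assumption: the smoothed unigram distributions over category names, the use of KL divergence, the `log(1 + indegree)` prior, and the names `category_distance`, `entity_score`, `category_weight` and `link_weight` are all hypothetical. Relevance propagation over links is not covered.

```python
import math
from collections import Counter


def term_distribution(text, vocabulary):
    """Add-one smoothed unigram distribution of `text` over a fixed vocabulary."""
    counts = Counter(word for word in text.lower().split() if word in vocabulary)
    total = sum(counts.values()) + len(vocabulary)
    return {word: (counts[word] + 1) / total for word in vocabulary}


def kl_divergence(p, q):
    """KL divergence KL(p || q) for two distributions over the same keys."""
    return sum(p[word] * math.log(p[word] / q[word]) for word in p)


def category_distance(doc_categories, target_categories):
    """Smallest divergence between any target category and any document category.

    Assumption: each category is represented only by the words in its name.
    """
    if not doc_categories or not target_categories:
        return float("inf")  # no category evidence available
    texts = list(doc_categories) + list(target_categories)
    vocabulary = {word for text in texts for word in text.lower().split()}
    return min(
        kl_divergence(
            term_distribution(target, vocabulary),
            term_distribution(doc, vocabulary),
        )
        for target in target_categories
        for doc in doc_categories
    )


def entity_score(text_score, doc_categories, target_categories, indegree,
                 category_weight=1.0, link_weight=0.1):
    """Hypothetical combination: text score, minus category distance, plus link prior."""
    distance = category_distance(doc_categories, target_categories)
    link_prior = math.log(1 + indegree)  # assumed form of the document link prior
    return text_score - category_weight * distance + link_weight * link_prior


if __name__ == "__main__":
    targets = ["dutch painters"]
    on_topic = entity_score(-5.0, ["dutch painters"], targets, indegree=10)
    off_topic = entity_score(-5.0, ["rivers of france"], targets, indegree=10)
    print(on_topic > off_topic)
```

In this sketch a page whose categories match a target category has distance zero and so keeps its full text score, while pages in unrelated categories are penalised; the link prior then gives a small boost to pages with many incoming links.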





Copyright information

© Springer-Verlag Berlin Heidelberg 2009

Authors and Affiliations

  • Rianne Kaptein (1)
  • Jaap Kamps (1, 2)
  1. Archives and Information Studies, Faculty of Humanities, University of Amsterdam, Netherlands
  2. ISLA, Faculty of Science, University of Amsterdam, Netherlands
