A Vector Space Model for Ranking Entities and Its Application to Expert Search

  • Gianluca Demartini
  • Julien Gaugaz
  • Wolfgang Nejdl
Part of the Lecture Notes in Computer Science book series (LNCS, volume 5478)

Abstract

Entity Ranking has recently become an important search task in Information Retrieval. The goal is not to find documents matching query terms, but, instead, finding entities. In this paper we propose a formal model to search entities as well as a complete Entity Ranking system, providing examples of its application to the enterprise context. We experimentally evaluate our system on the Expert Search task in order to show how it can be adapted to different scenarios. The results show that combining simple IR techniques we improve of 53% in terms of P@10 over our baseline.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Anick, P.G., Tipirneni, S.: The Paraphrase Search Assistant: Terminological Feedback for Iterative Information Seeking. In: SIGIR, pp. 153–159 (1999)Google Scholar
  2. 2.
    Bailey, P., Craswell, N., Soboroff, I., de Vries, A.P.: The CSIRO enterprise search test collection. SIGIR Forum 41(2), 42–45 (2007)CrossRefGoogle Scholar
  3. 3.
    Balog, K., Azzopardi, L., de Rijke, M.: Formal models for expert finding in enterprise corpora. In: SIGIR, pp. 43–50 (2006)Google Scholar
  4. 4.
    Bast, H., Chitea, A., Suchanek, F., Weber, I.: ESTER: efficient search on text, entities, and relations. In: SIGIR, pp. 671–678 (2007)Google Scholar
  5. 5.
    Chakrabarti, S.: Dynamic personalized pagerank in entity-relation graphs. In: WWW, pp. 571–580 (2007)Google Scholar
  6. 6.
    Cheng, T., Chang, K.C.C.: Entity Search Engine: Towards Agile Best-Effort Information Integration over the Web. In: Proc. of CIDR 2007, pp. 108–113 (2007)Google Scholar
  7. 7.
    Cheng, T., Yan, X., Chang, K.C.-C.: EntityRank: Searching Entities Directly and Holistically. In: VLDB, pp. 387–398 (2007)Google Scholar
  8. 8.
    Chirita, P.A., Firan, C.S., Nejdl, W.: Summarizing Local Context to Personalize Global Web Search. In: CIKM, pp. 287–296 (2006)Google Scholar
  9. 9.
    Craswell, N., Hawking, D.: Overview of the TREC-2004 Web Track. In: The Thirteenth Text REtrieval Conference (TREC 2004) (2004)Google Scholar
  10. 10.
    Craswell, N., Hawking, D., Vercoustre, A., Wilkins, P.: P@noptic Expert: Searching for Experts not just for Documents. Ausweb (2001)Google Scholar
  11. 11.
    de Vries, A.P., Vercoustre, A.-M., Thom, J.A., Craswell, N., Lalmas, M.: Overview of the INEX 2007 Entity Ranking Track. In: INEX, pp. 245–251 (2007)Google Scholar
  12. 12.
    Deerwester, S., Dumais, S.T., Furnas, G.W., Landauer, T.K., Harshman, R.: Indexing by latent semantic analysis. JASIS 41(6), 391–407 (1990)CrossRefGoogle Scholar
  13. 13.
    Demartini, G., Firan, C.S., Iofciu, T., Krestel, R., Nejdl, W.: A Model for Ranking Entities and Its Application to Wikipedia. In: Proceedings of The Latin-American Web Conference (LA-WEB 2008) (2008)Google Scholar
  14. 14.
    Demartini, G., Firan, C.S., Iofciu, T., Nejdl, W.: Semantically Enhanced Entity Ranking. In: Bailey, J., Maier, D., Schewe, K.-D., Thalheim, B., Wang, X.S. (eds.) WISE 2008. LNCS, vol. 5175, pp. 176–188. Springer, Heidelberg (2008)CrossRefGoogle Scholar
  15. 15.
    Denoyer, L., Gallinari, P.: The Wikipedia XML corpus. SIGIR Forum 40(1), 64–69 (2006)CrossRefGoogle Scholar
  16. 16.
    Dumais, S.T.: Improving the retrieval of information from external sources. Behavior Research Methods, Instruments and Computers 23(2), 229–236 (1991)CrossRefGoogle Scholar
  17. 17.
    Macdonald, C., Ounis, I.: Voting for Candidates: Adapting Data Fusion Techniques for an Expert Search Task. In: CIKM, pp. 387–396 (2006)Google Scholar
  18. 18.
    Macdonald, C., Ounis, I.: Using relevance feedback in expert search. In: Amati, G., Carpineto, C., Romano, G. (eds.) ECIR 2007. LNCS, vol. 4425, pp. 431–443. Springer, Heidelberg (2007)CrossRefGoogle Scholar
  19. 19.
    Macdonald, C., Hannah, D., Ounis, I.: High quality expertise evidence for expert search. In: Macdonald, C., Ounis, I., Plachouras, V., Ruthven, I., White, R.W. (eds.) ECIR 2008. LNCS, vol. 4956, pp. 283–295. Springer, Heidelberg (2008)CrossRefGoogle Scholar
  20. 20.
    Manning, C.D., Raghavan, P., Schütze, H.: Introduction to Information Retrieval. Cambridge University Press, Cambridge (2008)CrossRefMATHGoogle Scholar
  21. 21.
    McLean, A., Vercoustre, A.M., Wu, M.: Enterprise PeopleFinder: Combining Evidence from Web Pages and Corporate Data. In: Proceedings of Australian Document Computing Symposium (2003)Google Scholar
  22. 22.
    Pehcevski, J., Vercoustre, A.-M., Thom, J.A.: Exploiting Locality of Wikipedia Links in Entity Ranking. In: Macdonald, C., Ounis, I., Plachouras, V., Ruthven, I., White, R.W. (eds.) ECIR 2008. LNCS, vol. 4956, pp. 258–269. Springer, Heidelberg (2008)CrossRefGoogle Scholar
  23. 23.
    Rölleke, T., Tsikrika, T., Kazai, G.: A general matrix framework for modelling Information Retrieval. Information Processing and Management 42(1), 4–30 (2006)CrossRefMATHGoogle Scholar
  24. 24.
    Salton, G., Wong, A., Yang, C.S.: A vector space model for automatic indexing. Communications of the ACM 18(11), 613–620 (1975)CrossRefMATHGoogle Scholar
  25. 25.
    Zirn, C., Nastase, V., Strube, M.: Distinguishing between Instances and Classes in the Wikipedia Taxonomy. In: Bechhofer, S., Hauswirth, M., Hoffmann, J., Koubarakis, M. (eds.) ESWC 2008. LNCS, vol. 5021, pp. 376–387. Springer, Heidelberg (2008)CrossRefGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2009

Authors and Affiliations

  • Gianluca Demartini
    • 1
  • Julien Gaugaz
    • 1
  • Wolfgang Nejdl
    • 1
  1. 1.L3S Research CenterLeibniz Universität HannoverGermany

Personalised recommendations