Associating People and Documents

  • Krisztian Balog
  • Maarten de Rijke
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4956)

Abstract

Since the introduction of the Enterprise Track at TREC in 2005, the task of finding experts has generated a lot of interest within the research community. Numerous models have been proposed that rank candidates by their level of expertise with respect to some topic. Common to all approaches is a component that estimates the strength of the association between a document and a person. Forming such associations, then, is a key ingredient in expertise search models. In this paper we introduce and compare a number of methods for building document-people associations. Moreover, we make underlying assumptions explicit, and examine two in detail: (i) independence of candidates, and (ii) frequency is an indication of strength. We show that our refined ways of estimating the strength of associations between people and documents leads to significant improvements over the state-of-the-art in the end-to-end expert finding task.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Balog, K., Azzopardi, L., de Rijke, M.: Formal models for expert finding in enterprise corpora. In: SIGIR 2006, New York, NY, USA, pp. 43–50 (2006)Google Scholar
  2. 2.
    Bao, S., Duan, H., Zhou, Q., Xiong, M., Cao, Y., Yu, Y.: Research on Expert Search at Enterprise Track of TREC 2006. In: The Fifteenth Text REtrieval Conference Proceedings (TREC 2006) (2007)Google Scholar
  3. 3.
    Cao, Y., Liu, J., Bao, S., Li, H.: Research on Expert Search at Enterprise Track of TREC 2005. In: The Fourteenth Text REtrieval Conference Proceedings (TREC 2005) (2006)Google Scholar
  4. 4.
    Craswell, N., de Vries, A.P., Soboroff, I.: Overview of the TREC-2005 Enterprise Track. In: The Fourteenth Text REtrieval Conference Proceedings (TREC 2005) (2006)Google Scholar
  5. 5.
    Fang, H., Zhai, C.: Probabilistic Models for Expert Finding. In: Amati, G., Carpineto, C., Romano, G. (eds.) ECIR 2007. LNCS, vol. 4425, pp. 418–430. Springer, Heidelberg (2007)CrossRefGoogle Scholar
  6. 6.
    Fu, Y., Yu, W., Li, Y., Liu, Y., Zhang, M.: THUIR at TREC 2005: Enterprise Track. In: The Fourteenth Text REtrieval Conference Proceedings (TREC 2005) (2006)Google Scholar
  7. 7.
    Macdonald, C., Ounis, I.: Voting for candidates: adapting data fusion techniques for an expert search task. In: CIKM 2006, pp. 387–396 (2006)Google Scholar
  8. 8.
    Macdonald, C., Plachouras, V., He, B., Ounis, I.: University of Glasgow at TREC2005: Experiments in Terabyte and Enterprise tracks with Terrier. In: Proceedings of the 14th Text REtrieval Conference (TREC 2005) (2005)Google Scholar
  9. 9.
    Petkova, D., Croft, W.B.: Hierarchical language models for expert finding in enterprise corpora. In: ICTAI 2006, pp. 599–608 (2006)Google Scholar
  10. 10.
    Soboroff, I., de Vries, A.P., Craswell, N.: Overview of the TREC 2006 Enterprise Track. In: TREC 2006 Working Notes (2006)Google Scholar
  11. 11.
  12. 12.
    Yimam-Seid, D., Kobsa, A.: Expert finding systems for organizations: Problem and domain analysis and the demoir approach. Journal of Organizational Computing and Electronic Commerce 13(1), 1–24 (2003)CrossRefGoogle Scholar
  13. 13.
    Zhai, C., Lafferty, J.: A study of smoothing methods for language models applied to ad hoc information retrieval. In: SIGIR 2001, pp. 334–342 (2001)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2008

Authors and Affiliations

  • Krisztian Balog
    • 1
  • Maarten de Rijke
    • 1
  1. 1.ISLA, University of Amsterdam 

Personalised recommendations