A Statistical View of Binned Retrieval Models

  • Donald Metzler
  • Trevor Strohman
  • W. Bruce Croft
Conference paper

DOI: 10.1007/978-3-540-78646-7_18

Part of the Lecture Notes in Computer Science book series (LNCS, volume 4956)
Cite this paper as:
Metzler D., Strohman T., Croft W.B. (2008) A Statistical View of Binned Retrieval Models. In: Macdonald C., Ounis I., Plachouras V., Ruthven I., White R.W. (eds) Advances in Information Retrieval. ECIR 2008. Lecture Notes in Computer Science, vol 4956. Springer, Berlin, Heidelberg

Abstract

Many traditional information retrieval models, such as BM25 and language modeling, give good retrieval effectiveness, but can be difficult to implement efficiently. Recently, document-centric impact models were developed in order to overcome some of these efficiency issues. However, such models have a number of problems, including poor effectiveness, and heuristic term weighting schemes. In this work, we present a statistical view of document-centric impact models. We describe how such models can be treated statistically and propose a supervised parameter estimation technique. We analyze various theoretical and practical aspects of the model and show that weights estimated using our new estimation technique are significantly better than the integer-based weights used in previous studies.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Copyright information

© Springer-Verlag Berlin Heidelberg 2008

Authors and Affiliations

  • Donald Metzler
    • 1
  • Trevor Strohman
    • 2
  • W. Bruce Croft
    • 2
  1. 1.Yahoo! ResearchSanta Clara 
  2. 2.University of MassachusettsAmherst 

Personalised recommendations