Skip to main content

Probabilistic Relevance Models Based on Document and Query Generation

  • Chapter
Language Modeling for Information Retrieval

Part of the book series: The Springer International Series on Information Retrieval ((INRE,volume 13))

Abstract

We give a unified account of the probabilistic semantics underlying the language modeling approach and the traditional probabilistic model for information retrieval, showing that the two approaches can be viewed as being equivalent probabilistically, since they are based on different factorizations of the same generative relevance model. We also discuss how the two approaches lead to different retrieval frameworks in practice, since they involve component models that are estimated quite differently.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  • Berger, A. and Lafferty, J. (1999). Information retrieval as statistical transla tion. In Proceedings of the 22nd International Conference on Research and Development in Information Retrieval (SIGIR’99), pages 222–229.

    Google Scholar 

  • Brin, S. and Page, L. (1998). Anatomy of a large-scale hypertextual web search engine. In Proceedings of the 7th International World Wide Web Conference.

    Google Scholar 

  • Brown, P. F., Cocke, J., Della Pietra, S. A., Della Pietra, V. J., Jelinek, F., Lafferty, J. D., Mercer, R. L., and Roossin, P. S. (1990). A statistical approach to machine translation. Computational Linguistics, 16 (2): 79–85.

    Google Scholar 

  • Croft, W. B. and Harper, D. (1979). Using probabilistic models of document retrieval without relevance information. Journal of Documentation, 35: 285295.

    Google Scholar 

  • Fuhr, N. (1992). Probabilistic models in information retrieval. Computer Journal, 35: 243–255.

    Article  MATH  Google Scholar 

  • Lavrenko, V. and Croft, W. B. (2001). Relevance-based language models. In 24th ACM SIGIR Conference on Research and Development in Information Retrieval.

    Google Scholar 

  • Maron, M. and Kuhns, J. (1960). On relevance, probabilistic indexing and information retrieval. Journal of the ACM, 7: 216–244.

    Article  Google Scholar 

  • Miller, D., Leek, T., and Schwartz, R. (1999). A hidden Markov model information retrieval system. In Proceedings of the 22nd International Conference on Research and Development in Information Retrieval (SIGIR’99), pages 214–221.

    Google Scholar 

  • Ponte, J. and Croft, W. B. (1998). A language modeling approach to information retrieval. In Proceedings of the 21st International Conference on Research and Development in Information Retrieval (SIGIR’98), pages 275–281

    Google Scholar 

  • Robertson, S. (1977). The probability ranking principle in IR. Journal of Documentation, 33: 294–304.

    Article  Google Scholar 

  • Robertson, S. and Sparck Jones, K. (1976). Relevance weighting of search terms. Journal of the American Society for Information Science, 27: 129–146.

    Article  Google Scholar 

  • Robertson, S. and Walker, S. (1997). On relevance weights with little relevance information. In Proceedings of SIGIR’97, pages 16–24.

    Google Scholar 

  • Robertson, S. E. (1994). Query-document symmetry and dual models. Journal of Documentation, 50 (3): 233–238.

    Article  Google Scholar 

  • Robertson, S. E., Maron, M. E., and Cooper, W. S. (1982). Probability of relevance: a unification of two competing models for information retrieval. Information Technology - Research and Development, 1:1–21.

    Google Scholar 

  • Sparck Jones, K., Walker, S., and Robertson, S. E. (2000). A probabilistic model of information retrieval: development and comparative experiments, Part 1. Information Processing and Management, 36: 779–808.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2003 Springer Science+Business Media Dordrecht

About this chapter

Cite this chapter

Lafferty, J., Zhai, C. (2003). Probabilistic Relevance Models Based on Document and Query Generation. In: Croft, W.B., Lafferty, J. (eds) Language Modeling for Information Retrieval. The Springer International Series on Information Retrieval, vol 13. Springer, Dordrecht. https://doi.org/10.1007/978-94-017-0171-6_1

Download citation

  • DOI: https://doi.org/10.1007/978-94-017-0171-6_1

  • Publisher Name: Springer, Dordrecht

  • Print ISBN: 978-90-481-6263-5

  • Online ISBN: 978-94-017-0171-6

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics