Revisiting Rocchio’s Relevance Feedback Algorithm for Probabilistic Models

  • Conference paper
Information Retrieval Technology (AIRS 2010)

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 6458)

Abstract

Rocchio’s relevance feedback method enhances the retrieval performance of the classical vector space model. However, its application to probabilistic models has not been adequately explored. In this paper, we revisit Rocchio’s algorithm by integrating this classical feedback method into the divergence from randomness (DFR) probabilistic framework for pseudo relevance feedback (PRF). We denote this integration RocDFR. We further improve RocDFR’s robustness by proposing a quality-biased feedback method, called QRocDFR. Extensive experiments on standard TREC test collections show that the proposed RocDFR and QRocDFR methods significantly outperform the relevance model (RM3), a representative feedback model in the language modeling framework. Moreover, QRocDFR considerably improves the robustness of RocDFR’s retrieval performance with respect to the size of the feedback document set.
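
For orientation, the classical Rocchio update that the abstract builds on can be sketched as below. This is a minimal illustration of the textbook positive-feedback form used in pseudo relevance feedback, not the RocDFR or QRocDFR formulation, whose term-weighting details are not given in the abstract. The function name `rocchio_prf`, its parameters, and the default values of `alpha`, `beta`, and `num_terms` are assumptions made for illustration only.

```python
from collections import Counter


def rocchio_prf(query_weights, feedback_docs, alpha=1.0, beta=0.75, num_terms=10):
    """Textbook Rocchio update with positive (pseudo-relevant) documents only.

    query_weights: dict mapping term -> weight in the original query.
    feedback_docs: list of dicts, each mapping term -> weight in one
                   top-ranked document (assumed relevant under PRF).
    Hypothetical helper for illustration; not the paper's RocDFR method.
    """
    if not feedback_docs:
        return dict(query_weights)

    # Centroid of the feedback documents: mean weight of each term.
    centroid = Counter()
    for doc in feedback_docs:
        for term, weight in doc.items():
            centroid[term] += weight / len(feedback_docs)

    # Keep only the highest-weighted centroid terms as expansion candidates
    # (ties broken alphabetically so the result is deterministic).
    ranked = sorted(centroid.items(), key=lambda kv: (-kv[1], kv[0]))
    top = dict(ranked[:num_terms])

    # New query = alpha * original query + beta * truncated centroid.
    expanded = {}
    for term in set(query_weights) | set(top):
        expanded[term] = (alpha * query_weights.get(term, 0.0)
                          + beta * top.get(term, 0.0))
    return expanded


if __name__ == "__main__":
    query = {"jaguar": 1.0}
    docs = [
        {"jaguar": 0.5, "cat": 0.4},
        {"jaguar": 0.3, "cat": 0.2, "car": 0.2},
    ]
    print(rocchio_prf(query, docs, num_terms=2))
```

In this sketch the number of feedback documents is simply `len(feedback_docs)`, which is the parameter whose effect on robustness the abstract discusses.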

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Ye, Z., He, B., Huang, X., Lin, H. (2010). Revisiting Rocchio’s Relevance Feedback Algorithm for Probabilistic Models. In: Cheng, PJ., Kan, MY., Lam, W., Nakov, P. (eds) Information Retrieval Technology. AIRS 2010. Lecture Notes in Computer Science, vol 6458. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-17187-1_14

  • DOI: https://doi.org/10.1007/978-3-642-17187-1_14

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-17186-4

  • Online ISBN: 978-3-642-17187-1

  • eBook Packages: Computer Science, Computer Science (R0)
