Abstract
Rocchio’s relevance feedback method enhances the retrieval performance of the classical vector space model. However, its application to the probabilistic models is not adequately explored. In this paper, we revisit Rocchio’s algorithm by proposing to integrate this classical feedback method into the divergence from randomness (DFR) probabilistic framework for pseudo relevance feedback (PRF). Such an integration is denoted by RocDFR in this paper. In addition, we further improve RocDFR’s robustness by proposing a quality-biased feedback method, called QRocDFR. Extensive experiments on standard TREC test collections show that our proposed RocDFR and QRocDFR methods significantly outperform the relevance model (RM3), which is a representative feedback model in the language modeling framework. Moreover, the QRocDFR method considerably improves the robustness of RocDFR’s retrieval performance with respect to the size of feedback document set.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Amati, G.: Probabilistic models for information retrieval based on divergence from randomness. Ph.D. thesis, Department of Computing Science, University of Glasgow (2003)
Amati, G., Ambrosi, E., Bianchi, M., Gaibisso, C., Gambosi, G.: FUB, IASI-CNR and University of Tor Vergata at TREC 2007 blog track. In: Voorhees, E.M., Buckland, L.P. (eds.) TREC. National Institute of Standards and Technology (NIST), vol. Special Publication 500–274 (2007)
Carpineto, C., de Mori, R., Romano, G., Bigi, B.: An information-theoretic approach to automatic query expansion. ACM Trans. Inf. Syst. 19(1), 1–27 (2001)
Diaz, F., Metzler, D.: Improving the estimation of relevance models using large external corpora. In: Efthimiadis, E.N., Dumais, S.T., Hawking, D., Järvelin, K. (eds.) SIGIR, pp. 154–161. ACM, New York (2006)
He, B.: Query expansion models. In: Encyclopedia of Database Systems, pp. 2257–2260 (2009)
He, B., Macdonald, C., Ounis, I., Peng, J., Santos, R.L.T.: University of glasgow at trec 2008: Experiments in blog, enterprise, and relevance feedback tracks with terrier. In: Voorhees, E.M., Buckland, L.P. (eds.) TREC. National Institute of Standards and Technology (NIST), Special Publication 500–277 (2008)
Lavrenko, V., Croft, W.B.: Relevance-based language models. In: Croft, W.B., Harper, D.J., Kraft, D.H., Zobel, J. (eds.) SIGIR, pp. 120–127. ACM, New York (2001)
Lv, Y., Zhai, C.: A comparative study of methods for estimating query language models with pseudo feedback. In: CIKM, pp. 1895–1898 (2009)
Ounis, I., Amati, G., Plachouras, V., He, B., Macdonald, C., Johnson, D.: Terrier information retrieval platform. In: Losada, D.E., Fernández-Luna, J.M. (eds.) ECIR 2005. LNCS, vol. 3408, pp. 517–519. Springer, Heidelberg (2005)
Ounis, I., Amati, G., Plachouras, V., He, B., Macdonald, C., Lioma, C.: Terrier: A high performance and scalable information retrieval platform. In: Proceedings of ACM SIGIR 2006 Workshop on Open Source Information Retrieval, OSIR 2006 (2006)
Ponte, J.M., Croft, W.B.: A language modeling approach to information retrieval. In: SIGIR, pp. 275–281. ACM, New York (1998)
Robertson, S.E.: On term selection for query expansion. Journal of Documentation 46(4), 359–364 (1990)
Robertson, S.E., Walker, S., Hancock-Beaulieu, M., Gatford, M., Payne, A.: Okapi at TREC-4. In: TREC, pp. 73–97 (1995)
Rocchio, J.: Relevance feedback in information retrieval, pp. 313–323. Prentice-Hall, Englewood Cliffs (1971)
Salton, G., Buckley, C.: Improving retrieval performance by relevance feedback. Journal of the American Society for Information Science 41, 288–297 (1990)
Zhai, C., Lafferty, J.D.: Model-based feedback in the language modeling approach to information retrieval. In: CIKM, pp. 403–410 (2001)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Ye, Z., He, B., Huang, X., Lin, H. (2010). Revisiting Rocchio’s Relevance Feedback Algorithm for Probabilistic Models. In: Cheng, PJ., Kan, MY., Lam, W., Nakov, P. (eds) Information Retrieval Technology. AIRS 2010. Lecture Notes in Computer Science, vol 6458. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-17187-1_14
Download citation
DOI: https://doi.org/10.1007/978-3-642-17187-1_14
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-17186-4
Online ISBN: 978-3-642-17187-1
eBook Packages: Computer ScienceComputer Science (R0)