Abstract
This paper presents a new information retrieval method based on language modeling with relative entropy and feedback. The method estimates a language model for the query and one for each document, then ranks the documents by the relative entropy of each estimated document language model with respect to the estimated query language model. The query model is estimated from feedback documents under the assumption that they are generated by a mixture model with two components: the feedback document language model and the collection language model. Experimental results show that the method uses feedback documents effectively and performs better than the basic language modeling approach. The results also indicate that its performance is sensitive to both the smoothing parameters and the interpolation coefficients used to estimate the language models.
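The pipeline summarized above can be illustrated with a minimal sketch. It is not the authors' implementation, and several choices are assumptions that the abstract does not specify: unigram models, Jelinek-Mercer smoothing with weight `lam`, the divergence direction D(query || document), an EM fit of the two-component mixture with a fixed collection weight `noise`, and linear interpolation of the original query model with the feedback model using coefficient `alpha`. All function and parameter names are hypothetical.

```python
import math
from collections import Counter

def mle(tokens):
    """Maximum-likelihood unigram model: word -> relative frequency."""
    counts = Counter(tokens)
    total = sum(counts.values())
    return {w: c / total for w, c in counts.items()}

def smooth(model, collection, lam):
    """Jelinek-Mercer smoothing (assumed): (1 - lam) * model + lam * collection."""
    return {w: (1 - lam) * model.get(w, 0.0) + lam * p
            for w, p in collection.items()}

def kl_divergence(query_model, doc_model):
    """Relative entropy D(query || doc); words outside the collection are skipped."""
    return sum(p * math.log(p / doc_model[w])
               for w, p in query_model.items()
               if p > 0 and w in doc_model)

def feedback_model(feedback_tokens, collection, noise=0.5, iters=50):
    """EM fit of theta_F in the mixture (1 - noise) * theta_F + noise * collection."""
    counts = Counter(feedback_tokens)
    theta = mle(feedback_tokens)
    for _ in range(iters):
        weighted = {}
        for w, c in counts.items():
            topical = (1 - noise) * theta[w]
            background = noise * collection.get(w, 0.0)
            # E-step: posterior that an occurrence of w came from theta_F,
            # multiplied by the count of w for the M-step.
            weighted[w] = c * topical / (topical + background)
        # M-step: renormalize the expected counts.
        norm = sum(weighted.values())
        theta = {w: v / norm for w, v in weighted.items()}
    return theta

def rank(query_tokens, docs, feedback_tokens=None, lam=0.5, alpha=0.5):
    """Return document names ordered by increasing divergence from the query model."""
    collection = mle([w for toks in docs.values() for w in toks])
    q = mle(query_tokens)
    if feedback_tokens:
        f = feedback_model(feedback_tokens, collection)
        # Interpolate the original query model with the feedback model (assumed).
        q = {w: (1 - alpha) * q.get(w, 0.0) + alpha * f.get(w, 0.0)
             for w in set(q) | set(f)}
    scores = {name: kl_divergence(q, smooth(mle(toks), collection, lam))
              for name, toks in docs.items()}
    return sorted(scores, key=scores.get)

# Toy data, invented purely for illustration.
docs = {
    "d1": ["language", "model", "retrieval", "entropy"],
    "d2": ["cooking", "recipe", "pasta", "sauce"],
    "d3": ["language", "model", "feedback", "query"],
}
query = ["language", "model"]
print(rank(query, docs))
print(rank(query, docs, feedback_tokens=docs["d3"]))
```

In this sketch `lam` plays the role of a smoothing parameter and `alpha` the role of an interpolation coefficient, so varying them is one way to explore the kind of sensitivity the abstract reports.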
This research is supported by the Natural Science Foundation Program of the Henan Provincial Educational Department in China (200410464004) and the Science Research Foundation Program of Henan University of Science and Technology in China (2004ZY041).
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Huo, H., Feng, B. (2005). Retrieval Based on Language Model with Relative Entropy and Feedback. In: Ho, T.B., Cheung, D., Liu, H. (eds) Advances in Knowledge Discovery and Data Mining. PAKDD 2005. Lecture Notes in Computer Science, vol 3518. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11430919_35
DOI: https://doi.org/10.1007/11430919_35
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-26076-9
Online ISBN: 978-3-540-31935-1
eBook Packages: Computer Science, Computer Science (R0)