Abstract
Query logs record past query sessions across a time span. A statistical model is proposed to explain the log generation process. Within a search engine list of results, the model explains the document selection – a user’s click – by taking into account both a document position and its popularity. We show that it is possible to quantify this influence and consequently estimate document “un-biased” popularities. Among other applications, this allows to re-order the result list to match more closely user preferences and to use the logs as a feedback to improve search engines.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Baeza-Yates, R., Saint-Jean, F.: A three level search engine index based in query log distribution. In: Nascimento, M.A., de Moura, E.S., Oliveira, A.L. (eds.) SPIRE 2003. LNCS, vol. 2857, pp. 56–65. Springer, Heidelberg (2003)
Dupret, G., Mendoza, M.: Recommending better queries from click-through data. In: Consens, M.P., Navarro, G. (eds.) SPIRE 2005. LNCS, vol. 3772, pp. 41–44. Springer, Heidelberg (2005)
Joachims, T.: Evaluating search engines using clickthrough data. Department of Computer Science, Cornell University (2002)
Joachims, T.: Optimizing search engines using clickthrough data. In: KDD 2002: Proceedings of the eighth ACM SIGKDD, pp. 133–142. ACM Press, New York (2002)
Radlinski, F., Joachims, T.: Query chains: learning to rank from implicit feedback. In: KDD 2005: Proceeding of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining, pp. 239–248. ACM Press, New York (2005)
Ribeiro-Neto, B.A., Muntz, R.: A belief network model for IR. In: SIGIR 1996: Proceedings of the 19th annual international ACM SIGIR conference on Research and development in information retrieval, pp. 253–260. ACM Press, New York (1996)
Robertson, S.E., Jones, K.S.: Relevance weighting of search terms. Taylor Graham Publishing, London (1988)
Joachims, T.: Unbiased evaluation of retrieval quality using clickthrough data. Technical report, Cornell University, Department of Computer Science (2002), http://www.joachims.org
Wen, J.-R., Nie, J.-Y., Zhang, H.-J.: Clustering user queries of a search engine. In: WWW 2001: Proceedings of the 10th international conference on World Wide Web, pp. 162–168. ACM Press, New York (2001)
Zaïane, O.R., Strilets, A.: Finding similar queries to satisfy searches based on query traces. In: Proceedings of the International Workshop on Efficient Web-Based Information Systems (EWIS), Montpellier, France (September 2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Dupret, G., Piwowarski, B., Hurtado, C., Mendoza, M. (2006). A Statistical Model of Query Log Generation. In: Crestani, F., Ferragina, P., Sanderson, M. (eds) String Processing and Information Retrieval. SPIRE 2006. Lecture Notes in Computer Science, vol 4209. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11880561_18
Download citation
DOI: https://doi.org/10.1007/11880561_18
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-45774-9
Online ISBN: 978-3-540-45775-6
eBook Packages: Computer ScienceComputer Science (R0)