Abstract
A number of explicit and implicit feedback mechanisms have been proposed to improve the quality of the search engine results. The current approaches to information retrieval depends heavily on the web linkage structure which is a form of relevance judgment by the page authors. However, to overcome spamming attempts and the huge volumes of data, it is important to also incorporate the user feedback on the page relevance of a document. Since users hardly give explicit/direct feedback on search quality, it becomes necessary to consider implicit feedback that can be collected from search engine logs. In this article we evaluate two implicit feedback measures, namely click sequence and time spent in reading a document. We develop a mathematical programming model to collate the feedback collected from different sessions into a partial rank ordering of documents. The two implicit feedback measures, namely the click sequence and time spent in reading a document are compared for their feedback information content using Kendall’s τ measure. Experimental results based on actual log data from AlltheWeb.com demonstrate that these two relevance judgment measures are not in perfect aggrement and hence incremental information can be derived from them.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
http://news.netcraft.com/archives/web_server_survey.html (May 2007)
Yuwono, B., Lee, D.L.: Search and ranking algorithms for locating resources on the world wide web. In: ICDE, pp. 164–171 (1996)
Arasu, A., Cho, J., Garcia-Molina, H., Paepcke, A., Raghavan, S.: Searching the web. ACM Trans. Inter. Tech. 1(1), 2–43 (2001)
Brin, S., Page, L.: The anatomy of a large-scale hypertextual web search engine. Computer Networks and ISDN Systems 30(1-7), 107–117 (1998)
Kleinberg, J.M.: Authoritative sources in a hyperlinked environment. J. ACM 46(5), 604–632 (1999)
Salton, G., McGill, M.J.: Introduction to Modern Information Retrieval. McGraw-Hill, New York (1983)
Frakes, W.B., Baeza-Yates, R.: Information retrieval: Data structures and algorithms. Prentice-Hall, Englewood Cliffs (1992)
Goecks, J., Shavlik, J.: Learning users’ interests by unobtrusively observing their normal behavior. In: Proceedings of the 5th international conference on Intelligent user interfaces, pp. 129–132 (2000)
White, R., Ruthven, I., Jose, J.M.: The use of implicit evidence for relevance feedback in web retrieval. In: Proceedings of the 24th BCS-IRSG European Colloquium on IR Research, pp. 93–109. Springer, Heidelberg (2002)
Claypool, M., Le, P., Wased, M., Brown, D.: Implicit interest indicators. In: IUI 2001. Proceedings of the 6th international conference on Intelligent user interfaces, pp. 33–40 (2001)
Kelly, D., Teevan, J.: Implicit feedback for inferring user preference: a bibliography. SIGIR Forum 37(2), 18–28 (2003)
Sriram, S., Shen, X., Zhai, C.: A session-based search engine. In: ACM SIGIR, pp. 492–493 (2004)
Beeferman, D., Berger, A.: Agglomerative clustering of a search engine query log. In: ACM SIGKDD, pp. 407–416 (2000)
Jones, R., Rey, B., Madani, O., Greiner, W.: Generating query substitutions. In: WWW 2006. Proceedings of the 15th international conference on World Wide Web, pp. 387–396 (2006)
Radlinski, F., Joachims, T.: Query chains: learning to rank from implicit feedback. In: ACM SIGKDD, pp. 239–248 (2005)
Kim, J., Oard, D., Romanik, K.: Using implicit feedback for user modeling in internet and intranet searching. Technical report, College of Library and Information Services, University of Maryland at College Park (2000)
Kelly, D., Belkin, N.J.: Display time as implicit feedback: understanding task effects. In: ACM SIGIR, pp. 377–384 (2004)
Joachims, T.: Optimizing search engines using clickthrough data. In: ACM SIGKDD, pp. 133–142 (2002)
Tan, Q., Chai, X., Ng, W., Lee, D.L.: Applying co-training to clickthrough data for search engine adaptation, pp. 519–532 (2004)
Ding, C., Chi, C.H.: Towards an adaptive and task-specific ranking mechanism in web searching (poster session). In: ACM SIGIR, pp. 375–376 (2000)
Ramachandran, P.: Discovering user preferences by using time entries in click-through data to improve search engine results. In: Hoffmann, A., Motoda, H., Scheffer, T. (eds.) DS 2005. LNCS (LNAI), vol. 3735, pp. 383–385. Springer, Heidelberg (2005)
Jansen, B.J., Spink, A.: An analysis of web searching by european alltheweb.com users. Inf. Process. Manage. 41(2), 361–381 (2005)
Kendall, M.: A new measure of rank correlation. Biometrika 30(1/2), 81–93 (1938)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Veilumuthu, A., Ramachandran, P. (2007). Discovering Implicit Feedbacks from Search Engine Log Files. In: Corruble, V., Takeda, M., Suzuki, E. (eds) Discovery Science. DS 2007. Lecture Notes in Computer Science(), vol 4755. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-75488-6_22
Download citation
DOI: https://doi.org/10.1007/978-3-540-75488-6_22
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-75487-9
Online ISBN: 978-3-540-75488-6
eBook Packages: Computer ScienceComputer Science (R0)