Analysing Web Search Logs to Determine Session Boundaries for User-Oriented Learning
Incremental learning approaches based on user search activities provide a means of building adaptive information retrieval systems. To develop more effective user-oriented learning techniques for the Web, we need to be able to identify a meaningful session unit from which we can learn. Without this, we run a high risk of grouping together activities that are unrelated or perhaps not from the same user. We are interested in detecting boundaries of sequences between related activities (sessions) that would group the activities for a learning purpose. Session boundaries, in Reuters transaction logs, were detected automatically. The generated boundaries were compared with human judgements. The comparison confirmed that a meaningful session threshold for establishing these session boundaries was confined to a 11-15 minute range.
KeywordsHuman Judgement User Search Learning Purpose Minute Range Session Interval
Unable to display preview. Download preview PDF.
- Balabanovic M., Shoham Y., and Yun Y.: An Adaptive Agent for Automated Web Browsing. Tech. Rep. CS-TN-97-52, Dept. of Comp. Sci., Stanford University (1997)Google Scholar
- Catledge L. and Pitkow J.: Characterizing Browsing Strategies in the World-Wide Web. In 3rd International World-Wide Web Conference (1995) http://www.igd.fhg.de/archive/1995www95/papers/
- He D. and Goker A.: Detecting session boundaries from Web user logs. In 22nd Annual Colloquium on IR Research IRSG 2000, Cambridge, UK (2000) 57–66Google Scholar
- Joachims T., Freitag D., and Mitchell T.: WebWatcher: A Tour Guide for the World Wide Web. In Proceedings of IJCAI97 (1997) 770–775Google Scholar