Usage Data in Web Search: Benefits and Limitations

  • Ricardo Baeza-Yates
  • Yoelle Maarek
Conference paper

DOI: 10.1007/978-3-642-31235-9_33

Part of the Lecture Notes in Computer Science book series (LNCS, volume 7338)
Cite this paper as:
Baeza-Yates R., Maarek Y. (2012) Usage Data in Web Search: Benefits and Limitations. In: Ailamaki A., Bowers S. (eds) Scientific and Statistical Database Management. SSDBM 2012. Lecture Notes in Computer Science, vol 7338. Springer, Berlin, Heidelberg


Web Search, which takes its root in the mature field of information retrieval, evolved tremendously over the last 20 years. The field encountered its first revolution when it started to deal with huge amounts of Web pages. Then, a major step was accomplished when engines started to consider the structure of the Web graph and link analysis became a differentiator in both crawling and ranking. Finally, a more discrete, but not less critical step, was made when search engines started to monitor and mine the numerous (mostly implicit) signals provided by users while interacting with the search engine. We focus here on this third “revolution” of large scale usage data. We detail the different shapes it takes, illustrating its benefits through a review of some winning search features that could not have been possible without it. We also discuss its limitations and how in some cases it even conflicts with some natural users’ aspirations such as personalization and privacy. We conclude by discussing how some of these conflicts can be circumvented by using adequate aggregation principles to create “ad hoc”crowds.


Web search usage data wisdom of crowds large scale data mining privacy personalization long tail 


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Copyright information

© Springer-Verlag Berlin Heidelberg 2012

Authors and Affiliations

  • Ricardo Baeza-Yates
    • 1
  • Yoelle Maarek
    • 2
  1. 1.Yahoo! ResearchBarcelonaSpain
  2. 2.Yahoo! ResearchHaifaIsrael

Personalised recommendations