Adaptive Time-to-Live Strategies for Query Result Caching in Web Search Engines

  • Sadiye Alici
  • Ismail Sengor Altingovde
  • Rifat Ozcan
  • B. Barla Cambazoglu
  • Özgür Ulusoy
Part of the Lecture Notes in Computer Science book series (LNCS, volume 7224)

Abstract

An important research problem that has recently started to receive attention is the freshness of search engine result caches. In current techniques in the literature, cached search result pages are associated with a fixed time-to-live (TTL) value in order to bound the staleness of the results presented to users, potentially as part of a more complex cache refresh or invalidation mechanism. In this paper, we propose techniques where the TTL values are set adaptively, on a per-query basis. Our results show that, compared to a strategy using a fixed TTL value for all queries, the proposed techniques reduce the fraction of stale results served by the cache and also decrease the fraction of redundant query evaluations on the search engine backend.
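
The abstract does not spell out how the per-query TTL values are chosen, so the following is only a minimal sketch of one plausible adaptive policy, not the authors' method. It assumes an incremental rule: when an expired entry is re-evaluated and the results are unchanged, the query's TTL grows by a fixed step; when the results differ, the TTL is reset to its minimum. The class name `AdaptiveTTLCache`, its parameters (`min_ttl`, `max_ttl`, `step`), and the `backend` callable are all hypothetical names introduced for illustration.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class CacheEntry:
    results: List[str]   # cached result list for the query
    cached_at: float     # time at which the results were computed
    ttl: float           # current per-query time-to-live


class AdaptiveTTLCache:
    """Illustrative per-query adaptive TTL cache (assumed policy, see lead-in)."""

    def __init__(self, backend: Callable[[str], List[str]],
                 min_ttl: float = 1.0, max_ttl: float = 16.0,
                 step: float = 1.0) -> None:
        self.backend = backend          # hypothetical query evaluator
        self.min_ttl = min_ttl
        self.max_ttl = max_ttl
        self.step = step
        self.entries: Dict[str, CacheEntry] = {}
        self.backend_calls = 0          # number of backend evaluations

    def get(self, query: str, now: float) -> List[str]:
        entry = self.entries.get(query)
        # Serve from cache while the entry has not expired (it may be stale).
        if entry is not None and now - entry.cached_at < entry.ttl:
            return entry.results

        # Miss or expired entry: evaluate the query at the backend.
        fresh = self.backend(query)
        self.backend_calls += 1

        if entry is None:
            ttl = self.min_ttl                            # first sighting
        elif fresh == entry.results:
            ttl = min(entry.ttl + self.step, self.max_ttl)  # stable: grow TTL
        else:
            ttl = self.min_ttl                            # changed: reset TTL

        self.entries[query] = CacheEntry(fresh, now, ttl)
        return fresh
```

Under this assumed rule, a query whose results rarely change accumulates a long TTL and triggers fewer redundant backend evaluations, while a query whose results change often keeps a short TTL and serves fewer stale results. A fixed-TTL baseline corresponds to setting `step` to 0 and `min_ttl` equal to `max_ttl`.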

Keywords

Search engines, result cache, freshness, time-to-live

Copyright information

© Springer-Verlag Berlin Heidelberg 2012

Authors and Affiliations

  • Sadiye Alici (1)
  • Ismail Sengor Altingovde (2)
  • Rifat Ozcan (1)
  • B. Barla Cambazoglu (3)
  • Özgür Ulusoy (1)

  1. Computer Engineering Department, Bilkent University, Ankara, Turkey
  2. L3S Research Center, Hanover, Germany
  3. Yahoo! Research, Barcelona, Spain
