Advertisement

A New Static Web Caching Mechanism Based on Mutual Dependency Between Result Cache and Posting List Cache

  • Thanh Trinh
  • Dingming WuEmail author
  • Joshua Zhexue Huang
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 10570)

Abstract

Caching is an important optimization technique in search engine architectures. There exist various types of caches, such as result cache, posting list cache, intersection cache, snippet cache, and document cache. However, these caching techniques are studied separately. Although several multiple level caches that integrate different types of caches have been proposed, the relationships among different caches are ignored. In this paper, we study the mutual dependency between the result cache and the posting list cache via empirical experiments and observe duplicate hits in the two types of caches. In order to better utilize the cache space and increase the hit ratio, three algorithms are proposed to implement a static cache mechanism based on the mutual dependency between the result cache and the posting list cache. A series of experiments were conducted on a real data set and the results have demonstrated the improvement of the hit ratio of our proposals.

References

  1. 1.
    Altingovde, I.S., Ozcan, R., Ulusoy, Ö.: A cost-aware strategy for query result caching in web search engines. In: Boughanem, M., Berrut, C., Mothe, J., Soule-Dupuy, C. (eds.) ECIR 2009. LNCS, vol. 5478, pp. 628–636. Springer, Heidelberg (2009). doi: 10.1007/978-3-642-00958-7_59CrossRefGoogle Scholar
  2. 2.
    Baeza-Yates, R.A., Gionis, A., Junqueira, F., Murdock, V., Plachouras, V., Silvestri, F.: The impact of caching on search engines. In: SIGIR, pp. 183–190 (2007)Google Scholar
  3. 3.
    Baeza-Yates, R.A., Gionis, A., Junqueira, F., Murdock, V., Plachouras, V., Silvestri, F.: Design trade-offs for search engine caching. TWEB 2(4), 20:1–20:28 (2008)CrossRefGoogle Scholar
  4. 4.
    Baeza-Yates, R., Jonassen, S.: Modeling static caching in web search engines. In: Baeza-Yates, R., Vries, A.P., Zaragoza, H., Cambazoglu, B.B., Murdock, V., Lempel, R., Silvestri, F. (eds.) ECIR 2012. LNCS, vol. 7224, pp. 436–446. Springer, Heidelberg (2012). doi: 10.1007/978-3-642-28997-2_37CrossRefGoogle Scholar
  5. 5.
    Ceccarelli, D., Lucchese, C., Orlando, S., Perego, R., Silvestri, F.: Caching query-biased snippets for efficient retrieval. In: EDBT, pp. 93–104 (2011)Google Scholar
  6. 6.
    Fagni, T., Perego, R., Silvestri, F., Orlando, S.: Boosting the performance of web search engines: caching and prefetching query results by exploiting historical usage data. ACM Trans. Inf. Syst. 24(1), 51–78 (2006)CrossRefGoogle Scholar
  7. 7.
    Gan, Q., Suel, T.: Improved techniques for result caching in web search engines. In: WWW, pp. 431–440 (2009)Google Scholar
  8. 8.
    Long, X., Suel, T.: Three-level caching for efficient query processing in large web search engines. World Wide Web 9(4), 369–395 (2006)CrossRefGoogle Scholar
  9. 9.
    Marín, M., Costa, V.G., Gómez-Pantoja, C.: New caching techniques for web search engines. In: HPDC, pp. 215–226 (2010)Google Scholar
  10. 10.
    Markatos, E.P.: On caching search engine query results. Comput. Commun. 24(2), 137–143 (2001)CrossRefGoogle Scholar
  11. 11.
    Ozcan, R., Altingövde, I.S., Cambazoglu, B.B., Junqueira, F.P., Ulusoy, Ö.: A five-level static cache architecture for web search engines. Inf. Process. Manage. 48(5), 828–840 (2012)CrossRefGoogle Scholar
  12. 12.
    Ozcan, R., Altingövde, I.S., Cambazoglu, B.B., Ulusoy, Ö.: Second chance: a hybrid approach for dynamic result caching and prefetching in search engines. TWEB 8(1), 3:1–3:22 (2013)CrossRefGoogle Scholar
  13. 13.
    Ozcan, R., Altingövde, I.S., Ulusoy, Ö.: Static query result caching revisited. In: WWW, pp. 1169–1170 (2008)Google Scholar
  14. 14.
    Ozcan, R., Altingövde, I.S., Ulusoy, Ö.: Cost-aware strategies for query result caching in web search engines. TWEB 5(2), 9:1–9:25 (2011)CrossRefGoogle Scholar
  15. 15.
    Podlipnig, S., Böszörményi, L.: A survey of web cache replacement strategies. ACM Comput. Surv. 35(4), 374–398 (2003)CrossRefGoogle Scholar
  16. 16.
    Saraiva, P.C., de Moura, E.S., Fonseca, R.C., Meira, W.Jr., Ribeiro-Neto, B.A., Ziviani, N.: Rank-preserving two-level caching for scalable search engines. In: SIGIR, pp. 51–58 (2001)Google Scholar
  17. 17.
    Tolosa, G., Becchetti, L., Feuerstein, E., Marchetti-Spaccamela, A.: Performance improvements for search systems using an integrated cache of Lists+Intersections. In: Moura, E., Crochemore, M. (eds.) SPIRE 2014. LNCS, vol. 8799, pp. 227–235. Springer, Cham (2014). doi: 10.1007/978-3-319-11918-2_22CrossRefGoogle Scholar
  18. 18.
    Zhang, J., Long, X., Suel, T.: Performance of compressed inverted list caching in search engines. In: WWW, pp. 387–396 (2008)Google Scholar
  19. 19.
    Zhou, W., Li, R., Dong, X., Xu, Z., Xiao, W.: An intersection cache based on frequent itemset mining in large scale search engines. In: IEEE Workshop on Hot Topics in Web Systems and Technologies, pp. 19–24 (2015)Google Scholar

Copyright information

© Springer International Publishing AG 2017

Authors and Affiliations

  • Thanh Trinh
    • 1
  • Dingming Wu
    • 1
    Email author
  • Joshua Zhexue Huang
    • 1
  1. 1.College of Computer Science and Software EngineeringShenzhen UniversityShenzhenChina

Personalised recommendations