Evaluation, Analysis and Adaptation of Web Prefetching Techniques in Current Web

  • Josep Domènech
  • Ana Pont-Sanjuán
  • Julio Sahuquillo
  • José A. Gil
Chapter
Part of the Advanced Information and Knowledge Processing book series (AI&KP)

Abstract

Web prefetching consists of preprocessing user requests before the user actually makes them. By hiding request latencies, it reduces the time the user must wait for the requested documents. Prefetching is usually transparent to the user; that is, there is no interaction between the prefetching system and the user. For this reason, the system must speculate about the user’s next requests, and its predictions can fail. When they do, web prefetching increases resource requirements, so it should be applied carefully.
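The trade-off described above can be made concrete with a small simulation. The following is only a minimal sketch, not the algorithm or framework presented in the chapter: it assumes a toy first-order predictor that prefetches the most frequent successor of the current page, and the names `simulate_prefetch` and `threshold` are hypothetical. It counts how many speculations hide a request (hits) and how many only add traffic (wasted).

```python
from collections import Counter, defaultdict


def simulate_prefetch(requests, threshold=0.5):
    """Replay a request sequence with a toy first-order predictor.

    After each request, the most frequent successor seen so far is
    prefetched if its estimated probability reaches `threshold`.
    Returns (hits, wasted, total_requests).
    Illustrative assumption only; not the chapter's method.
    """
    successors = defaultdict(Counter)  # page -> counts of pages requested next
    prefetched = None                  # page speculatively fetched, if any
    previous = None
    hits = wasted = 0

    for page in requests:
        # Score the previous speculation against the actual request.
        if prefetched is not None:
            if prefetched == page:
                hits += 1      # latency hidden: the page was already fetched
            else:
                wasted += 1    # misprediction: extra traffic, no benefit

        # Learn the transition that was just observed.
        if previous is not None:
            successors[previous][page] += 1

        # Speculate on the next request.
        prefetched = None
        counts = successors[page]
        total = sum(counts.values())
        if total:
            candidate, count = counts.most_common(1)[0]
            if count / total >= threshold:
                prefetched = candidate

        previous = page

    return hits, wasted, len(requests)
```

For the sequence `["a", "b", "a", "b", "a", "c"]` with the default threshold, this sketch reports 2 hits and 1 wasted prefetch out of 6 requests. The speculation issued after the final request is never scored. Raising `threshold` makes the predictor more conservative, trading fewer hidden latencies for less wasted traffic.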

This chapter describes a methodology to evaluate, analyze and improve the performance of web prefetching algorithms. We also show how this methodology can be used to improve existing algorithms by taking current workload characteristics into account.

First, we present a solid framework and methodology to evaluate web prefetching techniques from the user’s point of view. Second, we analyze how prefetching algorithms can be improved from that same point of view, taking advantage of the characteristics of the current web to design a new algorithm that outperforms those in the open literature. Finally, we explore the performance limits of web prefetching to determine the potential benefits of this technique depending on the architecture in which it is implemented.

Keywords

Processor Time; Prediction Algorithm; Dependency Graph; Proxy Server; Latency Reduction
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Copyright information

© Springer-Verlag London Limited 2010

Authors and Affiliations

  • Josep Domènech (1)
  • Ana Pont-Sanjuán (1)
  • Julio Sahuquillo (1)
  • José A. Gil (1)
  1. Universitat Politècnica de València, València, Spain