Advertisement

The VLDB Journal

, Volume 17, Issue 3, pp 489–513 | Cite as

Form-based proxy caching for database-backed web sites: keywords and functions

  • Qiong Luo
  • Jeffrey F. Naughton
  • Wenwei Xue
Regular Paper

Abstract

Web caching proxy servers are essential for improving web performance and scalability, and recent research has focused on making proxy caching work for database-backed web sites. In this paper, we explore a new proxy caching framework that exploits the query semantics of HTML forms. We identify two common classes of form-based queries from real-world database-backed web sites, namely, keyword-based queries and function-embedded queries. Using typical examples of these queries, we study two representative caching schemes within our framework: (i) traditional passive query caching, and (ii) active query caching, in which the proxy cache can service a request by evaluating a query over the contents of the cache. Results from our experimental implementation show that our form-based proxy is a general and flexible approach that efficiently enables active caching schemes for database-backed web sites. Furthermore, handling query containment at the proxy yields significant performance advantages over passive query caching, but extending the power of the active cache to do full semantic caching appears to be less generally effective.

Keywords

Web proxy caching Database-backed web sites 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Adali, S., Selçuk Candan, K., Papakonstantinou, Y., Subrahmanian, V.S.: Query caching and optimization in distributed mediator systems. In: Proceedings of the 1996 ACM SIGMOD International Conference on Management of Data, Montreal, 4–6 June 1996, pp. 137–148Google Scholar
  2. 2.
    Altinel, M., Franklin, M.J.: Efficient filtering of XML documents for selective dissemination of information. In: Proceedings of the 26th International Conference on Very Large Data Bases (VLDB), Cairo, 10–14 September 2000, pp. 53–64Google Scholar
  3. 3.
    Amiri, K., Park, S., Tewari, R., Padmanabhan, S.: Scalable template-based query containment checking for web semantic caches. In: Proceedings of the 19th International Conference on Data Engineering (ICDE), Bangalore 5–8 March 2003, pp. 493–504Google Scholar
  4. 4.
    Apache Tomcat Servlet Engine. http://jakarta.apache.org/ tomcat/index.htmlGoogle Scholar
  5. 5.
    Atzeni, P., Mecca, G., Merialdo, P.: To weave the web. In: Proceedings of the 23rd International Conference on Very Large Data Bases (VLDB), Athens, 25–29 August 1997, pp. 206–215Google Scholar
  6. 6.
    Selçuk Candan, K., Li, W.-S., Luo, Q., Hsiung, W.-P., Agrawal, D.: Enabling dynamic content caching for database-driven web sites. In: Proceedings of the 2001 ACM SIGMOD International Conference on Management of Data, santa Barbara, 21–24 May 2001, pp. 532–543Google Scholar
  7. 7.
    Cao, P., Zhang, J., Beach, K.: Active cache: Caching dynamic contents on the web. In: Proceedings of IFIP International Conference on Distributed Systems Platforms and Open Distributed Processing (Middleware ′98),The Lake District, 15–18 September 1998, pp. 373–388Google Scholar
  8. 8.
    Challenger, J., Iyengar, A., Dantzig, P.: A scalable system for consistently caching dynamic web data. In: Proceedings of IEEE INFOCOM ′99, The Conference on Computer Communications, 18th Annual Joint Conference of the IEEE Computer and Communications Societies, New York 21–25 March 1999, vol. 1, pp 294–303Google Scholar
  9. 9.
    Chidlovskii B., Roncancio C., Schneider M.-L. (1999) Semantic cache mechanism for heterogeneous web querying. Comput. Netw. 31(11–16): 1347–1360CrossRefGoogle Scholar
  10. 10.
    Dar, S., Franklin, M.J., Þór Jónsson, B., Srivastava, D., Tan, M.: Semantic data caching and replacement. In: Proceedings of 22nd International Conference on Very Large Data Bases (VLDB), Mumbai (Bombay), 3–6 September 1996, pp. 330–341Google Scholar
  11. 11.
    Datta, A., Dutta, K., Thomas, H., Vandermeer, D., Suresha, Ramamritham, K.: Proxy-based acceleration of dynamically generated content on the world wide web: an approach and implementation. In: Proceedings of the 2002 ACM SIGMOD International Conference on Management of Data, Madison, 3–6 June 2002, pp. 97–108Google Scholar
  12. 12.
    Deshpande, P., Ramasamy, K., Shukla, A., Naughton, J.F.: Caching multidimensional queries using chunks. In: Proceedings of the 1998 ACM SIGMOD International Conference on Management of Data, Seattle, 2–4 June 1998, pp. 259–270Google Scholar
  13. 13.
    Excite Search Engine. http://www.excite.comGoogle Scholar
  14. 14.
    Fernandez, M.F., Florescu, D., Kang, J., Levy, A.Y., Suciu, D.: Catching the boat with strudel: experiences with a web-site management system. In: Proceedings of the 1998 ACM SIGMOD International Conference on Management of Data, Seattle, 2–4 June 1998, pp. 414–425Google Scholar
  15. 15.
    Goldstein, J., Larson, P.-Å.: Optimizing queries using materialized views: a practical, scalable solution. In: Proceedings of the 2001 ACM SIGMOD International Conference on Management of Data, Santa Barbara, 21–24 May 2001, pp. 331–342Google Scholar
  16. 16.
    Guttman, A.: R-trees: a dynamic index structure for spatial searching. In: Proceedings of the 1984 ACM SIGMOD International Conference on Management of Data, Boston, 18–21 June 1984, pp 47–57Google Scholar
  17. 17.
    Halevy A.Y. (2000) Theory of answering queries using views. SIGMOD Record 29(4): 40–47CrossRefGoogle Scholar
  18. 18.
    Hellerstein, J.M., Naughton, J.F.: Query execution techniques for caching expensive methods. In: Proceedings of the 1996 ACM SIGMOD International Conference on Management of Data, Montreal, 4–6 June 1996, pp. 423–434 (1996)Google Scholar
  19. 19.
    Keller A.M., Basu J. (1996) A predicate-based caching scheme for client-server database architectures. In: VLDB J. 5(1): 35–47CrossRefGoogle Scholar
  20. 20.
    Labrinidis, A., Roussopoulos, N.: WebView materialization. In: Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data, Dallas, 16–18 May 2000, pp. 367–378Google Scholar
  21. 21.
    Larson, P.-Å., Yang, H.Z.: Computing queries from derived relations. In: Proceedings of the 11th International Conference on Very Large Data Bases (VLDB), Stockholm, 21–23 August 1985, pp. 259–269Google Scholar
  22. 22.
    Lee, D., Chu, W.W.: Semantic caching via query matching for web sources. In: Proceedings of the 8th International Conference on Information and Knowledge Management (CIKM), Kansas City, 2–6 November 1999, pp. 77–85Google Scholar
  23. 23.
    Levy, A.Y., Mendelzon, A.O., Sagiv, Y., Srivastava, D.: Answering queries using views. In: Proceedings of the 14th ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems (PODS), San Jose, 22–25 May 1995, pp. 95–104Google Scholar
  24. 24.
    Loukopoulos, T., Kalnis, P., Ahmad, I., Papadias, D.: Active caching of on-line-analytical-processing queries in WWW proxies. In: Proceedings of the 2001 International Conference on Parallel Processing (ICPP), Valencia, 3–7 September 2001, pp. 419–426Google Scholar
  25. 25.
    Luo, Q., Krishnamurthy, S., Mohan, C., Pirahesh, H., Woo, H., Lindsay, B.G., Naughton, J.F.: Middle-tier database caching for e-business. In: Proceedings of the 2002 ACM SIGMOD International Conference on Management of Data, Madison, 3–6 June 2002, pp. 600–611Google Scholar
  26. 26.
    Luo, Q., Naughton, J.F., Krishnamurthy, R., Cao, P., Li, Y.: Active query caching for database web servers. In: Proceedings of the 3rd International Workshop on Web and Databases (WebDB), Dallas, 18–19 May 2000, pp. 92–104Google Scholar
  27. 27.
    Luo, Q., Naughton, J.F.: Form-based proxy caching for database-backed web sites. In: Proceedings of the 27th International Conference on Very Large Data Bases (VLDB), Roma, 11–14 September 2001, pp. 191–200Google Scholar
  28. 28.
    Luo, Q., Xue, W.: Template-based proxy caching for table-valued functions. In: Proceedings of 9th International Conference on Database Systems for Advances Applications (DASFAA), Jeju Island, 17–19 March 2004, pp. 339–351Google Scholar
  29. 29.
    Markatos, E.P.: On caching search engine query results. In: Proceedings of the 5th International Web Caching and Content Delivery Workshop, Lisbon, 22–24 May 2000 (2000)Google Scholar
  30. 30.
    MAST SDSS Query Interface. http://archive.stsci.edu/cgi-bin/ sdss/catalogGoogle Scholar
  31. 31.
    Multidimensional Geometry from MathWorld. http:// mathworld.wolfram.com/topics/Multidimensional Geometry.htmlGoogle Scholar
  32. 32.
    Oracle Application Server. http://www.oracle.com/appserver/Google Scholar
  33. 33.
    Pottinger, R., Halevy, A.: A scalable algorithm for answering queries using views. In: Proceedings of the 26th International Conference on Very Large Data Bases (VLDB), Cairo, 10–14 September 2000, pp. 484–495Google Scholar
  34. 34.
    Rajaraman, A., Sagiv, Y., Ullman, J.D.: Answering queries using templates with binding patterns. In: Proceedings of the 14th ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems (PODS), San Jose, 22–25 May 1995, pp. 105–112Google Scholar
  35. 35.
    Rodriguez-Martinez, M., Roussopoulos, N.: MOCHA: a self-extensible database middleware system for distributed data sources. In: Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data, Dallas, 16–18 May 2000, pp. 213–224Google Scholar
  36. 36.
    Sellis T.K. (1988) Intelligent caching and indexing techniques for relational database systems. Informat. Syst. 13(2): 175–185CrossRefGoogle Scholar
  37. 37.
    Silverstein, C., Henzinger, M., Marais, H., Moicz, M.: Analysis of a very large AltaVista query log. In: Compaq SRC Technical Note 1998–014, October 1998Google Scholar
  38. 38.
    SkyServer. http://skyserver.sdss.org/Google Scholar
  39. 39.
    Smith, B., Acharya, A., Yang, T., Zhu, H.: Exploiting result equivalence in caching dynamic web content. In: Proceedings of the 2nd USENIX Symposium on Internet Technologies and Systems, Boulder, 11–14 October 1999, pp. 209–220Google Scholar
  40. 40.
    Srivastava, D., Dar, S., Jagadish, H.V., Levy, A.Y.: Answering queries with aggregations using views. In: Proceedings of 22th International Conference on Very Large Data Bases (VLDB), Mumbai (Bombay), 3–6 September 1996, pp. 318–329Google Scholar
  41. 41.
    Szalay, A.S., Gray, J., Thakar, A.R., Kunszt, P.Z., Malik, T., Raddick, J., Stoughton, C., vandenBerg, J.: The SDSS skyserver – public access to the sloan digital sky server data. In: Proceedings of the 2002 ACM SIGMOD International Conference on Management of Data, Madison, 3–6 June 2002, pp. 570–581Google Scholar
  42. 42.
    Transaction Processing Performance Council (TPC). TPC BenchmarkTH W (Web Commerce) Specification Version 1.1., 27 June 2000Google Scholar
  43. 43.
    Yagoub, K., Florescu, D., Issarny, V., Valduriez, P.: Building and customizing data-intensive web sites using weave. In: Proceedings of the 26th International Conference on Very Large Data Bases (VLDB), Cairo 10–14 September 2000, pp. 607–610Google Scholar

Copyright information

© Springer-Verlag 2006

Authors and Affiliations

  1. 1.Department of Computer ScienceHong Kong University of Science and TechnologyKowloon, Hong KongChina
  2. 2.Computer Sciences DepartmentUniversity of Wisconsin-MadisonMadisonUSA

Personalised recommendations