Skip to main content
Log in

Form-based proxy caching for database-backed web sites: keywords and functions

  • Regular Paper
  • Published:
The VLDB Journal Aims and scope Submit manuscript

Abstract

Web caching proxy servers are essential for improving web performance and scalability, and recent research has focused on making proxy caching work for database-backed web sites. In this paper, we explore a new proxy caching framework that exploits the query semantics of HTML forms. We identify two common classes of form-based queries from real-world database-backed web sites, namely, keyword-based queries and function-embedded queries. Using typical examples of these queries, we study two representative caching schemes within our framework: (i) traditional passive query caching, and (ii) active query caching, in which the proxy cache can service a request by evaluating a query over the contents of the cache. Results from our experimental implementation show that our form-based proxy is a general and flexible approach that efficiently enables active caching schemes for database-backed web sites. Furthermore, handling query containment at the proxy yields significant performance advantages over passive query caching, but extending the power of the active cache to do full semantic caching appears to be less generally effective.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Adali, S., Selçuk Candan, K., Papakonstantinou, Y., Subrahmanian, V.S.: Query caching and optimization in distributed mediator systems. In: Proceedings of the 1996 ACM SIGMOD International Conference on Management of Data, Montreal, 4–6 June 1996, pp. 137–148

  2. Altinel, M., Franklin, M.J.: Efficient filtering of XML documents for selective dissemination of information. In: Proceedings of the 26th International Conference on Very Large Data Bases (VLDB), Cairo, 10–14 September 2000, pp. 53–64

  3. Amiri, K., Park, S., Tewari, R., Padmanabhan, S.: Scalable template-based query containment checking for web semantic caches. In: Proceedings of the 19th International Conference on Data Engineering (ICDE), Bangalore 5–8 March 2003, pp. 493–504

  4. Apache Tomcat Servlet Engine. http://jakarta.apache.org/ tomcat/index.html

  5. Atzeni, P., Mecca, G., Merialdo, P.: To weave the web. In: Proceedings of the 23rd International Conference on Very Large Data Bases (VLDB), Athens, 25–29 August 1997, pp. 206–215

  6. Selçuk Candan, K., Li, W.-S., Luo, Q., Hsiung, W.-P., Agrawal, D.: Enabling dynamic content caching for database-driven web sites. In: Proceedings of the 2001 ACM SIGMOD International Conference on Management of Data, santa Barbara, 21–24 May 2001, pp. 532–543

  7. Cao, P., Zhang, J., Beach, K.: Active cache: Caching dynamic contents on the web. In: Proceedings of IFIP International Conference on Distributed Systems Platforms and Open Distributed Processing (Middleware ′98),The Lake District, 15–18 September 1998, pp. 373–388

  8. Challenger, J., Iyengar, A., Dantzig, P.: A scalable system for consistently caching dynamic web data. In: Proceedings of IEEE INFOCOM ′99, The Conference on Computer Communications, 18th Annual Joint Conference of the IEEE Computer and Communications Societies, New York 21–25 March 1999, vol. 1, pp 294–303

  9. Chidlovskii B., Roncancio C., Schneider M.-L. (1999) Semantic cache mechanism for heterogeneous web querying. Comput. Netw. 31(11–16): 1347–1360

    Article  Google Scholar 

  10. Dar, S., Franklin, M.J., Þór Jónsson, B., Srivastava, D., Tan, M.: Semantic data caching and replacement. In: Proceedings of 22nd International Conference on Very Large Data Bases (VLDB), Mumbai (Bombay), 3–6 September 1996, pp. 330–341

  11. Datta, A., Dutta, K., Thomas, H., Vandermeer, D., Suresha, Ramamritham, K.: Proxy-based acceleration of dynamically generated content on the world wide web: an approach and implementation. In: Proceedings of the 2002 ACM SIGMOD International Conference on Management of Data, Madison, 3–6 June 2002, pp. 97–108

  12. Deshpande, P., Ramasamy, K., Shukla, A., Naughton, J.F.: Caching multidimensional queries using chunks. In: Proceedings of the 1998 ACM SIGMOD International Conference on Management of Data, Seattle, 2–4 June 1998, pp. 259–270

  13. Excite Search Engine. http://www.excite.com

  14. Fernandez, M.F., Florescu, D., Kang, J., Levy, A.Y., Suciu, D.: Catching the boat with strudel: experiences with a web-site management system. In: Proceedings of the 1998 ACM SIGMOD International Conference on Management of Data, Seattle, 2–4 June 1998, pp. 414–425

  15. Goldstein, J., Larson, P.-Å.: Optimizing queries using materialized views: a practical, scalable solution. In: Proceedings of the 2001 ACM SIGMOD International Conference on Management of Data, Santa Barbara, 21–24 May 2001, pp. 331–342

  16. Guttman, A.: R-trees: a dynamic index structure for spatial searching. In: Proceedings of the 1984 ACM SIGMOD International Conference on Management of Data, Boston, 18–21 June 1984, pp 47–57

  17. Halevy A.Y. (2000) Theory of answering queries using views. SIGMOD Record 29(4): 40–47

    Article  Google Scholar 

  18. Hellerstein, J.M., Naughton, J.F.: Query execution techniques for caching expensive methods. In: Proceedings of the 1996 ACM SIGMOD International Conference on Management of Data, Montreal, 4–6 June 1996, pp. 423–434 (1996)

  19. Keller A.M., Basu J. (1996) A predicate-based caching scheme for client-server database architectures. In: VLDB J. 5(1): 35–47

    Article  Google Scholar 

  20. Labrinidis, A., Roussopoulos, N.: WebView materialization. In: Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data, Dallas, 16–18 May 2000, pp. 367–378

  21. Larson, P.-Å., Yang, H.Z.: Computing queries from derived relations. In: Proceedings of the 11th International Conference on Very Large Data Bases (VLDB), Stockholm, 21–23 August 1985, pp. 259–269

  22. Lee, D., Chu, W.W.: Semantic caching via query matching for web sources. In: Proceedings of the 8th International Conference on Information and Knowledge Management (CIKM), Kansas City, 2–6 November 1999, pp. 77–85

  23. Levy, A.Y., Mendelzon, A.O., Sagiv, Y., Srivastava, D.: Answering queries using views. In: Proceedings of the 14th ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems (PODS), San Jose, 22–25 May 1995, pp. 95–104

  24. Loukopoulos, T., Kalnis, P., Ahmad, I., Papadias, D.: Active caching of on-line-analytical-processing queries in WWW proxies. In: Proceedings of the 2001 International Conference on Parallel Processing (ICPP), Valencia, 3–7 September 2001, pp. 419–426

  25. Luo, Q., Krishnamurthy, S., Mohan, C., Pirahesh, H., Woo, H., Lindsay, B.G., Naughton, J.F.: Middle-tier database caching for e-business. In: Proceedings of the 2002 ACM SIGMOD International Conference on Management of Data, Madison, 3–6 June 2002, pp. 600–611

  26. Luo, Q., Naughton, J.F., Krishnamurthy, R., Cao, P., Li, Y.: Active query caching for database web servers. In: Proceedings of the 3rd International Workshop on Web and Databases (WebDB), Dallas, 18–19 May 2000, pp. 92–104

  27. Luo, Q., Naughton, J.F.: Form-based proxy caching for database-backed web sites. In: Proceedings of the 27th International Conference on Very Large Data Bases (VLDB), Roma, 11–14 September 2001, pp. 191–200

  28. Luo, Q., Xue, W.: Template-based proxy caching for table-valued functions. In: Proceedings of 9th International Conference on Database Systems for Advances Applications (DASFAA), Jeju Island, 17–19 March 2004, pp. 339–351

  29. Markatos, E.P.: On caching search engine query results. In: Proceedings of the 5th International Web Caching and Content Delivery Workshop, Lisbon, 22–24 May 2000 (2000)

  30. MAST SDSS Query Interface. http://archive.stsci.edu/cgi-bin/ sdss/catalog

  31. Multidimensional Geometry from MathWorld. http:// mathworld.wolfram.com/topics/Multidimensional Geometry.html

  32. Oracle Application Server. http://www.oracle.com/appserver/

  33. Pottinger, R., Halevy, A.: A scalable algorithm for answering queries using views. In: Proceedings of the 26th International Conference on Very Large Data Bases (VLDB), Cairo, 10–14 September 2000, pp. 484–495

  34. Rajaraman, A., Sagiv, Y., Ullman, J.D.: Answering queries using templates with binding patterns. In: Proceedings of the 14th ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems (PODS), San Jose, 22–25 May 1995, pp. 105–112

  35. Rodriguez-Martinez, M., Roussopoulos, N.: MOCHA: a self-extensible database middleware system for distributed data sources. In: Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data, Dallas, 16–18 May 2000, pp. 213–224

  36. Sellis T.K. (1988) Intelligent caching and indexing techniques for relational database systems. Informat. Syst. 13(2): 175–185

    Article  Google Scholar 

  37. Silverstein, C., Henzinger, M., Marais, H., Moicz, M.: Analysis of a very large AltaVista query log. In: Compaq SRC Technical Note 1998–014, October 1998

  38. SkyServer. http://skyserver.sdss.org/

  39. Smith, B., Acharya, A., Yang, T., Zhu, H.: Exploiting result equivalence in caching dynamic web content. In: Proceedings of the 2nd USENIX Symposium on Internet Technologies and Systems, Boulder, 11–14 October 1999, pp. 209–220

  40. Srivastava, D., Dar, S., Jagadish, H.V., Levy, A.Y.: Answering queries with aggregations using views. In: Proceedings of 22th International Conference on Very Large Data Bases (VLDB), Mumbai (Bombay), 3–6 September 1996, pp. 318–329

  41. Szalay, A.S., Gray, J., Thakar, A.R., Kunszt, P.Z., Malik, T., Raddick, J., Stoughton, C., vandenBerg, J.: The SDSS skyserver – public access to the sloan digital sky server data. In: Proceedings of the 2002 ACM SIGMOD International Conference on Management of Data, Madison, 3–6 June 2002, pp. 570–581

  42. Transaction Processing Performance Council (TPC). TPC BenchmarkTH W (Web Commerce) Specification Version 1.1., 27 June 2000

  43. Yagoub, K., Florescu, D., Issarny, V., Valduriez, P.: Building and customizing data-intensive web sites using weave. In: Proceedings of the 26th International Conference on Very Large Data Bases (VLDB), Cairo 10–14 September 2000, pp. 607–610

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Qiong Luo.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Luo, Q., Naughton, J.F. & Xue, W. Form-based proxy caching for database-backed web sites: keywords and functions. The VLDB Journal 17, 489–513 (2008). https://doi.org/10.1007/s00778-006-0018-x

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00778-006-0018-x

Keywords

Navigation