Abstract
Web caching proxy servers are essential for improving web performance and scalability, and recent research has focused on making proxy caching work for database-backed web sites. In this paper, we explore a new proxy caching framework that exploits the query semantics of HTML forms. We identify two common classes of form-based queries from real-world database-backed web sites, namely, keyword-based queries and function-embedded queries. Using typical examples of these queries, we study two representative caching schemes within our framework: (i) traditional passive query caching, and (ii) active query caching, in which the proxy cache can service a request by evaluating a query over the contents of the cache. Results from our experimental implementation show that our form-based proxy is a general and flexible approach that efficiently enables active caching schemes for database-backed web sites. Furthermore, handling query containment at the proxy yields significant performance advantages over passive query caching, but extending the power of the active cache to do full semantic caching appears to be less generally effective.
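The difference between the two schemes can be illustrated with a minimal sketch for conjunctive keyword queries. It is not the implementation evaluated in the paper: the class `KeywordQueryCache`, its method names, and the sample rows are hypothetical, and the sketch assumes that every cached result list is complete (not truncated to a top-k page) and that each row carries all the text the server matches against. Under those assumptions, a new query whose keyword set is a superset of a cached query's keyword set is contained in the cached query, so the proxy can answer it by filtering the cached rows.

```python
from typing import Dict, FrozenSet, Iterable, List, Optional


def _normalize(keywords: Iterable[str]) -> FrozenSet[str]:
    """Lower-case the keywords and treat them as an unordered set."""
    return frozenset(k.lower() for k in keywords)


class KeywordQueryCache:
    """Toy proxy cache for conjunctive (AND) keyword queries (hypothetical)."""

    def __init__(self) -> None:
        # Cached query (as a keyword set) -> rows returned by the origin server.
        self._entries: Dict[FrozenSet[str], List[str]] = {}

    def store(self, keywords: Iterable[str], rows: Iterable[str]) -> None:
        self._entries[_normalize(keywords)] = list(rows)

    def lookup_passive(self, keywords: Iterable[str]) -> Optional[List[str]]:
        """Passive caching: hit only when the very same query was cached."""
        return self._entries.get(_normalize(keywords))

    def lookup_active(self, keywords: Iterable[str]) -> Optional[List[str]]:
        """Active caching: also hit when a cached query contains the new one."""
        wanted = _normalize(keywords)
        for cached_keywords, rows in self._entries.items():
            # Assumption: AND semantics, so fewer keywords means a broader query.
            if cached_keywords <= wanted:
                extra = wanted - cached_keywords
                # Evaluate the remaining keywords over the cached rows.
                return [
                    row for row in rows
                    if all(k in row.lower().split() for k in extra)
                ]
        return None  # Cache miss: the request must be forwarded to the server.


cache = KeywordQueryCache()
cache.store(
    ["caching"],
    [
        "Semantic data caching",
        "Proxy caching for web sites",
        "Caching multidimensional queries",
    ],
)
passive_hit = cache.lookup_passive(["caching", "proxy"])
active_hit = cache.lookup_active(["caching", "proxy"])
miss = cache.lookup_active(["xml"])
```

Here the passive lookup misses because the exact query was never cached, while the active lookup answers from the cached rows of the broader query. Full semantic caching would go further and combine partial overlaps with a remainder query sent to the server, which this sketch does not attempt.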
Cite this article
Luo, Q., Naughton, J.F. & Xue, W. Form-based proxy caching for database-backed web sites: keywords and functions. The VLDB Journal 17, 489–513 (2008). https://doi.org/10.1007/s00778-006-0018-x
DOI: https://doi.org/10.1007/s00778-006-0018-x