Towards Intelligent Semantic Caching for Web Sources

Journal of Intelligent Information Systems

Abstract

We present an intelligent semantic caching scheme suitable for web sources. Because web sources typically have weaker querying capabilities than conventional databases, existing semantic caching schemes cannot be applied directly. Our proposal accounts for the difference between the query capabilities of an end-user system and those of web sources. We also analyze the types of match that can occur between a user's input query and cached queries and, based on this analysis, present an algorithm that finds the best-matched query under different circumstances. Furthermore, we present a method that uses semantic knowledge acquired from the data to avoid unnecessary access to web sources by turning a cache miss into a cache hit. To verify the effectiveness of the proposed scheme, we first show how to generate synthetic queries that exhibit different levels of semantic locality. Then, using these test sets, we show that the proposed query matching technique is an efficient and effective approach to semantic caching for web databases.
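The abstract does not spell out the match types or the matching algorithm, so the following is only an illustrative sketch of the general idea, not the authors' method. It assumes queries are conjunctions of closed numeric ranges, assumes a five-way classification (exact, contained, containing, overlapping, disjoint), and assumes a preference order among them. The names `Match`, `classify`, and `best_match` are hypothetical.

```python
import math
from enum import IntEnum
from typing import Dict, Optional, Tuple

# Assumption: a query is a conjunction of closed ranges, one per attribute.
# An attribute that is absent from the query is treated as unconstrained.
Query = Dict[str, Tuple[float, float]]
UNBOUNDED = (-math.inf, math.inf)


class Match(IntEnum):
    """Assumed match types, ordered from least to most useful for the cache."""
    DISJOINT = 0      # no shared answers
    OVERLAPPING = 1   # some shared answers, neither query contains the other
    CONTAINING = 2    # user query contains the cached one: partial answer in cache
    CONTAINED = 3     # user query lies inside the cached one: filter the cache
    EXACT = 4         # identical queries


def classify(user: Query, cached: Query) -> Match:
    """Classify how a user query relates to a single cached query."""
    user_in_cached = True
    cached_in_user = True
    for attr in set(user) | set(cached):
        ulo, uhi = user.get(attr, UNBOUNDED)
        clo, chi = cached.get(attr, UNBOUNDED)
        if uhi < clo or chi < ulo:
            # One non-intersecting attribute makes the conjunctions disjoint.
            return Match.DISJOINT
        if not (clo <= ulo and uhi <= chi):
            user_in_cached = False
        if not (ulo <= clo and chi <= uhi):
            cached_in_user = False
    if user_in_cached and cached_in_user:
        return Match.EXACT
    if user_in_cached:
        return Match.CONTAINED
    if cached_in_user:
        return Match.CONTAINING
    return Match.OVERLAPPING


def best_match(user: Query, cache: Dict[str, Query]) -> Optional[Tuple[str, Match]]:
    """Return the cached query with the highest-ranked match, or None on a miss."""
    best: Optional[Tuple[str, Match]] = None
    for name, cached in cache.items():
        match = classify(user, cached)
        if match is Match.DISJOINT:
            continue
        if best is None or match > best[1]:
            best = (name, match)
    return best


cache = {
    "q1": {"price": (0, 100)},
    "q2": {"price": (50, 60), "year": (1990, 2000)},
}
print(best_match({"price": (40, 70)}, cache))
```

In this sketch a contained match is ranked above a containing one because the cached result alone can answer the user query by local filtering, whereas a containing match still requires fetching the remainder from the web source. A real scheme would also have to respect which predicates the web source can actually evaluate, which this example ignores.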




About this article

Cite this article

Lee, D., Chu, W.W. Towards Intelligent Semantic Caching for Web Sources. Journal of Intelligent Information Systems 17, 23–45 (2001). https://doi.org/10.1023/A:1012598631912


  • DOI: https://doi.org/10.1023/A:1012598631912
