Abstract
In digital libraries accessing distributed Web-based bibliographic repositories, performance is a major issue. Efficient query processing requires an appropriate caching mechanism. Unfortunately, standard page-based as well as tuple-based caching mechanisms designed for conventional databases are not efficient on the Web, where keyword-based querying is often the only way to retrieve data. Therefore, we study the problem of semantic caching of Web queries and develop a caching mechanism for conjunctiveWeb queries based on signature files.We propose two implementation choices. A first algorithm copes with the relation of semantic containment between a query and the corresponding cache items. A second algorithm extends this processing to more complex cases of semantic intersection. We report results of experiments and show how the caching mechanism is successfully realized in the Knowledge Broker system.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
S. Adali, K. S. Candan, Y. Papakonstantinou, V. S. Subrahmanian. Query Caching and Optimization in Distributed Mediator Systems. In Proc. SIGMOD’ 96 Conf., pp. 137–148, 1996.
R. Alonso, D. Barbara, H. Garcia-Molina. Data Caching Issues in an Information Retrieval System. In ACM TODS 15:3, 359–384, 1990.
J.-M. Andreoli, U. M. Borghoff, R. Pareschi. Constraint-Based Knowledge Broker Model: Semantics, Implementation and Analysis. In Journal of Symbolic Computation bf 21:4, 635–667, 1996.
Y. Arens and C. A. Knoblock. Intelligent Caching: Selecting, Representing, and Reusing Data in an Information Server. In Proc. CIKM’ 94 Conf., Gaithersburg, MD, pp. 433–438, 1994.
U. M. Borghoff, R. Pareschi, F. Arcelli, F. Formato. Constraint-Based Protocols for Distributed Problem Solving. In Science of Computer Programming 30, 201–225, 1998.
M. J. Carey, M. J. Franklin, M. Livny, E. J. Shekita. Data Caching Tradeoffs in Client-Server DBMS Architectures. In Proc. SIGMOD’ 91 Conf., pp. 357–366, 1991.
C.-C. K. Chang, H. Garcia-Molina, A. Paepcke. Boolean Query Mapping Across Heterogeneous Information Sources. In IEEE TOKDE 8:4, 1996.
C.-C. K. Chang and H. Garcia-Molina. Evaluating the Cost of Boolean Query Mapping. In Proc. 2nd ACM Int’l. Conf. on Digital Library, 1997.
S. Dar, M. J. Franklin, B. Jonsson, D. Srivastava, M. Tan. Semantic Data Ca-ching and Replacement. In Proc. 22nd VLDB Conf., Bombay, India, pp. 330–341, 1996.
C. Faloutsos. Signature files: Design and Performance Comparison of Some Signature Extraction Methods. In Proc. SIGMOD’ 85 Conf., pp. 63–82, 1985.
C. Faloutsos and S. Christodoulakis. Signature Files: An Access Method for Documents and Its Analytical Performance Evaluation. In ACM TOIS 2:4, 267–288, 1984.
C. Faloutsos and S. Christodoulakis. Description and Performance Analysis of Signature File Methods for Office Filing. In ACM TOIS 5:3, 237–257, 1987.
P. Godfrey and J. Gryz. Semantic Query Caching For Heterogeneous Databases. In Proc. 4th KRDB Workshop on Intelligent Access to Heterogeneous Information, Athens, Greece, pp. 6.1–6.6, 1997.
H. Kitagawa, J. Fukushima, Y. Ishikawa and N. Ohbo.. Estimation of False Drops in Set-valued Object Retrieval with Signature Files. In Proc. 4th Int’l. Conf. FODO’ 93, Chicago, IL. Springer-Verlag, LNCS 730, 146–63, 1993.
D. L. Lee, Y. M. Kim and G. Patel. Efficient Signature File Methods for Text Retrieval. In IEEE TOKDE 7:3, 423–435, 1995.
A. Y. Levi, A. Rajaraman, J. J. Ordille. Quering Heterogeneous Information Sources Using Source Descriptions. In Proc. 22nd VLDB Conf., Bombay, India, pp. 251–262, 1996.
P. T. Martin and J. I. Russell. Data caching strategies for distributed full text retrieval systems. In Information Systems 16:1, 1–11, 1991.
A. Paepcke, S. B. Cousins, H. Garcia-Molina, et al. Towards Interoperability in Digital Libraries: Overview and Selected Highlights of the Stanford Digital Library Project. In IEEE Computer Magazine 29:5, 1996.
Y. Papakonstantinou, A. Gupta, H. Garcia-Molina, J. Ullman. A Query Transaction Scheme for Rapid Implementation of Wrappers. In Proc. DOOD’95 Conference. Springer-Verlag, LNCS 1013, 161–186, 1995.
Y. Papakonstantinou, H. Garcia-Molina, J. Ullman. MedMaker: A Mediation System Based on Declarative Specifications. in Proc. ICDE’96 Conf., pp.132–141, 1996.
Ch. Reck and B. König-Ries. An Architecture for Transparent Access to Semantically Heterogeneous Information Sources. In Proc. Cooperative Information Agents. Springer-Verlag, LNCS 1202, 1997.
A. Yoshida. MOWS: Distributed Web and Cache Server in Java. In Computer Networks and ISDN Systems 29:8–13, 965–976, 1997.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1998 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Chidlovskii, B., Borghoff, U.M. (1998). Signature File Methods for Semantic Query Caching. In: Nikolaou, C., Stephanidis, C. (eds) Research and Advanced Technology for Digital Libraries. ECDL 1998. Lecture Notes in Computer Science, vol 1513. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-49653-X_29
Download citation
DOI: https://doi.org/10.1007/3-540-49653-X_29
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-65101-7
Online ISBN: 978-3-540-49653-3
eBook Packages: Springer Book Archive