Abstract
Currently, a variety of information retrieval systems are available to potential users. These services are provided by commercial enterprises (such as Dow Jones [Dunn 1984] and The Source [Edelhart 1983]), while others are research efforts (the Boston Community Information System [Gifford1985]). While in many cases these systems are accessed from personal computers, typically no advantage is taken of the computing resources of those machines (such as local processing and storage). In this paper we explore the possibility of using the user's local storage capabilities to cache data at the user's site. This would improve the response time of user queries albeit at the cost of incurring the the overhead required in maintaining multiple copies. In order to reduce this overhead it may be appropriate to allow copies to diverge in a controlled fashion. This would not only make caching less costly, but would also make it possible to propagate updates to the copies more efficiently, e.g., when the system is lightly loaded, when communication tariffs are lower, or by batching together updates. Just as importantly, it also makes it possible to access the copies even when the communication lines or the central site are down. Thus, we introduce the notion of quasi-copies which embodies the ideas sketched above. We also define the types of deviations that seem useful, and discuss the available implementation strategies.
Preview
Unable to display preview. Download preview PDF.
References
Bill Dunn, “Bill Dunn of Dow Jones: The Data Merchant,” Personal Computing, pp. 162–176, December 1984.
Mike Edelhart and Owen Davies, OMNI Online Database Dictionary, Collier MacMillan Publishers, 1983.
Hector Garcia-Molina, Rafael Alonso, Daniel Barbara, and Soraya Abad, “Data Caching in an Information Retrieval System,” Technical Report CS-TR-065-86, Department of Computer Science, Princeton University, 1986.
David K. Gifford, John M. Lucassen, and Stephen T. Berlin, “The Application of Digital Broadcast Communication to Large Scale Information Systems,” IEEE Journal on Selected Areas in Communication, May 1985.
Leonard Kleinrock, “Queueing Systems, Vol. 1: Theory,” Wiley-Interscience, 1975.
Leslie Lamport, “Time, clocks, and the ordering of events in a distributed system,” Communications of the ACM, vol. 21, no. 7, pp. 558–565, July 1978.
Alan J. Smith, “Cache Memories,” Computing Surveys, vol. 14, no. 3, September 1982.
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 1988 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Alonso, R., Barbara, D., Garcia-Molina, H., Abad, S. (1988). Quasi-copies: Efficient data sharing for information retrieval systems. In: Schmidt, J.W., Ceri, S., Missikoff, M. (eds) Advances in Database Technology—EDBT '88. EDBT 1988. Lecture Notes in Computer Science, vol 303. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-19074-0_67
Download citation
DOI: https://doi.org/10.1007/3-540-19074-0_67
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-19074-5
Online ISBN: 978-3-540-39095-4
eBook Packages: Springer Book Archive