Quasi-copies: Efficient data sharing for information retrieval systems

  • Rafael Alonso
  • Daniel Barbara
  • Hector Garcia-Molina
  • Soraya Abad
Efficiency By Replicated Data
Part of the Lecture Notes in Computer Science book series (LNCS, volume 303)


Currently, a variety of information retrieval systems are available to potential users. These services are provided by commercial enterprises (such as Dow Jones [Dunn 1984] and The Source [Edelhart 1983]), while others are research efforts (the Boston Community Information System [Gifford1985]). While in many cases these systems are accessed from personal computers, typically no advantage is taken of the computing resources of those machines (such as local processing and storage). In this paper we explore the possibility of using the user's local storage capabilities to cache data at the user's site. This would improve the response time of user queries albeit at the cost of incurring the the overhead required in maintaining multiple copies. In order to reduce this overhead it may be appropriate to allow copies to diverge in a controlled fashion. This would not only make caching less costly, but would also make it possible to propagate updates to the copies more efficiently, e.g., when the system is lightly loaded, when communication tariffs are lower, or by batching together updates. Just as importantly, it also makes it possible to access the copies even when the communication lines or the central site are down. Thus, we introduce the notion of quasi-copies which embodies the ideas sketched above. We also define the types of deviations that seem useful, and discuss the available implementation strategies.


Central Node Central Site Transmission Delay Delay Condition Remote Site 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. Dunn 1984.
    Bill Dunn, “Bill Dunn of Dow Jones: The Data Merchant,” Personal Computing, pp. 162–176, December 1984.Google Scholar
  2. Edelhart1983.
    Mike Edelhart and Owen Davies, OMNI Online Database Dictionary, Collier MacMillan Publishers, 1983.Google Scholar
  3. Garcia-Molina1986.
    Hector Garcia-Molina, Rafael Alonso, Daniel Barbara, and Soraya Abad, “Data Caching in an Information Retrieval System,” Technical Report CS-TR-065-86, Department of Computer Science, Princeton University, 1986.Google Scholar
  4. Gifford1985.
    David K. Gifford, John M. Lucassen, and Stephen T. Berlin, “The Application of Digital Broadcast Communication to Large Scale Information Systems,” IEEE Journal on Selected Areas in Communication, May 1985.Google Scholar
  5. Kleinrock1975.
    Leonard Kleinrock, “Queueing Systems, Vol. 1: Theory,” Wiley-Interscience, 1975.Google Scholar
  6. Lamport1978.
    Leslie Lamport, “Time, clocks, and the ordering of events in a distributed system,” Communications of the ACM, vol. 21, no. 7, pp. 558–565, July 1978.CrossRefGoogle Scholar
  7. Smith1982.
    Alan J. Smith, “Cache Memories,” Computing Surveys, vol. 14, no. 3, September 1982.Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 1988

Authors and Affiliations

  • Rafael Alonso
    • 1
  • Daniel Barbara
    • 1
  • Hector Garcia-Molina
    • 1
  • Soraya Abad
    • 2
  1. 1.Department of Computer SciencePrinceton UniversityPrinceton
  2. 2.Department of Computer ScienceRutgers UniversityNew Brunswick

Personalised recommendations