Awareness services for digital libraries

  • Arturo Crespo
  • Hector Garcia-Molina
Part of the Lecture Notes in Computer Science book series (LNCS, volume 1324)


We propose an architecture for Digital Library repositories in which one or more data stores persistently hold the digital objects (e.g., documents) and interact with clients that perform indexing, replication, intellectual property management, revenue management, and other functions. One of the most critical components in such stores is the awareness mechanism, which notifies clients of inserted, deleted, or changed objects. In this paper we survey the various awareness schemes (including snapshot-, timestamp-, and log-based schemes), describing them all as variations of a single unified scheme. This makes it possible to understand their relative differences and strengths. In particular, we focus on a signature-based awareness scheme that we believe is especially well suited for digital libraries, and show enhancements that improve its performance.
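To make the idea of signature-based awareness concrete, the following is a minimal sketch, not the algorithm from the paper (which the abstract does not spell out). It assumes that a store can be modeled as a mapping from object identifiers to bytes and that a SHA-256 digest serves as the signature; the names `signature`, `summarize`, and `diff` are hypothetical and chosen only for illustration. A client keeps the signatures it saw last time and compares them with fresh ones to learn which objects were inserted, deleted, or changed.

```python
import hashlib
from typing import Dict, List, Tuple


def signature(content: bytes) -> str:
    """Return a fixed-size signature of an object's content.

    SHA-256 is an assumption made for this sketch, not the paper's choice.
    """
    return hashlib.sha256(content).hexdigest()


def summarize(store: Dict[str, bytes]) -> Dict[str, str]:
    """Map every object identifier in the store to its signature."""
    return {oid: signature(data) for oid, data in store.items()}


def diff(old: Dict[str, str], new: Dict[str, str]) -> Tuple[List[str], List[str], List[str]]:
    """Compare two signature maps and report (inserted, deleted, changed) ids."""
    inserted = sorted(new.keys() - old.keys())
    deleted = sorted(old.keys() - new.keys())
    changed = sorted(oid for oid in old.keys() & new.keys() if old[oid] != new[oid])
    return inserted, deleted, changed


# Hypothetical usage: the client remembers `before`, the store is modified,
# and the client later compares against `after`.
store = {"doc1": b"alpha", "doc2": b"beta"}
before = summarize(store)

store["doc3"] = b"gamma"      # insert a new object
del store["doc1"]             # delete an object
store["doc2"] = b"beta v2"    # change an object

after = summarize(store)
inserted, deleted, changed = diff(before, after)
print(inserted, deleted, changed)
```

In this sketch the client only has to retain one short signature per object rather than a full copy of the store, which is the general appeal of comparing signatures instead of complete snapshots.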


Keywords (machine-generated, not supplied by the authors): Digital Library, Digital Object, Custom Function, Signature Algorithm, Equivalent Vector





Copyright information

© Springer-Verlag Berlin Heidelberg 1997

Authors and Affiliations

  • Arturo Crespo (1)
  • Hector Garcia-Molina (1)
  1. Stanford University, USA
