MARIAN: Flexible Interoperability for Federated Digital Libraries

  • Marcos André Gonçalves
  • Robert K. France
  • Edward A. Fox
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 2163)

Abstract

Federated digital libraries are composed of distributed, autonomous, and often heterogeneous information services but provide users with a transparent, integrated view of collected information. In this paper we discuss a federated system for the Networked Digital Library of Theses and Dissertations (NDLTD), an international consortium of universities, libraries, and other supporting institutions focused on electronic theses and dissertations (ETDs). Federation requires dealing flexibly with differences among systems, ontologies, and data formats while respecting information sources’ autonomy. Our solution involves adapting the object-oriented digital library system MARIAN to serve as mediation middleware for the federated NDLTD collection. Components of the solution include: 1) the use and integration of several harvesting techniques; 2) an architecture based on object-oriented ontologies of search modules and metadata; 3) reconciliation of diversity within the harvested data joined to a single collection view for the user; and 4) an integrated framework for addressing such questions as data quality, flexible and efficient search, and scalability.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. [1]
    Abiteboul, S., Buneman, P. Suciu, D., Data on the Web: from relations to semistructured data and XML. Morgan Kaufmann, 1999Google Scholar
  2. [2]
    Adam, N., Atluri, V., Adiwijaya, I., “Systems Integration in Digital Libraries”, Communications of the ACM, 43(6), 2000, pp. 64–72CrossRefGoogle Scholar
  3. [3]
    Bowman, C. M., Danzig, P. B., Hardy, D. R., Manber, U., Schwartz, M. F., “The Harvest information discovery and access system”, Computer Networks and ISDN Systems, 28(1–2), 1995, pp. 119–126CrossRefGoogle Scholar
  4. [4]
    Fernandez, M. F., Florescu, D., Levy, A. Y., Suciu, D. “Declarative Specification of Web Sites with Strudel”. VLDB Journal 9(1): 38–55 (2000)CrossRefGoogle Scholar
  5. [5]
    Fox, E.A., R.K. France, E. Sahle, A.M. Daoud, and B.E. Cline, “Development of a Modern OPAC: From REVTOLC to MARIAN”. Proc. 16 th Int. ACM SIGIR Conf., 1993: pp. 248–259Google Scholar
  6. [6]
    France, R.K. “Weights and Measures: an Axiomatic Approach to Similarity Computations”. Internal report, Virginia Tech, 1995; http://www.dlib.vt.edu/repors/WeightsMeasures.pdf
  7. [7]
    France, R.K., L.T. Nowell, E.A. Fox, R.A. Saad, and J. Zhao: “Use and usability in a digital library search system.” CoRR cs.DL/9902013Google Scholar
  8. [8]
    Florescu, D., Levy, A., Mendelzon, A. “Database techniques for the World-Wide Web: A Survey”, SIGMOD Record. 27(3) 1998, pp. 59–74CrossRefGoogle Scholar
  9. [9]
    Fuhr, N., Rolleke, T., “A Probabilistic Relational Algebra for the Integration of Information retrieval and Database Systems”,.ACM Transactions on Information Systems, Vol. 15,No. 1, January, 1997, Pg. 32–66.CrossRefGoogle Scholar
  10. [10]
    Fuhr, N. “A Decision-Theoretic Approach to Database Selection in Networked IR”. ACM Transactions on Information Systems 17(3): 229–249 (1999)CrossRefGoogle Scholar
  11. [11]
    Fuhr, N., “Towards Data Abstraction in Networked Information Retrieval Systems”, Information Processing and Management 35(2): 101–119 (1999)CrossRefGoogle Scholar
  12. [12]
    Gravano, L., Garcia-Molina, H, “Merging Ranks from Heterogeneous Internet Sources”, Proc. of the 23 rd International Conference on Very Large Databases, 1997, pp. 196–205Google Scholar
  13. [13]
    Gonçalves, M.A., Kipp, N.A., Fox, E.A., Watson, L.T., “Streams, Structures, Spaces, Scenarios and Societies(5S): A Formal Model for Digital Libraries”, Tech. Rep., Virginia Tech, 2001.Google Scholar
  14. [14]
    Lagoze, C., Fielding, D., Payette, S., “Making Digital Libraries Work: Collection, Services, Connectivity Regions, and Collection Views”, Proc. 3 rd ACM Digital Libraries.1998, pp.134–143Google Scholar
  15. [15]
    Lagoze. C., Sompel, H. V., “The Open Archives Initiative”, Proc. of the First ACM-IEEE The Joint Conference on Digital Libraries, Roanoke, Virginia, 2001.Google Scholar
  16. [16]
    Lynch, C., “The Z39.50 Information Retrieval Standard-Part I: A Strategic View of Its Past, Present and Future”, D-Lib Magazine, April 1997.Google Scholar
  17. [17]
    Melnik, S., H. Garcia-Molina and A. Paepcke, “A Mediation infrastructure for digital library services” Proc. 5 th ACM Digital Libraries, San Antonio, 2000 pp.123–132.Google Scholar
  18. [19]
    McBrien, P., Poulovassilis, A., “Automatic Migration and Wrapping of Database Applications-A Schema Transformation Approach”. ER 1999: 96–113Google Scholar
  19. [20]
    Ouksel, A. M., Sheth, A. P., “Semantic Interoperability in Global Information Systems: A Brief Introduction to the Research Area” SIGMOD Record 28(1):5–12 1999CrossRefGoogle Scholar
  20. [22]
    Paepcke, A., Chang, C. K., Winograd, T., Garcia-Molina, H., “Interoperability for digital libraries worldwide.” Communications of the ACM 41(4), 1998, pp. 33–42.CrossRefGoogle Scholar
  21. [23]
    Phanouriou, C., Kipp, N. A., Sornil, O., Mather, P., Fox, E. A., “A Digital Library for Authors: Recent Progress of the NDLTD”, Proc. 4 th ACM Digital Libraries, 1999, pp. 20–27Google Scholar
  22. [25]
    Powell, A.L. and J.C. French, “Growth and server availability of the NCSTRL digital library.” Proc. 5 th ACM Conf. On Digital Libraries(San Antonio, June 2–7, 2000) pp. 264–265.Google Scholar
  23. [26]
    Rundensteiner, E., Koeller, A., and Zhang, X., “Maintaining Data Warehouses over Changing Information Sources”, Communications of the ACM, 43(6), 2000, pp. 57–62CrossRefGoogle Scholar
  24. [27]
    Semantic Web Activity; http://www.w3.org/2001/sw/
  25. [29]
    Watts, D. J., “Small Worlds:The Dynamics of Networks between Order and Randomness”, Princeton Univ. Press, 1999.Google Scholar
  26. [30]
    Wiederhold, G., “Mediators in the Architecture of Future Information Systems”,IEEE Computer,25(3),1992, pg. 38–49.Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2001

Authors and Affiliations

  • Marcos André Gonçalves
    • 1
  • Robert K. France
    • 1
  • Edward A. Fox
    • 1
  1. 1.Department of Computer ScienceVirginia TechBlacksburgUSA

Personalised recommendations