Skip to main content

Integrating Data Sources Using a Standardized Global Dictionary

  • Chapter

Part of the book series: The International Series in Engineering and Computer Science ((SECS,volume 600))

Abstract

With the constantly increasing reliance on database systems to store, process, and display data comes the additional problem of using these systems properly. Most organizations have several data systems that must work together. As data warehouses, data marts, and other OLAP systems are added to the mix, the complexity of ensuring interoperability between these systems increases dramatically. Interoperability of database systems can be achieved by capturing the semantics of each system and providing a standardized framework for querying and exchanging semantic specifications.

Our work focuses on capturing the semantics of data stored in databases with the goal of integrating data sources within a company, across a network, and even on the World-Wide Web. Our approach to capturing data semantics revolves around the definition of a global dictionary that provides standardized terms for referencing and categorizing data. These standardized terms are then stored in record-based semantic specifications that store metadata and semantic descriptions of the data. Using these semantic specifications, it is possible to integrate diverse data sources even though they were not originally designed to work together.

A prototype of this integration system called the Relational Integration Model (RIM) has been built. This paper describes the architecture and benefits of the system, and its possible applications. The RIM application is currently being tested on production database systems and integration problems.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   169.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD   219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. R. Hull and G. Zhou. A framework for supporting data integration using the materialized and virtual approaches. In Proceedings of ACM SIGMOD Conference on Management of Data, Montreal, Canada, 1996, pages 481–492.

    Google Scholar 

  2. C. Batini, M. Lenzerini and S. B. Navathe. A Comparative Analysis of Methodologies for Database Schema Integration. ACM Computing Surveys, 18(4) Dec. 1986, pages 323–364.

    Article  Google Scholar 

  3. A. P. Sheth and G. Karabatis. Multidatabase interdependencies in industry. In Proceedings of ACM SIGMOD Conference on Management of Data, Washington, DC, USA, 1993, pages 481–492.

    Google Scholar 

  4. S. Castano and V. Antonellis. Semantic dictionary design for database interoperability. In Proceedings of the 13th International Conference on Data Engineering (ICDE’97), 1997, pages 43–54.

    Google Scholar 

  5. T. Kirk, A. Levy, Y. Sagiv and D. Srivastava. The Information Manifold. In AAAI Spring Symposium on Information Gathering, 1995.

    Google Scholar 

  6. C. Li, R. Yemeni, V. Vassalos, H. Garcia-Molina, Y. Papakonstantinou, J. Ullman and M. Valiveti. Capability Based Mediation in TSIMMIS. In Proceedings of ACM SIGMOD Conference on Management of Data, Seattle, WA, USA, 1998, pages 564–566.

    Google Scholar 

  7. The Metadata coalition. Metadata interchange specification. Technical Report version 1.1, August 1997.

    Google Scholar 

  8. W3C. Extensible Markup Langauge (XML) specification. Technical Report version 1.0, February 1998.

    Google Scholar 

  9. C. Collet, M. Huhns, and W-M. Shen. Resource integration using a large knowledge base in Carnot. IEEE Computer. 24(12), December 1991, pages 55–62.

    Google Scholar 

  10. A. Sheth and J. Larson. Federated database systems for managing distributed, heterogenous and autonomous databases. ACM Computing Surveys, 22(3) Sept. 1990, pages 183–236.

    Article  Google Scholar 

  11. M. Bright, A. Hurson, and S. Pakzad. Automated resolution of semantic heterogeneity in multidatabases. In ACM Transactions on Database Systems, 19(2) June 1994, pages 212–253.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2002 Kluwer Academic Publishers

About this chapter

Cite this chapter

Lawrence, R., Barker, K. (2002). Integrating Data Sources Using a Standardized Global Dictionary. In: Abramowicz, W., Zurada, J. (eds) Knowledge Discovery for Business Information Systems. The International Series in Engineering and Computer Science, vol 600. Springer, Boston, MA. https://doi.org/10.1007/0-306-46991-X_7

Download citation

  • DOI: https://doi.org/10.1007/0-306-46991-X_7

  • Publisher Name: Springer, Boston, MA

  • Print ISBN: 978-0-7923-7243-1

  • Online ISBN: 978-0-306-46991-6

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics