Abstract
With the constantly increasing reliance on database systems to store, process, and display data comes the additional problem of using these systems properly. Most organizations have several data systems that must work together. As data warehouses, data marts, and other OLAP systems are added to the mix, the complexity of ensuring interoperability between these systems increases dramatically. Interoperability of database systems can be achieved by capturing the semantics of each system and providing a standardized framework for querying and exchanging semantic specifications.
Our work focuses on capturing the semantics of data stored in databases with the goal of integrating data sources within a company, across a network, and even on the World-Wide Web. Our approach to capturing data semantics revolves around the definition of a global dictionary that provides standardized terms for referencing and categorizing data. These standardized terms are then stored in record-based semantic specifications that store metadata and semantic descriptions of the data. Using these semantic specifications, it is possible to integrate diverse data sources even though they were not originally designed to work together.
A prototype of this integration system called the Relational Integration Model (RIM) has been built. This paper describes the architecture and benefits of the system, and its possible applications. The RIM application is currently being tested on production database systems and integration problems.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
R. Hull and G. Zhou. A framework for supporting data integration using the materialized and virtual approaches. In Proceedings of ACM SIGMOD Conference on Management of Data, Montreal, Canada, 1996, pages 481–492.
C. Batini, M. Lenzerini and S. B. Navathe. A Comparative Analysis of Methodologies for Database Schema Integration. ACM Computing Surveys, 18(4) Dec. 1986, pages 323–364.
A. P. Sheth and G. Karabatis. Multidatabase interdependencies in industry. In Proceedings of ACM SIGMOD Conference on Management of Data, Washington, DC, USA, 1993, pages 481–492.
S. Castano and V. Antonellis. Semantic dictionary design for database interoperability. In Proceedings of the 13th International Conference on Data Engineering (ICDE’97), 1997, pages 43–54.
T. Kirk, A. Levy, Y. Sagiv and D. Srivastava. The Information Manifold. In AAAI Spring Symposium on Information Gathering, 1995.
C. Li, R. Yemeni, V. Vassalos, H. Garcia-Molina, Y. Papakonstantinou, J. Ullman and M. Valiveti. Capability Based Mediation in TSIMMIS. In Proceedings of ACM SIGMOD Conference on Management of Data, Seattle, WA, USA, 1998, pages 564–566.
The Metadata coalition. Metadata interchange specification. Technical Report version 1.1, August 1997.
W3C. Extensible Markup Langauge (XML) specification. Technical Report version 1.0, February 1998.
C. Collet, M. Huhns, and W-M. Shen. Resource integration using a large knowledge base in Carnot. IEEE Computer. 24(12), December 1991, pages 55–62.
A. Sheth and J. Larson. Federated database systems for managing distributed, heterogenous and autonomous databases. ACM Computing Surveys, 22(3) Sept. 1990, pages 183–236.
M. Bright, A. Hurson, and S. Pakzad. Automated resolution of semantic heterogeneity in multidatabases. In ACM Transactions on Database Systems, 19(2) June 1994, pages 212–253.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Kluwer Academic Publishers
About this chapter
Cite this chapter
Lawrence, R., Barker, K. (2002). Integrating Data Sources Using a Standardized Global Dictionary. In: Abramowicz, W., Zurada, J. (eds) Knowledge Discovery for Business Information Systems. The International Series in Engineering and Computer Science, vol 600. Springer, Boston, MA. https://doi.org/10.1007/0-306-46991-X_7
Download citation
DOI: https://doi.org/10.1007/0-306-46991-X_7
Publisher Name: Springer, Boston, MA
Print ISBN: 978-0-7923-7243-1
Online ISBN: 978-0-306-46991-6
eBook Packages: Springer Book Archive