Abstract
Data integration systems provide access to a set of heterogeneous, autonomous data sources through a so-called global, or mediated, view. There is a general consensus that the best way to describe the global view is through a conceptual data model, and that there are basically two approaches to designing a data integration system. In the global-as-view approach, one defines the concepts in the global schema as views over the sources, whereas in the local-as-view approach, one characterizes the sources as views over the global schema. It is well known that processing queries in the latter approach is similar to query answering with incomplete information and is therefore a complex task. By contrast, query processing is commonly believed to be much easier in the former approach. In this paper we show the surprising result that, when the global schema is expressed in terms of a conceptual data model, even a very simple one, query processing becomes difficult in the global-as-view approach as well. We demonstrate that the problem of incomplete information arises in this case too, and we illustrate some basic techniques for effectively answering queries posed to the global schema of the data integration system.
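The point made in the abstract can be illustrated with a small sketch. It is not the paper's algorithm: the source names (`s1`, `s2`, `s3`), the data, and the two schema constraints (Student ISA Person, and every Student is enrolled in some university) are assumptions chosen for illustration. The sketch compares plain unfolding of a global-as-view mapping with the result obtained once the assumed constraints are taken into account, where an unknown university is represented by a labelled null.

```python
from itertools import count

# Hypothetical source extensions (illustrative data, not from the paper).
sources = {
    "s1": {("ann",)},             # persons known to source s1
    "s2": {("bob",)},             # students known to source s2
    "s3": {("ann", "sapienza")},  # enrolment pairs known to source s3
}

# Global-as-view mapping: each global concept is a view over one source.
gav_mapping = {"Person": "s1", "Student": "s2", "Enrolled": "s3"}


def retrieve():
    """Unfold the mapping: populate each global concept from its source."""
    return {concept: set(sources[src]) for concept, src in gav_mapping.items()}


def apply_constraints(db):
    """Apply the two assumed constraints once each (a tiny chase-like step)."""
    fresh = count(1)
    out = {concept: set(rows) for concept, rows in db.items()}
    # Assumed constraint 1: Student ISA Person.
    out["Person"] |= out["Student"]
    # Assumed constraint 2: every Student is enrolled in some university.
    enrolled = {student for (student, _) in out["Enrolled"]}
    for (student,) in sorted(out["Student"]):
        if student not in enrolled:
            # The university exists but is unknown: use a labelled null.
            out["Enrolled"].add((student, f"_null{next(fresh)}"))
    return out


naive = retrieve()
completed = apply_constraints(naive)

persons_naive = {p for (p,) in naive["Person"]}
persons_with_constraints = {p for (p,) in completed["Person"]}
enrolled_somewhere = {s for (s, _) in completed["Enrolled"]}
known_universities = {
    u for (_, u) in completed["Enrolled"] if not u.startswith("_null")
}
```

Under these assumptions, plain unfolding finds only `ann` as a person, while the constraints also make `bob` a person who is enrolled somewhere, even though no source says where. That unknown value is the kind of incomplete information the abstract refers to.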
Copyright information
© 2001 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Calì, A., Calvanese, D., De Giacomo, G., Lenzerini, M. (2001). Accessing Data Integration Systems through Conceptual Schemas. In: S.Kunii, H., Jajodia, S., Sølvberg, A. (eds) Conceptual Modeling — ER 2001. ER 2001. Lecture Notes in Computer Science, vol 2224. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45581-7_21
DOI: https://doi.org/10.1007/3-540-45581-7_21
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-42866-4
Online ISBN: 978-3-540-45581-3