Definition
In (virtual) data integration, also known as enterprise information integration, queries are posed over a virtual mediated schema and answered on-the-fly using data from remote sources, which may themselves be DBMSs, Web sites, or applications. This requires two main stages that of query reformulation where the user’s query is composed with schema mappings to produce a combined (distributed) query and query optimization and execution where the query is executed efficiently across the sources.
The query optimization and execution problem for data integration is, in principle, quite similar to that for distributed databases. However, it is actually significantly more complex because (1) remote data sources may have different data models and their own query capabilities; (2) statistics on the data at each source may be unavailable; (3) remote data sources may require the requestor...
Recommended Reading
Smith JM, Bernstein PA, Dayal U, Goodman N, Landers TA, Lin KWT, Wong E. Multibase: integrating heterogeneous database systems. In: AFIPS Nat’l Computer Conference; 1981. p. 487–99.
Levy AY, Rajaraman A, Ordille JJ. Querying heterogeneous information sources using source descriptions. In: VLDB; 1996. p. 251–62.
Arens Y, Knoblock CA. SIMS: retrieving and integrating information from multiple sources. In: SIGMOD; 1993. p. 562–3.
Doan A, Halevy A, Ives Z. Query processing. In: Principles of data integration. Waltham: Morgan Kaufmann; 2012. p. 209–41.
Deshpande A, Ives Z, Raman V. Adaptive query processing. Found Trends Database Syst. 2007;1:1.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Section Editor information
Rights and permissions
Copyright information
© 2016 Springer Science+Business Media LLC
About this entry
Cite this entry
Ives, Z.G. (2016). Query Processing in Data Integration Systems. In: Liu, L., Özsu, M. (eds) Encyclopedia of Database Systems. Springer, New York, NY. https://doi.org/10.1007/978-1-4899-7993-3_80668-1
Download citation
DOI: https://doi.org/10.1007/978-1-4899-7993-3_80668-1
Received:
Accepted:
Published:
Publisher Name: Springer, New York, NY
Online ISBN: 978-1-4899-7993-3
eBook Packages: Springer Reference Computer SciencesReference Module Computer Science and Engineering