Skip to main content

Managing Data Integration Uncertainty

  • Living reference work entry
  • First Online:
  • 40 Accesses

Synonyms

Probabilistic schema alignment

Definition

Consider a set of source schemas \({\mathcal {S}} = \{S_1, \dots , S_n\}\) in the same domain, where different schemas may describe the domain in different ways. An important component in data integration is schema alignment, including three steps: (1) creating a mediated schema M that provides a unified and virtual view of the disparate sources and captures the salient aspects of the domain being considered, (2) generating attribute matching that matches attributes in each source schema S i , i ∈ [1, n], to the corresponding attributes in the mediated schema M, and (3) building a schema mapping between each source schema S i and the mediated schema Mto specify the semantic relationships between the contents of the source and that of the mediated data. The result schema mappings are used to reformulate a user query into a set of queries on the underlying data sources for query answering. Uncertainty can arise in every step of schema...

This is a preview of subscription content, log in via an institution.

References

  1. Franklin M, Halevy AY, Maier D. From databases to dataspaces: a new abstraction for information management. Sigmod Record. 2005;34(4):27–33.

    Google Scholar 

  2. Sarma AD, Dong XL, Halevy A. Bootstrapping pay-as-you-go data integration systems. In: Sigmod; 2008. p. 861–74.

    Google Scholar 

  3. Dong X, Halevy AY, Yu C. Data integration with uncertainties. In: Proceedings of VLDB; 2007. p. 687–98.

    Google Scholar 

  4. Gal A, Anaby-Tavor A, Trombetta A, Montesi D. A framework for modeling and evaluating automatic semantic reconciliation. VLDB J. 2003;14:50–67.

    Google Scholar 

  5. Gal A, Martinez MV, Simari GI, Subrahmanian VS. Aggregate query answering under uncertain schema mappings. In: ICDE; 2009. p. 940–51.

    Google Scholar 

  6. Dong XL, Gabrilovich E, Heitz G, Horn W, Lao N, Murphy K, et al. Knowledge vault: a web-scale approach to probabilistic knowledge fusion. In: SIGKDD; 2014.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Alon Halevy .

Editor information

Editors and Affiliations

Section Editor information

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer Science+Business Media LLC

About this entry

Cite this entry

Dong, X.L., Halevy, A. (2017). Managing Data Integration Uncertainty. In: Liu, L., Özsu, M. (eds) Encyclopedia of Database Systems. Springer, New York, NY. https://doi.org/10.1007/978-1-4899-7993-3_80743-1

Download citation

  • DOI: https://doi.org/10.1007/978-1-4899-7993-3_80743-1

  • Published:

  • Publisher Name: Springer, New York, NY

  • Print ISBN: 978-1-4899-7993-3

  • Online ISBN: 978-1-4899-7993-3

  • eBook Packages: Springer Reference Computer SciencesReference Module Computer Science and Engineering

Publish with us

Policies and ethics