Schema Alignment

Dong, Xin Luna; Srivastava, Divesh

doi:10.1007/978-3-031-01853-4_2

Part of the book series: Synthesis Lectures on Data Management (SLDM)

Abstract

The first component of data integration is schema alignment. As we showed in Section 1.2.3, there can be thousands to millions of data sources in the same domain, but they often describe the domain using different schemas. As an illustration, in the motivating example in Section 1.1, the four sources describe the flight domain using very different schemas: they contain different numbers of tables and different numbers of attributes; they may use different attribute names for the same attribute (e.g., Scheduled Arrival Date in Airline2.Flight vs. Scheduled in Airport3.Arrivals); they may apply different semantics for attributes with the same name (e.g., Arrival Time may mean landing time in one source and arrival-at-gate time in another source). To integrate data from different sources, the first step is to align the schemas and understand which attributes have the same semantics and which ones do not.
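As a minimal sketch of what aligning attribute names can look like, the snippet below maps the two differently named attributes from the example above onto one shared attribute. The mediated name `scheduled_arrival_date`, the helper `to_mediated`, and the sample date are hypothetical and chosen only for illustration; this is not the method developed in the chapter. Correspondences are keyed by source table as well as attribute name, since the same name (e.g., Arrival Time) may carry different semantics in different sources.

```python
# Hypothetical correspondences from (source table, attribute) to a mediated
# attribute name. The mediated name is an assumption made for illustration.
ATTRIBUTE_MAPPING = {
    ("Airline2.Flight", "Scheduled Arrival Date"): "scheduled_arrival_date",
    ("Airport3.Arrivals", "Scheduled"): "scheduled_arrival_date",
}


def to_mediated(source_table, record):
    """Rename the attributes of one source record to mediated-schema names."""
    aligned = {}
    for attribute, value in record.items():
        mediated_name = ATTRIBUTE_MAPPING.get((source_table, attribute))
        if mediated_name is not None:  # attributes without a mapping are dropped
            aligned[mediated_name] = value
    return aligned


# Placeholder values, not data from the book.
airline_row = to_mediated("Airline2.Flight", {"Scheduled Arrival Date": "2013-12-21"})
airport_row = to_mediated("Airport3.Arrivals", {"Scheduled": "2013-12-21"})
```

After alignment, both records expose the value under the same attribute name, so they can be compared directly.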

Author information

Authors and Affiliations

Xin Luna Dong, Google Inc., USA
Divesh Srivastava, AT&T Labs-Research, USA

About this chapter

Cite this chapter

Dong, X.L., Srivastava, D. (2015). Schema Alignment. In: Big Data Integration. Synthesis Lectures on Data Management. Springer, Cham. https://doi.org/10.1007/978-3-031-01853-4_2

DOI: https://doi.org/10.1007/978-3-031-01853-4_2
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-00725-5
Online ISBN: 978-3-031-01853-4
eBook Packages: Synthesis Collection of Technology (R0), eBColl Synthesis Collection 6
