Advertisement

Incomplete Databases: Missing Records and Missing Values

  • Werner Nutt
  • Simon Razniewski
  • Gil Vegliach
Part of the Lecture Notes in Computer Science book series (LNCS, volume 7240)

Abstract

Data completeness is an essential aspect of data quality as in many scenarios it is crucial to guarantee the completeness of query answers. Data might be incomplete in two ways: records may be missing as a whole, or attribute values of a record may be absent, indicated by a null. We extend previous work by two of the authors [10] that dealt only with the first aspect, to cover both missing records and missing attribute values. To this end, we refine the formalization of incomplete databases and identify the important special case where values of key attributes are always known. We show that in the presence of nulls, completeness of queries can be defined in several ways.We also generalize a previous approach stating completeness of parts of a database, using so-called table completeness statements. With this formalization in place, we define the main inferences for completeness reasoning over incomplete databases and present first results.

Keywords

Completeness Statement Relation Symbol Conjunctive Query Query Answer Database Instance 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Abiteboul, S., Hull, R., Vianu, V.: Foundations of databases. Addison-Wesley (1995)Google Scholar
  2. 2.
    Codd, E.F.: Understanding relations (installment #7). FDT – Bulletin of ACM SIGMOD 7(3), 23–28 (1975)Google Scholar
  3. 3.
    Fan, W., Geerts, F.: Relative information completeness. In: PODS, pp. 97–106 (2009)Google Scholar
  4. 4.
    Fan, W., Geerts, F.: Capturing missing tuples and missing values. In: PODS, pp. 169–178 (2010)Google Scholar
  5. 5.
    Farré, C., Nutt, W., Teniente, E., Urpí, T.: Containment of Conjunctive Queries over Databases with Null Values. In: Schwentick, T., Suciu, D. (eds.) ICDT 2007. LNCS, vol. 4353, pp. 389–403. Springer, Heidelberg (2006)CrossRefGoogle Scholar
  6. 6.
    Imieliński, T., Lipski Jr., W.: Incomplete information in relational databases. J. ACM 31, 761–791 (1984)zbMATHCrossRefGoogle Scholar
  7. 7.
    Levy, A.: Obtaining complete answers from incomplete databases. In: Proc. VLDB, pp. 402–412 (1996)Google Scholar
  8. 8.
    Motro, A.: Integrity = Validity + Completeness. ACM TODS 14(4), 480–502 (1989)CrossRefGoogle Scholar
  9. 9.
    Razniewski, S., Nutt, W.: Checking query completeness over incomplete data. In: LID (2011)Google Scholar
  10. 10.
    Razniewski, S., Nutt, W.: Completeness of queries over incomplete databases. In: VLDB (2011)Google Scholar
  11. 11.
    Segoufin, L., Vianu, V.: Views and queries: Determinacy and rewriting. In: Proc. PODS, pp. 49–60 (2005)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2012

Authors and Affiliations

  • Werner Nutt
    • 1
  • Simon Razniewski
    • 1
  • Gil Vegliach
    • 1
  1. 1.Free University of Bozen-BolzanoBozenItaly

Personalised recommendations