Storage and Use of Provenance Information for Relational Database Queries

  • Zhifeng Bao
  • Henning Koehler
  • Xiaofang Zhou
  • Tok Wang Ling
Part of the Lecture Notes in Computer Science book series (LNCS, volume 6588)

Abstract

In database querying, provenance information can help users understand where data comes from and how it is derived. Storing the provenance data is critical in the sense that, the storage cost should be as small as possible and of fine granularity, and it should support the user query on provenance tracking efficiently as well. In this demo, we have implemented a relational database system prototype which can support SQL-like query while supporting provenance data recording during query execution. In particular, we propose a tree structure to store provenance information and further propose various reduction strategies to optimize its storage cost; we support the functionality of provenance data tracking at tuple level for user queries in a visualized way.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Chapman, A., Jagadish, H.V., Ramanan, P.: Efficient provenance storage. In: SIGMOD Conference, pp. 993–1006 (2008)Google Scholar
  2. 2.
    Cui, Y., Widom, J.: Practical lineage tracing in data warehouses. In: ICDE, pp. 367–378 (2000)Google Scholar
  3. 3.
    Koehler, H., Bao, Z., Zhou, X., Sadiq, S.: Provenance trees: Optimizing relational provenance storage. submitted to SIGMOD (2011), http://www.comp.nus.edu.sg/~baozhife/provenancetree/provenance_longversion.pdf

Copyright information

© Springer-Verlag Berlin Heidelberg 2011

Authors and Affiliations

  • Zhifeng Bao
    • 1
  • Henning Koehler
    • 2
  • Xiaofang Zhou
    • 2
  • Tok Wang Ling
    • 1
  1. 1.School of ComputingNational University of SingaporeSingapore
  2. 2.University of QueenslandAustralia

Personalised recommendations