Storage and Use of Provenance Information for Relational Database Queries
In database querying, provenance information can help users understand where data comes from and how it is derived. Storing the provenance data is critical in the sense that, the storage cost should be as small as possible and of fine granularity, and it should support the user query on provenance tracking efficiently as well. In this demo, we have implemented a relational database system prototype which can support SQL-like query while supporting provenance data recording during query execution. In particular, we propose a tree structure to store provenance information and further propose various reduction strategies to optimize its storage cost; we support the functionality of provenance data tracking at tuple level for user queries in a visualized way.
Unable to display preview. Download preview PDF.
- 1.Chapman, A., Jagadish, H.V., Ramanan, P.: Efficient provenance storage. In: SIGMOD Conference, pp. 993–1006 (2008)Google Scholar
- 2.Cui, Y., Widom, J.: Practical lineage tracing in data warehouses. In: ICDE, pp. 367–378 (2000)Google Scholar
- 3.Koehler, H., Bao, Z., Zhou, X., Sadiq, S.: Provenance trees: Optimizing relational provenance storage. submitted to SIGMOD (2011), http://www.comp.nus.edu.sg/~baozhife/provenancetree/provenance_longversion.pdf