Abstract
Data Provenance refers to the lineage of data including its origin, key events that occur over the course of its lifecycle, and other details associated with data creation, processing, and archiving. We believe that tracking provenance enables users to share, discover, and reuse the data, thus streamlining collaborative activities, reducing the possibility of repeating dead ends, and facilitating learning. It also provides a mechanism to transition from static to active conceptual modeling. The primary goal of our research is to investigate the semantics or meaning of data provenance. We describe the W7 model that represents different components of provenance and their relationships to each other. We conceptualize provenance as a combination of seven interconnected elements including “what”, “when”, “where”, “how”, “who”, “which” and “why”. Each of these components may be used to track events that affect data during its lifetime. A homeland security example illustrates how current conceptual models can be extended to embed provenance.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Pearson, D.: The Grid: Requirements for Establishing the Provenance of Derived Data. In: Workshop on Data Derivation and Provenance, Chicago, Illinois (2002)
Buneman, P., Khanna, S., Tan, W.C.: Data Provenance: Some Basic Issues. In: Kapoor, S., Prasad, S. (eds.) FST TCS 2000. LNCS, vol. 1974, Springer, Heidelberg (2000)
Frew, J., Bose, R.: Earth System Science Workbench: A Data Management Infrastructure for Earth Science Products. In: The 13th International Conference on Scientific and Statistical Database Management, Fairfax, VA (2001)
Buneman, P., Khanna, S., Tan, W.C.: Why and Where: A Characterization of Data Provenance. In: Van den Bussche, J., Vianu, V. (eds.) ICDT 2001. LNCS, vol. 1973, Springer, Heidelberg (2000)
Bunge, M.: Treatise on Basic Philosophy. In: Ontology I: The Furniture of the World, vol. 3, Reidel, Boston, MA (1977)
Bunge, M.: Treatise on Basic Philosophy. In: Ontology II: A World of Systems, vol. 4, Reidel, Boston, MA (1979)
Wand, Y., Weber, R.: On the deep structure of information systems. Information Systems Journal 5 (1995)
Davis, L.: Theory of action. Prentice-Hall, Englewood Cliffs, NJ (1979)
Chisholm, R.: The Agent as Cause. In: Brand, M., Walton, D. (eds.) Action Theory, D. Reidel, Dordrecht (1975)
Ram, S.: Intelligent Database Design Using the Unifying Semantic Model. Information and Management 29, 191–206 (1995)
Chen, P.P.: The entity-relationship model - toward a unified view of data. ACM Trans. Database Syst. 1, 9–36 (1976)
Snodgrass, R.T., Ahn, I.: Temporal Databases. Computer 19, 35–42 (1986)
Khatri, V., Ram, S., Snodgrass, R.: Augmenting a Conceptual Model with Geospatio-temporal Annotations. IEEE Trans. Knowledge and Data Eng. 16, 1324–1338 (2004)
Koubarakis, M., Plexousakis, D.: A formal framework for business process modeling and design. Information Systems 27, 299–319 (2002)
Curtis, B., Kellner, M., Over, J.: Process modeling. Communication of ACM 35, 75–90 (1992)
Georgeff, M., Pell, B., Pollack, M., Tambe, M., Wooldridge, M.: The Belief-Desire-Intention Model of Agency. In: The 5th International Workshop on Intelligent Agent: Agent Theories, Architectures, and Languages, Paris, France (1999)
Konolige, K., Pollack, M.E.: A Representationalist Theory of Intention. In: IJCAI 1993. The Thirteenth International Joint Conference on Artificial Intelligence (1993)
Allen, G., March, S.: Modeling Temporal Dynamics for Business Systems. Journal of Database Management 14, 21–36 (2003)
Chen, P.P., Thalheim, B., Wong, L.: Future direction of conceptual modeling. In: Chen, P.P., Akoka, J., Kangassalu, H., Thalheim, B. (eds.) Conceptual Modeling. LNCS, vol. 1565, pp. 294–308. Springer, Heidelberg (1999)
English, L.: Information Quality: Critical Ingredient for National Security. Journal of Database Management 16, 18–32 (2005)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Ram, S., Liu, J. (2007). Understanding the Semantics of Data Provenance to Support Active Conceptual Modeling. In: Chen, P.P., Wong, L.Y. (eds) Active Conceptual Modeling of Learning. ACM-L 2006. Lecture Notes in Computer Science, vol 4512. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-77503-4_3
Download citation
DOI: https://doi.org/10.1007/978-3-540-77503-4_3
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-77502-7
Online ISBN: 978-3-540-77503-4
eBook Packages: Computer ScienceComputer Science (R0)