Provenance Management in BioSciences
Data provenance is becoming increasingly important for biosciences with the advent of large-scale collaborative environments such as the iPlant collaborative, where scientists collaborate by using data that they themselves did not generate. To facilitate the widespread use and sharing of provenance, ontologies of provenance need to be developed to enable the capture and standardized representation of provenance for biosciences. Working with researchers from the iPlant Tree of Life (iPToL) Grand Challenge Project, we developed a domain ontology of provenance for phylogenetic analysis. Relying on the conceptual graph formalism, we describe the process of developing the provenance ontology based on the W7 model, a generic ontology of data provenance. This domain ontology provides a structured model for harvesting, storing and querying provenance. We also illustrate how the harvested data provenance based on our ontology can be used for different purposes.
KeywordsProvenance tree of life W7 model conceptual graphs
Unable to display preview. Download preview PDF.
- 1.Simmhan, Y., Plale, B., Gannon, D.: A Survey of Data Provenance Techniques, Indiana University, Technical Report IUB-CS-TR618 (2005)Google Scholar
- 4.Ram, S., Liu, J.: A New Perspective on Semantics of Data Provenance. Presented at the First International Workshop on the Role of Semantic Web in Provenance Management, Washington D.C., (2009)Google Scholar