Integrating Provenance Data from Distributed Workflow Systems with ProvManager
Running scientific workflows in distributed environments is motivating the definition of provenance gathering approaches that are loosely coupled to the workflow execution engine. This kind of approach is interesting because it allows both storage and access to provenance data in an integrated way, even in an environment where different workflow systems work together. Therefore, we have proposed a provenance gathering strategy that is independent from the workflow system technology. This strategy has evolved into a provenance management system named ProvManager. In this paper we show how provenance data is captured along in a distributed execution environment with ProvManager and we show its web interface, in which scientists can register experiments, monitor workflow execution, and query provenance data.
Keywordsprovenance scientific workflows distributed environment
- 3.Marinho, A., Murta, L., et al.: A Strategy for Provenance Gathering in Distributed Scientific Workflows. In: IEEE International Workshop on Scientific Workflows, Los Angeles, California, United States (2009)Google Scholar
- 4.Simmhan, Y., Plale, B., Gannon, D.: A Framework for Collecting Provenance in Data-Centric Scientific Workflows. In: ICWS, pp. 427–436 (2006)Google Scholar
- 5.Groth, P., Jiang, S., et al.: An Architecture for Provenance Systems (2006), http://eprints.ecs.soton.ac.uk/13216/ (Visited in: July 19, 2010)