Chapter

Provenance and Annotation of Data and Processes

Volume 6378 of the series Lecture Notes in Computer Science pp 286-288

Integrating Provenance Data from Distributed Workflow Systems with ProvManager

  • Anderson MarinhoAffiliated withLancaster UniversityFederal University of Rio de Janeiro
  • , Leonardo MurtaAffiliated withLancaster UniversityFluminense Federal University
  • , Cláudia WernerAffiliated withLancaster UniversityFederal University of Rio de Janeiro
  • , Vanessa BraganholoAffiliated withLancaster UniversityFederal University of Rio de Janeiro
  • , Eduardo OgasawaraAffiliated withLancaster UniversityFederal University of Rio de Janeiro
  • , Sérgio Manuel Serra da CruzAffiliated withLancaster UniversityFederal University of Rio de Janeiro
  • , Marta MattosoAffiliated withLancaster UniversityFederal University of Rio de Janeiro

Abstract

Running scientific workflows in distributed environments is motivating the definition of provenance gathering approaches that are loosely coupled to the workflow execution engine. This kind of approach is interesting because it allows both storage and access to provenance data in an integrated way, even in an environment where different workflow systems work together. Therefore, we have proposed a provenance gathering strategy that is independent from the workflow system technology. This strategy has evolved into a provenance management system named ProvManager. In this paper we show how provenance data is captured along in a distributed execution environment with ProvManager and we show its web interface, in which scientists can register experiments, monitor workflow execution, and query provenance data.

Keywords

provenance scientific workflows distributed environment