Modelling Provenance Using Structured Occurrence Networks

  • Paolo Missier
  • Brian Randell
  • Maciej Koutny
Part of the Lecture Notes in Computer Science book series (LNCS, volume 7525)

Abstract

Occurrence Nets (ON) are directed acyclic graphs that represent causality and concurrency information concerning a single execution of a system. Structured Occurrence Nets (SONs) extend ONs by adding new relationships, which provide a means of recording the activities of multiple interacting, and evolving, systems. Although the initial motivations for their development focused on the analysis of system failures, their structure makes them a natural candidate as a model for expressing the execution traces of interacting systems. These traces can then be exhibited as the provenance of the data produced by the systems under observation. In this paper we present a number of patterns that make use of SONs to provide principled modelling of provenance. We discuss some of the benefits of this modelling approach, and briefly compare it with others that have been proposed recently. SON-based modelling of provenance combines simplicity with expressiveness, leading to provenance graphs that capture multiple levels of abstraction in the description of a process execution, are easy to understand and can be analysed using the partial order techniques underpinning their behavioural semantics.

Keywords

Execution Trace Asynchronous System Temporal Abstraction Message Sequence Chart Strand Space 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

  1. [ABJF06]
    Altintas, I., Barney, O., Jaeger-Frank, E.: Provenance Collection Support in the Kepler Scientific Workflow System. In: Moreau, L., Foster, I. (eds.) IPAW 2006. LNCS, vol. 4145, pp. 118–132. Springer, Heidelberg (2006)CrossRefGoogle Scholar
  2. [BD87]
    Best, E., Devillers, R.: Sequential and concurrent behaviour in Petri net theory. Theoretical Computer Science 55(1), 87–136 (1987)MathSciNetMATHCrossRefGoogle Scholar
  3. [DGST09]
    Deelman, E., Gannon, D., Shields, M., Taylor, I.: Workflows and e-Science: An overview of workflow system features and capabilities. Future Generation Computer Systems 25(5), 528–540 (2009)CrossRefGoogle Scholar
  4. [GG11]
    Garijo, D., Gil, Y.: A New Approach for Publishing Workflows: Abstractions, Standards, and Linked Data. In: Proceedings of the Sixth Workshop on Workflows in Support of Large-Scale Science (WORKS 2011), held in conjunction with SC 2011, Seattle, Washington (2011)Google Scholar
  5. [HSBMR08]
    Holland, D.A., Seltzer, M.I., Braun, U., Muniswamy-Reddy, K.-K.: PASSing the provenance challenge. Concurrency and Computation: Practice and Experience 20, 531–540 (2008)CrossRefGoogle Scholar
  6. [HT04]
    Harel, D., Thiagarajan, P.: Message Sequence Charts. In: Lavagno, L., Martin, G., Selic, B. (eds.) UML for Real, pp. 77–105. Springer US (2004)Google Scholar
  7. [KK11]
    Kleijn, J., Koutny, M.: Causality in Structured Occurrence Nets. In: Jones, C.B., Lloyd, J.L. (eds.) Dependable and Historic Computing. LNCS, vol. 6875, pp. 283–297. Springer, Heidelberg (2011)CrossRefGoogle Scholar
  8. [KR09]
    Koutny, M., Randell, B.: Structured Occurrence Nets: A Formalism for Aiding System Failure Prevention and Analysis Techniques. Fundamenta Informaticae 97 (2009)Google Scholar
  9. [KSM+11]
    Koop, D., Santos, E., Mates, P., Vo, H.T., Bonnet, P., Bauer, B., Surer, B., Troyer, M., Williams, D.N., Tohline, J.E., Freire, J., Silva, C.T.: A Provenance-Based Infrastructure to Support the Life Cycle of Executable Papers. Procedia CS 4, 648–657 (2011)CrossRefGoogle Scholar
  10. [LLCF10]
    Lim, C., Lu, S., Chebotko, A., Fotouhi, F.: Prospective and Retrospective Provenance Collection in Scientific Workflow Environments. In: 2010 IEEE International Conference on Services Computing (SCC), pp. 449–456 (July 2010)Google Scholar
  11. [LP95]
    Lee, E.A., Parks, T.M.: Dataflow Process Networks. Memorandum 5, UC Berkeley EECS Dept. (1995)Google Scholar
  12. [MCF+11]
    Moreau, L., Clifford, B., Freire, J., Futrelle, J., Gil, Y., Groth, P., Kwasnikowska, N., Miles, S., Missier, P., Myers, J., Plale, B., Simmhan, Y., Stephan, E., Van Den Bussche, J.: The Open Provenance Model — Core Specification (v1.1). Future Generation Computer Systems 7(21), 743–756 (2011)CrossRefGoogle Scholar
  13. [MMW11]
    Marinho, A., Murta, L., Werner, C.: ProvManager: a provenance management system for scientific workflows. Concurrency and Computation: Practice and Experience, n/a–n/a (2011)Google Scholar
  14. [MPB10]
    Missier, P., Paton, N., Belhajjame, K.: Fine-grained and efficient lineage querying of collection-based workflow provenance. In: Procs. EDBT, Lausanne, Switzerland (2010)Google Scholar
  15. [MSZ+10]
    Missier, P., Sahoo, S.S., Zhao, J., Goble, C., Sheth, A.: Janus: From Workflows to Semantic Provenance and Linked Open Data. In: McGuinness, D.L., Michaelis, J.R., Moreau, L. (eds.) IPAW 2010. LNCS, vol. 6378, pp. 129–141. Springer, Heidelberg (2010)CrossRefGoogle Scholar
  16. [Ran11]
    Randell, B.: Occurrence Nets Then and Now: The Path to Structured Occurrence Nets. In: Kristensen, L.M., Petrucci, L. (eds.) PETRI NETS 2011. LNCS, vol. 6709, pp. 1–16. Springer, Heidelberg (2011)CrossRefGoogle Scholar
  17. [Sim08]
    Simmhan, Y.L., Plale, B., Gannon, D.: Karma2: Provenance management for data driven workflows. International Journal of Web Services Research 5(1) (2008)Google Scholar
  18. [SVKF08]
    Scheidegger, C.E., Vo, H.T., Koop, D., Freire, J.: Querying and Re-Using Workflows with VisTrails. In: Procs. SIGMOD, pp. 1251–1254 (2008)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2012

Authors and Affiliations

  • Paolo Missier
    • 1
  • Brian Randell
    • 1
  • Maciej Koutny
    • 1
  1. 1.School of Computing ScienceNewcastle UniversityNewcastle upon TyneUK

Personalised recommendations