Managing Provenance in iRODS

  • Andrea Weise
  • Adil Hasan
  • Mark Hedges
  • Jens Jensen
Part of the Lecture Notes in Computer Science book series (LNCS, volume 5545)

Abstract

Nowadays provenance is an important issue. Provenance data does not only give a history of events, it also provides enough information to allow the opportunity to verify the authenticity of the data, as well as, determine the quality of the data. The data grid management system, iRODS, comes with metadata which can be used as provenance data. Currently, iRODS’s metadata is not sufficient for tracking and reconstructing procedures applied to data. In this paper, we describe the provenance needs of iRODS and we survey briefly current provenance and provenance enabled workflow systems. We describe an architecture that can be used to manage provenance in iRODS (and other systems) in a fault-tolerant way.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
  2. 2.
    Rajasekar, A., Wan, M., Moore, R., Schroeder, W., Kremenek, G., Jagatheesan, A., Cowart, C., Zhu, B., Chen, S.Y., Olschanowsky, R.: Storage resource broker - managing distributed data in a grid. Technical report, San Diego Supercomputer Center (SDSC), University of CaliforniaGoogle Scholar
  3. 3.
  4. 4.
  5. 5.
    Simmhan, Y.L., Plale, B., Gannon, D.: A survey of data provenance techniques. Technical report (2005)Google Scholar
  6. 6.
    Freire, J., Koop, D., Santos, E., Silva, C.T.: Provenance for computational tasks: A survey. Computing in Science & Engineering 10(3), 11–21 (2008)CrossRefGoogle Scholar
  7. 7.
    Simmhan, Y.L., Plale, B., Gannon, D.: A framework for collecting provenance in data-centric scientific workflows. In: IEEE International Conference on Web Services, pp. 427–436 (2006)Google Scholar
  8. 8.
  9. 9.
    Barga, R.S., Digiampietri, L.A.: Automatic capture and efficient storage of e-science experiment provenance. Concurrency and Computation: Practice and Experience 20(5), 419–429 (2008)CrossRefGoogle Scholar
  10. 10.
    Foster, I., Vöckler, J.S., Wilde, M., Zhao, Y.: Chimera: A virtual data system for representing, querying, and automating data derivation. In: SSDBM 2002: Proceedings of the 14th International Conference on Scientific and Statistical Database Management, Washington, DC, USA, pp. 37–46. IEEE Computer Society, Los Alamitos (2002)CrossRefGoogle Scholar
  11. 11.
    R-gma: Relational grid monitoring architecture, http://www.r-gma.org/
  12. 12.
    Simmhan, Y.L., Plale, B., Gannon, D.: Karma2: Provenance management for data-driven workflows. Int. J. Web Service Res. 5(2), 1–22 (2008)Google Scholar
  13. 13.
    Eugster, P.T., Felber, P.A., Guerraoui, R., Kermarrec, A.M.: The many faces of publish/subscribe. ACM Computing Surveys 35, 114–131 (2003)CrossRefGoogle Scholar
  14. 14.
    Moreau, L., Ibbotson, J.: The EU Provenance Project: Enabling and Supporting Provenance in Grids for Complex Problems (Final Report). Technical report, The EU Provenance Consortium (2006)Google Scholar
  15. 15.
    Groth, P., Luck, M., Moreau, L.: Formalising a protocol for recording provenance in grids. In: The UK OST e-Science second All Hands Meeting 2004, AHM 2004 (2004)Google Scholar
  16. 16.
    Kataoka, M., Toumura, K., Okita, H., Yamamoto, J., Suzuki, T.: Distributed cache system for large-scale networks. In: International Multi-Conference on Computing in the Global Information Technology, 2006. ICCGI 2006, p. 40 (August 2006)Google Scholar
  17. 17.
    Sioutas, S., Sakkopoulos, E., Drossos, L., Sirmakessis, S.: Balanced distributed web service lookup system. J. Netw. Comput. Appl. 31(2), 149–162 (2008)CrossRefGoogle Scholar
  18. 18.
    The provenance architecture client side library, http://www.gridprovenance.org/software/CSLPage.html

Copyright information

© Springer-Verlag Berlin Heidelberg 2009

Authors and Affiliations

  • Andrea Weise
    • 1
  • Adil Hasan
    • 2
  • Mark Hedges
    • 3
  • Jens Jensen
    • 4
  1. 1.Centre for Advanced Computing and Emerging Technologies (ACET)University of ReadingUK
  2. 2.English DepartmentLiverpool UniversityUK
  3. 3.Centre for e-ResearchKing’s College LondonUK
  4. 4.Science and Technology Facilities Council,Rutherford Appleton LaboratoryUK

Personalised recommendations