Provenance-Based Validation of E-Science Experiments

  • Sylvia C. Wong
  • Simon Miles
  • Weijian Fang
  • Paul Groth
  • Luc Moreau
Part of the Lecture Notes in Computer Science book series (LNCS, volume 3729)


E-Science experiments typically involve many distributed services maintained by different organisations. After an experiment has been executed, it is useful for a scientist to verify that the execution was performed correctly or is compatible with some existing experimental criteria or standards. Scientists may also want to review and verify experiments performed by their colleagues. There are no exsiting frameworks for validating such experiments in today’s e-Science systems. Users therefore have to rely on error checking performed by the services, or adopt other ad hoc methods. This paper introduces a platform-independent framework for validating workflow executions. The validation relies on reasoning over the documented provenance of experiment results and semantic descriptions of services advertised in a registry. This validation process ensures experiments are performed correctly, and thus results generated are meaningful. The framework is tested in a bioinformatics application that performs protein compressibility analysis.


Policy Language Semantic Annotation Semantic Reasoning Interaction Validity Escape Analysis 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


  1. 1.
    Foster, I., Kesselman, C., Tuecke, S.: The anatomy of the grid: Enabling scalable virtual organizations. International Journal of High Performance Computing Applications 15, 200–222 (2001)CrossRefGoogle Scholar
  2. 2.
    Gil, Y., Deelman, E., Blythe, J., Kesselman, T.H.: Artificial intelligence and grids: workflow planning and beyond. IEEE Intelligent Systems 19, 26–33 (2004)CrossRefGoogle Scholar
  3. 3.
    Consortium, T.G.O.: The Gene Ontology (GO) database and informatics resource. Nucleic Acids Research 32, 258–261 (2004)CrossRefGoogle Scholar
  4. 4.
    Christensen, E., Curbera, F., Meredith, G., Weerawarana, S.: Web services description language (WSDL) 1.1. Technical report, W3C Note (2001),
  5. 5.
    Fallside, D.C., Walmsley, P.: XML schema part 0: Primer 2nd. edn. Technical report, W3C Recommendation (2004),
  6. 6.
    Mitra, N.: SOAP version 1.2 part 0: Primer. Technical report, W3C Recommendation (2004),
  7. 7.
    Martin, D., Burstein, M., Hobbs, J., Lassila, O., McDermott, D., McIlraith, S., Narayanan, S., Paolucci, M., Parsia, B., Payne, T., Sirin, E., Srinivasan, N., Sycara, K.: OWL-S: Semantic markup for web services. Technical report, W3C Member Submission (2004),
  8. 8.
    Wroe, C., Goble, C., Greenwood, M., Lord, P., Miles, S., Papay, J., Payne, T., Moreau, L.: Automating experiments using semantic data on a bioinformatics grid. IEEE Intelligent Systems 19, 48–55 (2004)Google Scholar
  9. 9.
    Miles, S., Papay, J., Luck, M., Moreau, L.: Towards a protocol for the attachment of metadata to grid service descriptions and its use in semantic discovery. Scientific Programming 12, 201–211 (2004)Google Scholar
  10. 10.
    Lee, M., Han, D., Shim, J.: Set-based access conflicts analysis of concurrent workflow definition. In: Proceedings of Third International Symposium on Cooperative Database Systems and Applications, Beijing, China, pp. 189–196 (2001)Google Scholar
  11. 11.
    Baresi, L., Maurino, A., Modafferi, S.: Workflow partitioning in mobile information systems. In: Proceedings of IFIP TC8 Working Conference on Mobile Information Systems (MOBIS 2004), Oslo, Norway, pp. 93–106. Springer, Heidelberg (2004)Google Scholar
  12. 12.
    Eshuis, R., Wieringa, R.: Verification support for workflow design with uml activity graphs. In: Proceedings of the 24th International Conference on Software Engineering, pp. 166–176 (2002)Google Scholar
  13. 13.
    Yang, L., Bundy, A., Berry, D., Huczynska, S.: Inferring quality of service properties for grid applications. In: CS poster, EPSRC e-Science Meeting, Edinburgh, UK, NeSC, static analysis of workflows (2004)Google Scholar
  14. 14.
    Blythe, J., Deelman, E., Gil, Y.: Planning for workflow construction and maintenance on the grid. In: ICAPS 2003 workshop on planning for web services (2003)Google Scholar
  15. 15.
    Uszok, A., Bradshaw, J.M., Jeffers, R.: KAOS: A policy and domain services framework for grid computing and semantic web services. In: Jensen, C., Poslad, S., Dimitrakos, T. (eds.) iTrust 2004. LNCS, vol. 2995, pp. 16–26. Springer, Heidelberg (2004)CrossRefGoogle Scholar
  16. 16.
    Miles, S., Groth, P., Branco, M., Moreau, L.: The requirements of recording and using provenance in e-science experiments. Technical report, Electronics and Computer Science, University of Southampton (2005)Google Scholar
  17. 17.
    Wroe, C., Stevens, R., Goble, C., Roberts, A., Greenwood, M.: A suite of DAML+OIL ontologies to describe bioinformatics web services and data. International Journal of Cooperative Information Systems 12, 197–224 (2003)CrossRefGoogle Scholar
  18. 18.
    Horrocks, I., Patel-Schneider, P.F., Boley, H., Tabet, S., Grosof, B., Dean, M.: SWRL: A semantic web rule language combining OWL and RULEML. Technical report, DARPA Agent Markup Language (DAML) Program (2003),
  19. 19.
    Schmidt-Schauss, M.: Subsumption in KL-ONE is undecidable. In: Brachman, R.J., Levesque, H.J., Reiter, R. (eds.) Proceedings of the 1st International Conference on the Principles of Knowledge Representation and Reasoning (KR 1989), pp. 421–431. Morgan Kaufmann, San Francisco (1989)Google Scholar
  20. 20.
    Moreau, L., Bradshaw, J., Breedy, M., Bunch, L., Johnson, M., Kulkarni, S., Lott, J., Suri, N., Uszok, A.: Behavioural specification of grid services with the KAOS policy language. In: Proceedings of Cluster Computing and Grid (CCGrid), Cardiff, UK (2005)Google Scholar
  21. 21.
    Groth, P., Miles, S., Fang, W., Wong, S.C., Zauner, K.P., Moreau, L.: Recording and using provenance in a protein compressibility experiment. In: Proceedings of the 14th IEEE International Symposium on High Performance Distributed Computing (HPDC-14), NC, USA (2005)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2005

Authors and Affiliations

  • Sylvia C. Wong
    • 1
  • Simon Miles
    • 1
  • Weijian Fang
    • 1
  • Paul Groth
    • 1
  • Luc Moreau
    • 1
  1. 1.School of Electronics and Computer ScienceUniversity of SouthamptonUK

Personalised recommendations