Using Provenance to Improve Workflow Design

  • Frederico T. de Oliveira
  • Leonardo Murta
  • Claudia Werner
  • Marta Mattoso
Part of the Lecture Notes in Computer Science book series (LNCS, volume 5272)


With the popularity of scientific workflow management systems (WfMS), workflow specifications are becoming available. Provenance support in WfMS can help reusing third party code. Browsing can be done through queries instead of ad-hoc search on the Web. Finding dependencies among programs or services through provenance queries, without tool support, is not a trivial task. Due to the huge number of program versions available and their configuration parameters, this task may be heavily error prone and counterproductive. In this work we propose a recommendation service that aims at suggesting frequent combinations of scientific programs for reuse. Our recommendation service is designed to work over WfMS that provide provenance on workflow specification and execution logs. We have based our service on software components reuse and data mining techniques, and implemented a prototype with Vistrails WfMS.


Recommendation System Collaborative Filter Software Reuse Recommendation Service Software Component Reuse 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


  1. 1.
    Scheidegger, C., Koop, D., Santos, E., Vo, H., Callahan, S., Freire, J., Silva, C.: Tackling the provenance challenge one layer at a time. Concurrency and Computation: Practice and Experience (2007)Google Scholar
  2. 2.
    De Roure, D., Goble, C.: myExperiment - A Web 2.0 Virtual Research Environment. In: International Workshop on Virtual Research Environments and Collaborative Work Environments, Edinburgh, UK (May 2007)Google Scholar
  3. 3.
    Sarwar, B.M., Karypis, G., Konstan, J.A., Riedl, J.: Item-Based Collaborative Filtering Recommendation Algorithms. In: Proc. of the 10th International World Wide Web Conference (WWW10), Hong Kong, pp. 285–295 (2001)Google Scholar
  4. 4.
    Krueger, C.W.: Software Reuse. ACM Computing Surveys 24(2), 131–183 (1992)CrossRefGoogle Scholar
  5. 5.
    Brown, A.W.: Large Scale Component Based Development. Prentice Hall PTR, Englewood Cliffs (2000)Google Scholar
  6. 6.
    D’Souza, D., Wills, A.: Objects, components, and frameworks with UML: The catalysis approach. Addison-Wesley, Reading (1998)Google Scholar
  7. 7.
    Szyperski, C.: Component Software: Beyond object-oriented programming. Addison-Wesley, Reading (2002)zbMATHGoogle Scholar
  8. 8.
    Cheesman, J., Daniels, J.: UML Components: A Simple Process for Specifying Component-Based Software. Addison-Wesley, Reading (2000)Google Scholar
  9. 9.
    Atkinson, C., et al.: Component-Based Product Line Engineering with UML. Addison-Wesley, Reading (2001)Google Scholar
  10. 10.
    Ludäscher, B.: Scientific workflow management and the Kepler system. Concurrency and Computation: Practice & Experience, 2006 18(10), 1039–1065 (2006)CrossRefGoogle Scholar
  11. 11.
    Zhuge, H.: Component-based workflow systems development. Decision Support Systems 35(4), 517–536 (2003)CrossRefGoogle Scholar
  12. 12.
    Hill, W.: Recommending and Evaluating Choices in a Virtual Community of Use. In: Conference on Human Factors in Computing Systems (1995)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2008

Authors and Affiliations

  • Frederico T. de Oliveira
    • 1
  • Leonardo Murta
    • 1
    • 2
  • Claudia Werner
    • 1
  • Marta Mattoso
    • 1
  1. 1.COPPE/ Computer Science DepartmentFederal University of Rio de Janeiro (UFRJ)Brazil
  2. 2.Instituto de ComputaçãoUniversidade Federal Fluminense (UFF)Brazil

Personalised recommendations