Cluster Computing

, Volume 18, Issue 4, pp 1317–1329

Ensemble learning of runtime prediction models for gene-expression analysis workflows

  • David A. Monge
  • Matěj Holec
  • Filip Železný
  • Carlos García Garino
Article

DOI: 10.1007/s10586-015-0481-5

Cite this article as:
Monge, D.A., Holec, M., Železný, F. et al. Cluster Comput (2015) 18: 1317. doi:10.1007/s10586-015-0481-5

Abstract

The adequate management of scientific workflow applications strongly depends on the availability of accurate performance models of sub-tasks. Numerous approaches use machine learning to generate such models autonomously, thus alleviating the human effort associated to this process. However, these standalone models may lack robustness, leading to a decay on the quality of information provided to workflow systems on top. This paper presents a novel approach for learning ensemble prediction models of tasks runtime. The ensemble-learning method entitled bootstrap aggregating (bagging) is used to produce robust ensembles of M5P regression trees of better predictive performance than could be achieved by standalone models. Our approach has been tested on gene expression analysis workflows. The results show that the ensemble method leads to significant prediction-error reductions when compared with learned standalone models. This is the first initiative using ensemble learning for generating performance prediction models. These promising results encourage further research in this direction.

Keywords

Performance prediction Ensemble learning Data-intensive workflows Gene expressions analysis experiments 

Copyright information

© Springer Science+Business Media New York 2015

Authors and Affiliations

  • David A. Monge
    • 1
  • Matěj Holec
    • 2
  • Filip Železný
    • 2
  • Carlos García Garino
    • 3
  1. 1.ITIC Research Institute & Faculty of Exact and Natural SciencesNational University of Cuyo (UNCuyo)MendozaArgentina
  2. 2.IDA Research GroupCzech Technical University in PraguePragueCzech Republic
  3. 3.ITIC Research Institute & Faculty of EngineeringNational University of Cuyo (UNCuyo)MendozaArgentina