Journal of Grid Computing

, Volume 9, Issue 4, pp 455–478

Adaptive Executions of Multi-Physics Coupled Applications on Batch Grids

Authors

    • Supercomputer Education and Research CentreIndian Institute of Science
  • Sathish S Vadhiyar
    • Supercomputer Education and Research CentreIndian Institute of Science
  • Ravi S Nanjundiah
    • Centre for Atmospheric & Oceanic SciencesIndian Institute of Science
Article

DOI: 10.1007/s10723-011-9197-9

Cite this article as:
Murugavel, S.S., Vadhiyar, S.S. & Nanjundiah, R.S. J Grid Computing (2011) 9: 455. doi:10.1007/s10723-011-9197-9

Abstract

Long running multi-physics coupled parallel applications have gained prominence in recent years. The high computational requirements and long durations of simulations of these applications necessitate the use of multiple systems of a Grid for execution. In this paper, we have built an adaptive middleware framework for execution of long running multi-physics coupled applications across multiple batch systems of a Grid. Our framework, apart from coordinating the executions of the component jobs of an application on different batch systems, also automatically resubmits the jobs multiple times to the batch queues to continue and sustain long running executions. As the set of active batch systems available for execution changes, our framework performs migration and rescheduling of components using a robust rescheduling decision algorithm. We have used our framework for improving the application throughput of a foremost long running multi-component application for climate modeling, the Community Climate System Model (CCSM). Our real multi-site experiments with CCSM indicate that Grid executions can lead to improved application throughput for climate models.

Keywords

Adaptive framework Batch systems Climate models Multi-component applications Rescheduling

Copyright information

© Springer Science+Business Media B.V. 2011