Workshop on Resiliency in High Performance Computing (Resilience) in Clusters, Clouds, and Grids

  • Stephen L. Scott
  • Chokchai (Box) Leangsuksun
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 7156)

Abstract

Clusters, Clouds, and Grids are three different computational paradigms with the intent or potential to support High Performance Computing (HPC). Currently, they consist of hardware, management, and usage models particular to different computational regimes, e.g., high performance systems designed to support tightly coupled scientific simulation codes and commercial cloud systems designed to support software as a service (SaaS). However, in order to support HPC, all of them must at least utilize large numbers of resources; hence, effective HPC in any of these paradigms must address the issue of resiliency at large scale.

Keywords

Fault Tolerance; Usage Model; High Performance Computing; Computational Paradigm; Soft Error
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Copyright information

© Springer-Verlag Berlin Heidelberg 2012

Authors and Affiliations

  • Stephen L. Scott (1, 2)
  • Chokchai (Box) Leangsuksun (3)

  1. Tennessee Tech University, USA
  2. Oak Ridge National Laboratory, USA
  3. College of Engineering & Science, Louisiana Tech University, Ruston, USA
