Towards Intelligent Management of Very Large Computing Systems

  • Eugen Volk
  • Jochen Buchholz
  • Stefan Wesner
  • Daniela Koudela
  • Matthias Schmidt
  • Niels Fallenbeck
  • Roland Schwarzkopf
  • Bernd Freisleben
  • Götz Isenmann
  • Jürgen Schwitalla
  • Marc Lohrer
  • Erich Focht
  • Andreas Jeutter
Conference paper

DOI: 10.1007/978-3-642-24025-6_16

Cite this paper as:
Volk E. et al. (2011) Towards Intelligent Management of Very Large Computing Systems. In: Bischof C., Hegering HG., Nagel W., Wittum G. (eds) Competence in High Performance Computing 2010. Springer, Berlin, Heidelberg

Abstract

The increasing complexity of current and future very large computing systems, with their rapidly growing numbers of cores and nodes, demands considerable human effort for administration and maintenance. Existing monitoring tools are neither scalable nor capable of reducing the overwhelming flow of information to only the essential, high-value items. Current management tools lack both scalability and the ability to process large amounts of information intelligently, that is, to relate data from various sources in order to make the right decisions on error and fault handling. To address these problems, we present a solution designed within the TIMaCS project: a hierarchical, scalable, policy-based monitoring and management framework.

Copyright information

© Springer-Verlag Berlin Heidelberg 2011

Authors and Affiliations

  • Eugen Volk (1)
  • Jochen Buchholz (1)
  • Stefan Wesner (1)
  • Daniela Koudela (2)
  • Matthias Schmidt (3)
  • Niels Fallenbeck (3)
  • Roland Schwarzkopf (3)
  • Bernd Freisleben (3)
  • Götz Isenmann (4)
  • Jürgen Schwitalla (4)
  • Marc Lohrer (4)
  • Erich Focht (5)
  • Andreas Jeutter (5)
  1. High Performance Computing Center Stuttgart, Stuttgart, Germany
  2. Zentrum für Informationsdienste und Hochleistungsrechnen (ZIH), Technische Universität Dresden, Dresden, Germany
  3. Department of Mathematics and Computer Science, University of Marburg, Marburg, Germany
  4. science + computing ag, Tübingen, Germany
  5. NEC High Performance Computing Europe, Stuttgart, Germany
