The Virtual Resource Manager: Local Autonomy Versus QoS Guarantees for Grid Applications

  • Lars-Olof Burchard
  • Hans-Ulrich Heiss
  • Barry Linnert
  • Jörg Schneider
  • Felix Heine
  • Matthias Hovestadt
  • Odej Kao
  • Axel Keller

Abstract

In this paper, we describe the architecture of the Virtual Resource Manager (VRM), a management system designed to reside on top of local resource management systems for cluster computers and other kinds of resources. The most important feature of the VRM is its ability to handle quality-of-service (QoS) guarantees and service-level agreements (SLAs). The paper places particular emphasis on the options for dealing with local autonomy when the underlying resource management systems do not support SLAs. Because local administrators may not want to hand over complete control to the Grid management, strategies that address this issue are needed. Local autonomy should be retained as much as possible while still providing reliability and QoS guarantees for Grid applications, e.g., specified as SLAs.

Keywords

virtual resource management, local autonomy, quality of service, Grid applications

Copyright information

© Springer Science+Business Media, Inc. 2006

Authors and Affiliations

  • Lars-Olof Burchard
    • 1
  • Hans-Ulrich Heiss
    • 1
  • Barry Linnert
    • 1
  • Jörg Schneider
    • 1
  • Felix Heine
    • 2
  • Matthias Hovestadt
    • 2
  • Odej Kao
    • 2
  • Axel Keller
    • 2
  1. Communications and Operating System Group, Technische Universität Berlin, Germany
  2. Paderborn Center for Parallel Computing (PC2), Universität Paderborn, Germany
