The Virtual Resource Manager: Local Autonomy Versus QoS Guarantees for Grid Applications
In this paper, we describe the architecture of the virtual resource manager VRM, a management system designed to reside on top of local resource management systems for cluster computers and other kinds of resources. The most important feature of the VRM is its capability to handle quality-of-service (QoS) guarantees and service-level agreements (SLAs). The particular emphasis of the paper is on the various opportunities to deal with local autonomy for resource management systems not supporting SLAs. As local administrators may not want to hand over complete control to the Grid management, it is necessary to define strategies that deal with this issue. Local autonomy should be retained as much as possible while providing reliability and QoS guarantees for Grid applications, e.g., specified as SLAs.
Keywordsvirtual resouce management local autonomy quality of service Grid applications
Unable to display preview. Download preview PDF.
- Global Grid Forum. http://www.ggf.org/, visited 05.01.2005.Google Scholar
- R. Al-Ali, K. Amin, G. von Laszewki, O. F. Rana, D. W. Walker, M. Hategan, and N. Za-luzec. Analysis and Provision of QoS for Distributed Grid Applications. Journal of Grid Computing, 2004.Google Scholar
- L.-O. Burchard. Networks with Advance Reservations: Applications, Architecture, and Performance. Journal of Network and Systems Management, Kluwer Academic Publishers, 2005 (to appear).Google Scholar
- L.-O. Burchard, M. Hovestadt, O. Kao, A. Keller, and B. Linnert. The Virtual Resource Manager: An Architecture for SLA-aware Resource Management. In 4th Intl. IEEE/ACM Intl. Symposium on Cluster Computing and the Grid (CCGrid), Chicago, USA, pages 126–133, 2004.Google Scholar
- K. Czajkowski, I. Foster, C. Kesselman, V. Sander, and S. Tuecke. SNAP: A Protocol for Negotiating Service Level Agreements and Coordinating Resource Management in Distributed Systems. In 8th Intl. Workshop on Job Scheduling Strategies for Parallel Processing (JSSPP), Edinburgh, Scotland, UK, volume 2537 of Lecture Notes in Computer Science (LNCS), pages 153–183. Springer, January 2002.MATHGoogle Scholar
- D. Ferrari, A. Gupta, and G. Ventre. Distributed Advance Reservation of Real-Time Connections. In 5th Intl. Workshop on Network and Operating System Support for Digital Audio and Video (NOSSDAV), Durham, USA, volume 1018 of Lecture Notes in Computer Science (LNCS), pages 16–27. Springer, 1995.Google Scholar
- I. Foster, C. Kesselman, C. Lee, R. Lindell, K. Nahrstedt, and A. Roy. A Distributed Resource Management Architecture that Supports Advance Reservations and Co-Allocation. In 7th International Workshop on Quality of Service (IWQoS), London, UK, pages 27–36, 1999.Google Scholar
- The Globus Project. http://www.globus.org/, visited 05.01.2005.Google Scholar
- M. Hovestadt, O. Kao, A. Keller, and A. Streit. Scheduling in HPC Resource Management Systems: Queuing vs. Planning. In Job Scheduling Strategies for Parallel Processing: 9th International Workshop, JSSPP 2003 Seattle, WA, USA, June 24, 2003 Revised Papers, 2003.Google Scholar
- A. Keller and A. Reinefeld. Anatomy of a Resource Management System for HPC Clusters. In Annual Review of Scalable Computing, vol. 3, Singapore University Press, pages 1–31, 2001.Google Scholar
- OpenPBS. http://www.openpbs.org/, visited 05.01.2005.Google Scholar
- Q. Snell, M. Clement, D. Jackson, and C. Gregory. The Performance Impact of Advance Reservation Meta-scheduling. In 6th Workshop on Job Scheduling Strategies for Parallel Processing, Cancun, Mexiko, volume 1911 of Lecture Notes in Computer Science (LNCS), pages 137–153. Springer, 2000.Google Scholar