A Scalable Process-Management Environment for Parallel Programs

  • Ralph Butler
  • William Gropp
  • Ewing Lusk
Conference paper

DOI: 10.1007/3-540-45255-9_25

Part of the Lecture Notes in Computer Science book series (LNCS, volume 1908)
Cite this paper as:
Butler R., Gropp W., Lusk E. (2000) A Scalable Process-Management Environment for Parallel Programs. In: Dongarra J., Kacsuk P., Podhorszki N. (eds) Recent Advances in Parallel Virtual Machine and Message Passing Interface. EuroPVM/MPI 2000. Lecture Notes in Computer Science, vol 1908. Springer, Berlin, Heidelberg

Abstract

We present a process management system for parallel programs such as those written using MPI. A primary goal of the system, which we call MPD (for multipurpose daemon), is to be scalable. By this we mean that startup of interactive parallel jobs comprising a thousand processes is quick, that signals can be quickly delivered to processes, and that stdin, stdout, and stderr are managed intuitively. Our primary target is parallel machines made up of clusters of SMPs, but the system is also useful in more tightly integrated environments. We describe how MPD enables much faster startup and better runtime management of MPICH jobs. We show how close control of stdio can support the easy implementation of a number of convenient system utilities, even a parallel debugger. MPD is implemented and freely distributed with MPICH.

Copyright information

© Springer-Verlag Berlin Heidelberg 2000

Authors and Affiliations

  • Ralph Butler (1)
  • William Gropp (2)
  • Ewing Lusk (2)
  1. University of North Florida, USA
  2. Argonne National Laboratory, USA