Implementing MPI on the BlueGene/L Supercomputer

  • George Almási
  • Charles Archer
  • José G. Castaños
  • C. Chris Erway
  • Philip Heidelberger
  • Xavier Martorell
  • José E. Moreira
  • Kurt Pinnow
  • Joe Ratterman
  • Nils Smeds
  • Burkhard Steinmacher-burow
  • William Gropp
  • Brian Toonen
Part of the Lecture Notes in Computer Science book series (LNCS, volume 3149)

Abstract

The BlueGene/L supercomputer will consist of 65,536 dual-processor compute nodes interconnected by two high-speed networks: a three-dimensional torus network and a tree topology network. Each compute node can only address its own local memory, making message passing the natural programming model for BlueGene/L. In this paper we present our implementation of MPI for BlueGene/L. In particular, we discuss how we leveraged the architectural features of BlueGene/L to arrive at an efficient implementation of MPI in this machine. We validate our approach by comparing MPI performance against the hardware limits and also the relative performance of the different modes of operation of BlueGene/L. We show that dedicating one of the processors of a node to communication functions greatly improves the bandwidth achieved by MPI operation, whereas running two MPI tasks per compute node can have a positive impact on application performance.

Keywords

Tree Network Broadcast Algorithm Main Processor Packet Layer Torus Network 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    The MPICH and MPICH2 homepage, http://www-unix.mcs.anl.gov/mpi/mpich
  2. 2.
    Adiga, N.R., et al.: An overview of the BlueGene/L supercomputer. In: SC 2002 – High Performance Networking and Computing, Baltimore, MD (November 2002)Google Scholar
  3. 3.
    Almási, G., Archer, C., Nos, J.G.C., Gupta, M., Martorell, X., Moreira, J.E., Gropp, W., Rus, S., Toonen, B.: MPI on BlueGene/L: Designing an Efficient General Purpose Messaging Solution for a Large Cellular System, September 2003. LNCS. Springer, Heidelberg (2003)Google Scholar
  4. 4.
    Almási, G., Bellofatto, R., Brunheroto, J., Casçaval, C., Nos, J.G.C., Ceze, L., Crumley, P., Erway, C., Gagliano, J., Lieber, D., Martorell, X., Moreira, J.E., Sanomiya, A., Strauss, K.: An overview of the BlueGene/L system software organization. In: Kosch, H., Böszörményi, L., Hellwagner, H. (eds.) Euro-Par 2003. LNCS, vol. 2790, pp. 543–555. Springer, Heidelberg (2003)CrossRefGoogle Scholar
  5. 5.
    Almasi, G., et al.: Cellular supercomputing with system-on-a-chip. In: IEEE International Solid-state Circuits Conference ISSCC (2001)Google Scholar
  6. 6.
    Bailey, D., Harris, T., Saphir, W., van der Wijngaart, R., Woo, A., Yarrow, M.: The NAS Parallel Benchmarks 2.0. Technical Report NAS-95-020, NASA Ames Research Center (December 1995)Google Scholar
  7. 7.
    Chiola, G., Ciaccio, G.: Gamma: a low cost network of workstations based on active messages. In: Proc. Euromicro PDP 1997, London, UK, January 1997, IEEE Computer Society, Los Alamitos (1997)Google Scholar
  8. 8.
    Gropp, W., Lusk, E., Ashton, D., Ross, R., Thakur, R., Toonen, B.: MPICH Abstract Device Interface Version 3.4 Reference Manual: Draft of May 20 (2003), http://www-unix.mcs.anl.gov/mpi/mpich/adi3/adi3man.pdf
  9. 9.
    Pakin, S., Lauria, M., Chien, A.: High performance messaging on workstations: Illinois Fast Messages (FM) for Myrinet. In: Supercomputing 1995, San Diego, CA (December 1995)Google Scholar
  10. 10.
    Snir, M., Otto, S., Huss-Lederman, S., Walker, D., Dongarra, J.: MPI - The Complete Reference, 2nd edn. The MIT Press, Cambridge (2000)Google Scholar
  11. 11.
    Turner, D., Oline, A., Chen, X., Benjegerdes, T.: Integrating new capabilities into NetPIPE, September 2003. LNCS. Springer, Heidelberg (2003)Google Scholar
  12. 12.
    von Eicken, T., Basu, A., Buch, V., Vogels, W.: U-net: A user-level network interface for parallel and distributed computing. In: Proceedings of the 15th ACM Symposium on Operating Systems Principles, Copper Mountain, Colorado (December 1995)Google Scholar
  13. 13.
    von Eicken, T., Culler, D.E., Goldstein, S.C., Schauser, K.E.: Active Messages: a mechanism for integrated communication and computation. In: Proceedings of the 19th International Symposium on Computer Architecture (May 1992)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2004

Authors and Affiliations

  • George Almási
    • 1
  • Charles Archer
    • 2
  • José G. Castaños
    • 1
  • C. Chris Erway
    • 1
  • Philip Heidelberger
    • 1
  • Xavier Martorell
    • 1
  • José E. Moreira
    • 1
  • Kurt Pinnow
    • 2
  • Joe Ratterman
    • 2
  • Nils Smeds
    • 1
  • Burkhard Steinmacher-burow
    • 1
  • William Gropp
    • 3
  • Brian Toonen
    • 3
  1. 1.IBM Thomas J. Watson Research CenterYorktown HeightsUSA
  2. 2.IBM Systems GroupRochesterUSA
  3. 3.Mathematics and Computer Science DivisionArgonne National LaboratoryArgonneUSA

Personalised recommendations