MPI on BlueGene/L: Designing an Efficient General Purpose Messaging Solution for a Large Cellular System

  • George Almási
  • Charles Archer
  • José G. Castaños
  • Manish Gupta
  • Xavier Martorell
  • José E. Moreira
  • William D. Gropp
  • Silvius Rus
  • Brian Toonen
Part of the Lecture Notes in Computer Science book series (LNCS, volume 2840)

Abstract

The BlueGene/L computer uses system-on-a-chip integration and a highly scalable 65,536-node cellular architecture to deliver 360 Tflops of peak computing power. Efficient operation of the machine requires a fast, scalable, and standards-compliant MPI library. In this paper, we discuss our efforts to port the MPICH2 library to BlueGene/L.

Copyright information

© Springer-Verlag Berlin Heidelberg 2003

Authors and Affiliations

  • George Almási (1)
  • Charles Archer (2)
  • José G. Castaños (1)
  • Manish Gupta (1)
  • Xavier Martorell (1)
  • José E. Moreira (1)
  • William D. Gropp (3)
  • Silvius Rus (4)
  • Brian Toonen (3)

  1. IBM T. J. Watson Research Center, Yorktown Heights, USA
  2. IBM Systems Group, Rochester, USA
  3. Argonne National Laboratory, Argonne, USA
  4. Texas A&M University, College Station, USA
