Toward message passing for a million processes: characterizing MPI on a massive scale blue gene/P

Balaji, Pavan; Chan, Anthony; Thakur, Rajeev; Gropp, William; Lusk, Ewing

doi:10.1007/s00450-009-0095-3

Toward message passing for a million processes: characterizing MPI on a massive scale blue gene/P

Special Issue Paper
Published: 14 August 2009

Volume 24, pages 11–19, (2009)
Cite this article

Computer Science - Research and Development

Pavan Balaji¹,
Anthony Chan¹,
Rajeev Thakur¹,
William Gropp² &
…
Ewing Lusk¹

70 Accesses
11 Citations
Explore all metrics

Abstract

Upcoming exascale capable systems are expected to comprise more than a million processing elements. As researchers continue to work toward architecting these systems, it is becoming increasingly clear that these systems will utilize a significant amount of shared hardware between processing units; this includes shared caches, memory and network components. Thus, understanding how effective current message passing and communication infrastructure is in tying these processing elements together, is critical to making educated guesses on what we can expect from such future machines. Thus, in this paper, we characterize the communication performance of the message passing interface (MPI) implementation on 32 racks (131072 cores) of the largest Blue Gene/P (BG/P) system in the United States (80% of the total system size) and reveal various interesting insights into it.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Containerization technologies: taxonomies, applications and challenges

Article 08 June 2021

Ouafa Bentaleb, Adam S. Z. Belloum, … Aouaouche El-Maouhab

Balancing Tracking Granularity and Parallelism in Many-Task Systems: The Horizons Approach

Article Open access 06 April 2024

Peter Thoman & Philip Salzmann

Performance improvement of the triangular matrix product in commodity clusters

Article Open access 15 April 2024

Inmaculada Santamaria-Valenzuela, Rocío Carratalá-Sáez, … Arturo Gonzalez-Escribano

References

InfiniBand Trade Association http://www.infinibandta.com
Naval Research Laboratory Layered Ocean Model (NLOM) http://www.navo.hpc.mil/Navigator/Fall99_Feature.html
Alam S, Barrett B, Bast M, Fahey MR, Kuehn J, McCurdy C, Rogers J, Roth P, Sankaran R, Vetter J, Worley P, Yu W (2008) Early Evaluation of IBM BlueGene/P. In: SC
Balaji P, Chan A, Thakur R, Gropp W, Lusk E (2008) Non-Data-Communication Overheads in MPI: Analysis on Blue Gene/P. In: Euro PVM/MPI Users’ Group Meeting, Dublin, Ireland
Overview of the IBM Blue Gene/P project http://www.research.ibm.com/journal/rd/521/team.pdf
IBM System Blue Gene Solution: Blue Gene/P Application Development http://www.redbooks.ibm.com/redbooks/pdfs/sg247287.pdf
Chan A, Balaji P, Thakur R, Gropp W, Lusk E (2008) Communication Analysis of Parallel 3D FFT for Flat Cartesian Meshes on Large Blue Gene Systems. In: HiPC, Bangalore, India
Liu J, Chandrasekaran B, Wu J, Jiang W, Kini S, Yu W, Buntinas D, Wyckoff P, Panda DK (2003) Performance Comparison of MPI Implementations over InfiniBand Myrinet and Quadrics. In: Supercomputing 2003: The International Conference for High Performance Computing and Communications, November 2003
Liu J, Jiang W, Wyckoff P, Panda DK, Ashton D, Buntinas D, Gropp W, Toonen B (2004) Design and Implementation of MPICH2 over InfiniBand with RDMA Support. In: Proceedings of Int’l Parallel and Distributed Processing Symposium (IPDPS ’04), April 2004
Liu J, Wu J, Kini S, Noronha R, Wyckoff P, Panda DK (2002) MPI Over InfiniBand: Early Experiences. In: IPDPS
Petrini F, Feng W-C, Hoisie A, Coll S, Frachtenberg E (2002) The Quadrics Network: High Performance Clustering Technology. IEEE Micro 22(1):46–57
Article Google Scholar

Download references

Author information

Authors and Affiliations

Mathematics and Computer Science, Argonne National Laboratory, Argonne, USA
Pavan Balaji, Anthony Chan, Rajeev Thakur & Ewing Lusk
University of Illinois, Urbana-Champaign, USA
William Gropp

Authors

Pavan Balaji
View author publications
You can also search for this author in PubMed Google Scholar
Anthony Chan
View author publications
You can also search for this author in PubMed Google Scholar
Rajeev Thakur
View author publications
You can also search for this author in PubMed Google Scholar
William Gropp
View author publications
You can also search for this author in PubMed Google Scholar
Ewing Lusk
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Pavan Balaji.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Balaji, P., Chan, A., Thakur, R. et al. Toward message passing for a million processes: characterizing MPI on a massive scale blue gene/P . Comp. Sci. Res. Dev. 24, 11–19 (2009). https://doi.org/10.1007/s00450-009-0095-3

Download citation

Published: 14 August 2009
Issue Date: September 2009
DOI: https://doi.org/10.1007/s00450-009-0095-3

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Toward message passing for a million processes: characterizing MPI on a massive scale blue gene/P

Abstract

Access this article

Similar content being viewed by others

Containerization technologies: taxonomies, applications and challenges

Balancing Tracking Granularity and Parallelism in Many-Task Systems: The Horizons Approach

Performance improvement of the triangular matrix product in commodity clusters

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Toward message passing for a million processes: characterizing MPI on a massive scale blue gene/P

Abstract

Access this article

Similar content being viewed by others

Containerization technologies: taxonomies, applications and challenges

Balancing Tracking Granularity and Parallelism in Many-Task Systems: The Horizons Approach

Performance improvement of the triangular matrix product in commodity clusters

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation