Abstract
In the quest to build exascale supercomputers, designers are increasing the number of hierarchical levels that exist among system components. Software developed for these systems must account for the various hierarchies to achieve maximum efficiency. The first step in this work is to identify groups of processes that share common resources. We develop, analyze, and test several algorithms that can split millions of processes into groups based on arbitrary, user-defined data. We find that bitonic sort and our new hash-based algorithm best suit the task.
This work was performed under the auspices of the U.S. Department of Energy by Lawrence Livermore National Laboratory under contract DE-AC52-07NA27344 (LLNL-CONF-484653).
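To make the grouping task in the abstract concrete, the following is a minimal serial sketch of what a generalized split is expected to produce. It is not the paper's distributed algorithm. It only illustrates the standard MPI_Comm_split semantics (processes with equal color form one group, ordered by key and then by original rank), extended to arbitrary hashable colors such as hostname strings. The function name `split_groups` and the sample data are hypothetical, and a Python dictionary stands in for the hashing step.

```python
from collections import defaultdict


def split_groups(colors, keys):
    """Serial illustration of MPI_Comm_split-style grouping (hypothetical helper).

    colors[i] and keys[i] are the values supplied by the process with rank i.
    Returns, for each rank, a tuple (color, new_rank, group_size).
    """
    # Bucket ranks by color; the dict performs the hashing on the color value.
    buckets = defaultdict(list)
    for rank, (color, key) in enumerate(zip(colors, keys)):
        buckets[color].append((key, rank))

    result = [None] * len(colors)
    for color, members in buckets.items():
        # Order members by key; ties are broken by the original rank.
        members.sort()
        for new_rank, (_key, old_rank) in enumerate(members):
            result[old_rank] = (color, new_rank, len(members))
    return result


# Assumed example: four processes spread over two nodes, all with key 0.
colors = ["node0", "node1", "node0", "node1"]
keys = [0, 0, 0, 0]
groups = split_groups(colors, keys)
```

In this example, ranks 0 and 2 end up in the "node0" group and ranks 1 and 3 in the "node1" group. A scalable implementation cannot gather every color on a single process as this sketch does, which is the problem the algorithms studied in the paper address.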
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Moody, A., Ahn, D.H., de Supinski, B.R. (2011). Exascale Algorithms for Generalized MPI_Comm_split. In: Cotronis, Y., Danalis, A., Nikolopoulos, D.S., Dongarra, J. (eds) Recent Advances in the Message Passing Interface. EuroMPI 2011. Lecture Notes in Computer Science, vol 6960. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-24449-0_4
DOI: https://doi.org/10.1007/978-3-642-24449-0_4
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-24448-3
Online ISBN: 978-3-642-24449-0
eBook Packages: Computer Science (R0)