Noncollective Communicator Creation in MPI
MPI communicators abstract communication operations across application modules, facilitating seamless composition of different libraries. In addition, communicators provide the ability to form groups of processes and establish multiple levels of parallelism. Traditionally, communicators have been collectively created in the context of the parent communicator. The recent thrust toward systems at petascale and beyond has brought forth new application use cases, including fault tolerance and load balancing, that highlight the ability to construct an MPI communicator in the context of its new process group as a key capability. However, it has long been believed that MPI is not capable of allowing the user to form a new communicator in this way. We present a new algorithm that allows the user to create such flexible process groups using only the functionality given in the current MPI standard. We explore performance implications of this technique and demonstrate its utility for load balancing in the context of a Markov chain Monte Carlo computation. In comparison with a traditional collective approach, noncollective communicator creation enables a 30% improvement in execution time through asynchronous load balancing.
KeywordsExecution Time Markov Chain Monte Carlo Load Balance Fault Tolerance Parent Communicator
Unable to display preview. Download preview PDF.
- 1.MPICH2 Project Website (June 2011), http://www.mcs.anl.gov/research/projects/mpich2/
- 2.Dickson, A., Maienschein-Cline, M., Tovo-Dwyer, A., Hammond, J.R., Dinner, A.R.: Flow-dependent unfolding and refolding of an RNA by nonequilibrium umbrella sampling. ArXiv e-prints (1104.5180), cond–mat.stat–mech (April 2011)Google Scholar
- 6.MPI Forum: MPI: A Message-Passing Interface Standard. Version 2.2 (September 4, 2009)Google Scholar
- 8.Nieplocha, J., Krishamoorthy, S., Valiev, M., Krishnan, M., Palmer, B., Sadayappan, P.: Integrated data and task management for scientific applications. In: Bubak, M., van Albada, G.D., Dongarra, J., Sloot, P.M.A. (eds.) ICCS 2008, Part I. LNCS, vol. 5101, pp. 20–31. Springer, Heidelberg (2008)CrossRefGoogle Scholar
- 11.Windus, T.L., Kathmann, S.M., Crosby, L.D.: High performance computations using dynamical nucleation theory. Journal of Physics: Conference Series 125(1), 12017 (2008)Google Scholar