Abstract
When parallel programs run on clusters of individual computers or workstations, network communication is often the performance bottleneck. Because the round-trip time of a network packet is orders of magnitude larger than the time needed to transfer an equivalent amount of data from memory, methods that reduce network usage can significantly improve the performance of parallel programs.
This work demonstrates that broadcast performance can be improved significantly by using a portable, reliable multicasting protocol in place of the unicasting that is typically used. Our end product is a patch for MPICH, a popular, portable MPI implementation provided by Argonne National Laboratory (ANL). The patch does not require kernel modification and is therefore portable to any UNIX-based system. Because absolute reliability is critical for data integrity when broadcasting messages on clusters, our multicasting protocol also addresses reliability issues.
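As a rough illustration of why multicasting can reduce network usage for a broadcast, the sketch below counts data transmissions for three idealized schemes. It is not the protocol described in this paper: the function names are hypothetical, the network is assumed loss-free, and acknowledgement or retransmission traffic needed for reliability is not counted.

```python
def linear_unicast_sends(n: int) -> int:
    """Data messages the root sends when it unicasts a copy to every other process."""
    return max(n - 1, 0)


def tree_unicast_rounds(n: int) -> int:
    """Sequential rounds in an idealized binomial-tree unicast broadcast.

    Equals ceil(log2(n)) for n >= 1, computed with integer arithmetic.
    """
    return (n - 1).bit_length() if n > 1 else 0


def multicast_sends(n: int) -> int:
    """Data messages the root sends with one multicast (assumes no packet loss)."""
    return 1 if n > 1 else 0


if __name__ == "__main__":
    for n in (2, 8, 16, 64):
        print(n, linear_unicast_sends(n), tree_unicast_rounds(n), multicast_sends(n))
```

Under these assumptions the unicast cost grows with the number of processes, while the multicast cost stays constant. A real reliable multicast adds feedback traffic, so the actual gain depends on the loss rate and on how acknowledgements are collected.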
Supported by the Independent Research and Development Fund at the Advanced Research Laboratories at the University of Texas at Austin, USA
Most of this work was done while Dr. Elster was an adjunct faculty member at the Department of Electrical and Computer Engineering at the University of Texas at Austin, USA
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
Cite this paper
Sack, P., Elster, A.C. (2002). Fast MPI Broadcasts through Reliable Multicasting. In: Fagerholm, J., Haataja, J., Järvinen, J., Lyly, M., Råback, P., Savolainen, V. (eds) Applied Parallel Computing. PARA 2002. Lecture Notes in Computer Science, vol 2367. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-48051-X_44
DOI: https://doi.org/10.1007/3-540-48051-X_44
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-43786-4
Online ISBN: 978-3-540-48051-8
eBook Packages: Springer Book Archive