
Automatic Memory Optimizations for Improving MPI Derived Datatype Performance

  • Conference paper
Recent Advances in Parallel Virtual Machine and Message Passing Interface (EuroPVM/MPI 2006)

Part of the book series: Lecture Notes in Computer Science (LNPSE, volume 4192)

Abstract

MPI derived datatypes allow users to describe noncontiguous memory layouts and to communicate noncontiguous data with a single communication function. This powerful feature enables an MPI implementation to optimize the transfer of noncontiguous data. In practice, however, many implementations of MPI derived datatypes perform poorly, which discourages application developers from using the feature. In this paper, we present a technique that automatically selects templates optimized for memory performance based on the access pattern of derived datatypes. We implement this mechanism in the MPICH2 source code. We compare the performance of our implementation with that of well-written manual packing/unpacking routines and of the original MPICH2 implementation, and we show that the performance of various derived datatypes improves significantly and is comparable to that of the optimized manual routines.
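
As a concrete illustration of the feature the abstract describes, the following C/MPI sketch sends one column of a row-major matrix with a single call by building an MPI_Type_vector. The example is not taken from the paper; the matrix dimensions and the two-rank send/receive pattern are assumptions for illustration. How efficiently the library packs the strided elements behind this call is exactly the performance issue the paper addresses.

```c
/* Minimal sketch (not from the paper): sending one column of a row-major
 * matrix with a single call via an MPI derived datatype, instead of
 * packing the column into a contiguous buffer by hand.
 * Run with at least two ranks, e.g. mpiexec -n 2 ./a.out */
#include <mpi.h>
#include <stdio.h>

#define ROWS 4
#define COLS 5

int main(int argc, char **argv)
{
    int rank;
    double matrix[ROWS][COLS];   /* row-major: column elements are COLS doubles apart */
    double column[ROWS];

    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);

    /* One element per row, stride of COLS doubles between elements:
     * this describes the noncontiguous layout of a single matrix column. */
    MPI_Datatype col_type;
    MPI_Type_vector(ROWS, 1, COLS, MPI_DOUBLE, &col_type);
    MPI_Type_commit(&col_type);

    if (rank == 0) {
        for (int i = 0; i < ROWS; i++)
            for (int j = 0; j < COLS; j++)
                matrix[i][j] = i * COLS + j;
        /* The MPI library gathers the strided elements itself; how well it
         * does so is the packing performance studied in the paper. */
        MPI_Send(&matrix[0][2], 1, col_type, 1, 0, MPI_COMM_WORLD);
    } else if (rank == 1) {
        MPI_Recv(column, ROWS, MPI_DOUBLE, 0, 0, MPI_COMM_WORLD,
                 MPI_STATUS_IGNORE);
        for (int i = 0; i < ROWS; i++)
            printf("column[%d] = %g\n", i, column[i]);
    }

    MPI_Type_free(&col_type);
    MPI_Finalize();
    return 0;
}
```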

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Byna, S., Sun, X.-H., Thakur, R., Gropp, W. (2006). Automatic Memory Optimizations for Improving MPI Derived Datatype Performance. In: Mohr, B., Träff, J.L., Worringen, J., Dongarra, J. (eds) Recent Advances in Parallel Virtual Machine and Message Passing Interface. EuroPVM/MPI 2006. Lecture Notes in Computer Science, vol 4192. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11846802_36

  • DOI: https://doi.org/10.1007/11846802_36

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-39110-4

  • Online ISBN: 978-3-540-39112-8
