Empirical Optimization of Collective Communications with ADCL

Benkert, Katharina; Gabriel, Edgar

doi:10.1007/978-3-642-11851-7_3

Abstract

The Abstract Data and Communication Library (ADCL) allows for auto-tuning of communication operations for parallel applications. This paper presents a new set of interfaces introduced in ADCL in order to support most MPI collective communication operations, and thus enable the optimization of one of the most widely used features of the MPI specification. The paper discusses semantic as well as implementation aspects, and evaluates the new interfaces using the NPB FT benchmark on a large selection of platforms and MPI libraries.
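The general idea of empirical tuning can be illustrated with a minimal sketch. It assumes a simple strategy: run every candidate implementation of an operation a few times, then keep the one with the lowest median time. This is not ADCL's actual interface or selection logic. The function `select_fastest`, its parameters, and the candidate labels are hypothetical names introduced only for illustration.

```python
import time
from typing import Callable, Dict


def select_fastest(
    candidates: Dict[str, Callable[[], None]],
    trials: int = 3,
    timer: Callable[[], float] = time.perf_counter,
) -> str:
    """Time each candidate `trials` times; return the name with the lowest median.

    Hypothetical helper for illustration only, not part of ADCL or MPI.
    """
    if not candidates:
        raise ValueError("at least one candidate is required")
    if trials < 1:
        raise ValueError("trials must be >= 1")

    medians = {}
    for name, run in candidates.items():
        samples = []
        for _ in range(trials):
            start = timer()
            run()  # execute one instance of the operation
            samples.append(timer() - start)
        samples.sort()
        # The median is less sensitive to a single slow outlier than the mean.
        medians[name] = samples[len(samples) // 2]

    # Pick the candidate whose median measured time is smallest.
    return min(medians, key=medians.get)
```

In a real parallel application each candidate would wrap a different implementation of the same collective operation, and the timings would have to be agreed upon across all processes before a winner is chosen. Both aspects are left out of this sketch.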

Author information

Authors and Affiliations

High Performance Computing Center Stuttgart (HLRS), University of Stuttgart, 70550, Stuttgart, Germany
Katharina Benkert
Parallel Software Technologies Laboratory, Department of Computer Science, University of Houston, Houston, TX, USA
Edgar Gabriel

Corresponding author

Correspondence to Katharina Benkert.

Editor information

Editors and Affiliations

Höchstleistungsrechenzentrum Stuttgart (HLRS), Universität Stuttgart, Nobelstraße 19, 70569 Stuttgart, Germany
Michael Resch
Höchstleistungsrechenzentrum Stuttgart (HLRS), Universität Stuttgart, Nobelstraße 19, 70569 Stuttgart, Germany
Katharina Benkert
Höchstleistungsrechenzentrum Stuttgart (HLRS), Universität Stuttgart, Nobelstraße 19, 70569 Stuttgart, Germany
Xin Wang
NEC High Performance Computing Europe GmbH, Prinzenallee 11, 40459 Düsseldorf, Germany
Martin Galle
NEC High Performance Computing Europe GmbH, Prinzenallee 11, 40459 Düsseldorf, Germany
Wolfgang Bez
Cyberscience Center, Tohoku University, Aramaki-Aza-Aoba 4F, Sendai, 980-8578, Japan
Hiroaki Kobayashi
German Research School for Simulation Sciences, Schinkelstrasse 2a, 52062 Aachen, Germany
Sabine Roller

About this paper

Cite this paper

Benkert, K., Gabriel, E. (2010). Empirical Optimization of Collective Communications with ADCL. In: Resch, M., et al. High Performance Computing on Vector Systems 2010. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-11851-7_3

DOI: https://doi.org/10.1007/978-3-642-11851-7_3
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-11850-0
Online ISBN: 978-3-642-11851-7
eBook Packages: Mathematics and Statistics (R0)
