Historic Learning Approach for Auto-tuning OpenACC Accelerated Scientific Applications

Siddiqui, Shahzeb; AlZayer, Fatemah; Feki, Saber

doi:10.1007/978-3-319-17353-5_19

Shahzeb Siddiqui¹⁶,
Fatemah AlZayer¹⁶ &
Saber Feki¹⁷

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 8969))

Included in the following conference series:

International Conference on High Performance Computing for Computational Science

766 Accesses
1 Citations

Abstract

The performance optimization of scientific applications usually requires an in-depth knowledge of the hardware and software. A performance tuning mechanism is suggested to automatically tune OpenACC parameters to adapt to the execution environment on a given system. A historic learning based methodology is suggested to prune the parameter search space for a more efficient auto-tuning process. This approach is applied to tune the OpenACC gang and vector clauses for a better mapping of the compute kernels onto the underlying architecture. Our experiments show a significant performance improvement against the default compiler parameters and drastic reduction in tuning time compared to a brute force search-based approach.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

OpenACC Standard specification. www.openacc-standard.org
OpenMP 4.0 specification. www.openmp.org/mp-documents/OpenMP4.0.0.pdf
Gabriel, E., Feki, S., Benkert, K., Chaarawi, M.: The abstract data and communication library. J. Algorithms Comput. Technol. 2(4), 581600 (2008)
Article Google Scholar
Gabriel, E., Feki, S., Benkert, K., Resch, M.: Towards performance and portability through runtime adaption for high performance computing applications. In: International Supercomputing Conference, Dresden, Germany, June 2008
Google Scholar
Choi, J.W., Singh, A., Vuduc, R.W.: Model-driven autotuning of sparse matrix-vector multiply on GPUs. In: Proceedings of the 15th Symposium on Principles and Practice of Parallel Programming
Google Scholar
Dolbeau, R., Bihan, S., Bodin, F.: HMPP: a hybrid multi-core parallel programming environment. In: The 1st Workshop on General Purpose Processing on Graphics Processing Units, GPGPU (2007)
Google Scholar
Siddiqui, S., Feki, S.: Predictive performance tuning of OpenACC accelerated applications, 29th International Conference, 22–26 June 2014, Leipzig, Germany. LNCS, vol. 8488, pp. 511–512 (2014)
Google Scholar
Feki, S., Gabriel, E.: A historic knowledge based approach for dynamic optimization. In: Proceedings of the International Conference on Parallel Computing, pp. 389–396 (2009)
Google Scholar
Feki, S., Gabriel, E.: Incorporating historic knowledge into a communication library for self-optimizing high performance computing applications. In: Second IEEE International Conference on Self-Adaptive and Self-Organizing Systems, Venice, Italy (2008)
Google Scholar
Frigo, M., Johnson, S.: The design and implementation of FFTW3. Proceedings of IEEE 93(2), 216–231 (2005)
Article Google Scholar
Mametjanov, A., Lowell, M.C., Norris, B.: Autotuning stencil-based computations on GPUs, In: Cluster Conference, Beijing, China (2012)
Google Scholar
Vuduc, R., Demmel, J.W., Bilmes, J.A.: Statistical models for empirical search-based performance tuning. Int. J. High Perform. Comput. Appl. 18(1), 6594 (2004)
Article Google Scholar
Tillmann, M., Karcher, T., Dachsbacher, C., Tichy, W.F.: Application-independent autotuning for GPUs. In: International Conference on Parallel Computing, Munich, Germany (2013)
Google Scholar
Feki, S., Al-Jarro, A., Bagci, H.: Multi-GPU-based acceleration of the explicit time domain volume integral equation solver using MPI-OpenACC. In: IEEE International Symposium on Antennas and Propagation and USNC/URSI National Radio Science, Lake Buena Vista, Florida, USA (2013)
Google Scholar
Bodin, F.: Using CAPS compiler on NVIDIA kepler and CARMA systems. In: Supercomputing, Salt Lake City, Utah, USA (2012)
Google Scholar

Download references

Acknowledgments

The authors would like to thank NVIDIA for the hardware donation to KAUST as CUDA center of research and KAUST IT Research Computing for their support.

Author information

Authors and Affiliations

Computer, Electrical and Mathematical Sciences and Engineering Division, Extreme Computing Research Center, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
Shahzeb Siddiqui & Fatemah AlZayer
KAUST Supercomputing Laboratory, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
Saber Feki

Authors

Shahzeb Siddiqui
View author publications
You can also search for this author in PubMed Google Scholar
Fatemah AlZayer
View author publications
You can also search for this author in PubMed Google Scholar
Saber Feki
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Saber Feki .

Editor information

Editors and Affiliations

IRIT, ENSEEIHT, Toulouse Cedex, France
Michel Daydé
Lawrence Berkeley National Laboratory, Berkeley, California, USA
Osni Marques
Information Technology Center, The University of Tokyo, Tokyo, Japan
Kengo Nakajima

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Siddiqui, S., AlZayer, F., Feki, S. (2015). Historic Learning Approach for Auto-tuning OpenACC Accelerated Scientific Applications. In: Daydé, M., Marques, O., Nakajima, K. (eds) High Performance Computing for Computational Science -- VECPAR 2014. VECPAR 2014. Lecture Notes in Computer Science(), vol 8969. Springer, Cham. https://doi.org/10.1007/978-3-319-17353-5_19

Download citation

DOI: https://doi.org/10.1007/978-3-319-17353-5_19
Published: 18 April 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-17352-8
Online ISBN: 978-3-319-17353-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics