Abstract
The performance optimization of scientific applications usually requires an in-depth knowledge of the hardware and software. A performance tuning mechanism is suggested to automatically tune OpenACC parameters to adapt to the execution environment on a given system. A historic learning based methodology is suggested to prune the parameter search space for a more efficient auto-tuning process. This approach is applied to tune the OpenACC gang and vector clauses for a better mapping of the compute kernels onto the underlying architecture. Our experiments show a significant performance improvement against the default compiler parameters and drastic reduction in tuning time compared to a brute force search-based approach.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
OpenACC Standard specification. www.openacc-standard.org
OpenMP 4.0 specification. www.openmp.org/mp-documents/OpenMP4.0.0.pdf
Gabriel, E., Feki, S., Benkert, K., Chaarawi, M.: The abstract data and communication library. J. Algorithms Comput. Technol. 2(4), 581600 (2008)
Gabriel, E., Feki, S., Benkert, K., Resch, M.: Towards performance and portability through runtime adaption for high performance computing applications. In: International Supercomputing Conference, Dresden, Germany, June 2008
Choi, J.W., Singh, A., Vuduc, R.W.: Model-driven autotuning of sparse matrix-vector multiply on GPUs. In: Proceedings of the 15th Symposium on Principles and Practice of Parallel Programming
Dolbeau, R., Bihan, S., Bodin, F.: HMPP: a hybrid multi-core parallel programming environment. In: The 1st Workshop on General Purpose Processing on Graphics Processing Units, GPGPU (2007)
Siddiqui, S., Feki, S.: Predictive performance tuning of OpenACC accelerated applications, 29th International Conference, 22–26 June 2014, Leipzig, Germany. LNCS, vol. 8488, pp. 511–512 (2014)
Feki, S., Gabriel, E.: A historic knowledge based approach for dynamic optimization. In: Proceedings of the International Conference on Parallel Computing, pp. 389–396 (2009)
Feki, S., Gabriel, E.: Incorporating historic knowledge into a communication library for self-optimizing high performance computing applications. In: Second IEEE International Conference on Self-Adaptive and Self-Organizing Systems, Venice, Italy (2008)
Frigo, M., Johnson, S.: The design and implementation of FFTW3. Proceedings of IEEE 93(2), 216–231 (2005)
Mametjanov, A., Lowell, M.C., Norris, B.: Autotuning stencil-based computations on GPUs, In: Cluster Conference, Beijing, China (2012)
Vuduc, R., Demmel, J.W., Bilmes, J.A.: Statistical models for empirical search-based performance tuning. Int. J. High Perform. Comput. Appl. 18(1), 6594 (2004)
Tillmann, M., Karcher, T., Dachsbacher, C., Tichy, W.F.: Application-independent autotuning for GPUs. In: International Conference on Parallel Computing, Munich, Germany (2013)
Feki, S., Al-Jarro, A., Bagci, H.: Multi-GPU-based acceleration of the explicit time domain volume integral equation solver using MPI-OpenACC. In: IEEE International Symposium on Antennas and Propagation and USNC/URSI National Radio Science, Lake Buena Vista, Florida, USA (2013)
Bodin, F.: Using CAPS compiler on NVIDIA kepler and CARMA systems. In: Supercomputing, Salt Lake City, Utah, USA (2012)
Acknowledgments
The authors would like to thank NVIDIA for the hardware donation to KAUST as CUDA center of research and KAUST IT Research Computing for their support.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Siddiqui, S., AlZayer, F., Feki, S. (2015). Historic Learning Approach for Auto-tuning OpenACC Accelerated Scientific Applications. In: Daydé, M., Marques, O., Nakajima, K. (eds) High Performance Computing for Computational Science -- VECPAR 2014. VECPAR 2014. Lecture Notes in Computer Science(), vol 8969. Springer, Cham. https://doi.org/10.1007/978-3-319-17353-5_19
Download citation
DOI: https://doi.org/10.1007/978-3-319-17353-5_19
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-17352-8
Online ISBN: 978-3-319-17353-5
eBook Packages: Computer ScienceComputer Science (R0)