Porting and scaling OpenACC applications on massively-parallel, GPU-accelerated supercomputers

Hart, A.; Ansaloni, R.; Gray, A.

doi:10.1140/epjst/e2012-01634-y

Porting and scaling OpenACC applications on massively-parallel, GPU-accelerated supercomputers

Regular Article
Published: 06 September 2012

Volume 210, pages 5–16, (2012)
Cite this article

The European Physical Journal Special Topics Aims and scope Submit manuscript

A. Hart¹,
R. Ansaloni² &
A. Gray³

496 Accesses
21 Citations
Explore all metrics

Abstract

An increasing number of massively-parallel supercomputers are based on heterogeneous node architectures combining traditional, powerful multicore CPUs with energy-efficient GPU accelerators. Such systems offer high computational performance with modest power consumption. As the industry trend of closer integration of CPU and GPU silicon continues, these architectures are a possible template for future exascale systems. Given the longevity of large-scale parallel HPC applications, it is important that there is a mechanism for easy migration to such hybrid systems. The OpenACC programming model offers a directive-based method for porting existing codes to run on hybrid architectures. In this paper, we describe our experiences in porting the Himeno benchmark to run on the Cray XK6 hybrid supercomputer. We describe the OpenACC programming model and the changes needed in the code, both to port the functionality and to tune the performance. Despite the additional PCIe-related overheads when transferring data from one GPU to another over the Cray Gemini interconnect, we find the application gives very good performance and scales well. Of particular interest is the facility to launch OpenACC kernels and data transfers asynchronously, which speeds the Himeno benchmark by 5%–10%. Comparing performance with an optimised code on a similar CPU-based system (using 32 threads per node), we find the OpenACC GPU version to be just under twice the speed in a node-for-node comparison. This speed-up is limited by the computational simplicity of the Himeno benchmark and is likely to be greater for more complicated applications.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Containerization technologies: taxonomies, applications and challenges

Article 08 June 2021

FCC-hh: The Hadron Collider

Article Open access 05 July 2019

WRF-MOSIT: a modular and cross-platform tool for configuring and installing the WRF model

Article 10 November 2023

References

J.Dongarra, P.Beckman, et al., Int J. High Performance Computing Applications 25, ISSN 1094-3420, http://www.exascale.org/mediawiki/images/2/20/IESP-roadmap.pdf
J.Levesque, G.Wagenbreth, High Performance Computing: Programming and Applications (Chapman & Hall/CRC Computational Science, 2010)
The OpenACC standard, http://www.openacc-standard.org
R.Himeno, The Himeno benchmark, http://accc.riken.jp/HPC_e/himenobmt_e.html
R.Ansaloni, A.Hart, Cray’s approach to heterogeneous computing, in: Advances of Parallel Computing, Proceedings of ParCo 2011 conference (IOS-Press, Amsterdam, to appear)
The Cray XK6, http://www.cray.com/Products/XK6
The Cray XE6, http://www.cray.com/Products/XE6
CUDA, http://www.nvidia.com/object/cuda_home.html
OpenCL, http://www.khronos.org/opencl
OpenMP, http://www.openmp.org
NVIDIA, http://www.nvidia.com
W.Long, private communication
A.Gray, A.Hart, A.Richardson, K.Stratford, Lattice Boltzmann for large-scale GPU systems, in: Advances of Parallel Computing, Proceedings of ParCo 2011 conference (IOS-Press, Amsterdam, to appear)

Download references

Author information

Authors and Affiliations

Cray Exascale Research Initiative Europe, King’s Buildings, Edinburgh, EH9 3JZ, UK
A. Hart
Cray Italy S.r.l., via Motta 10, 20144, Milano, Italy
R. Ansaloni
EPCC, The University of Edinburgh, King’s Buildings, Edinburgh, EH9 3JZ, UK
A. Gray

Authors

A. Hart
View author publications
You can also search for this author in PubMed Google Scholar
R. Ansaloni
View author publications
You can also search for this author in PubMed Google Scholar
A. Gray
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to A. Hart.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Hart, A., Ansaloni, R. & Gray, A. Porting and scaling OpenACC applications on massively-parallel, GPU-accelerated supercomputers. Eur. Phys. J. Spec. Top. 210, 5–16 (2012). https://doi.org/10.1140/epjst/e2012-01634-y

Download citation

Received: 30 April 2012
Revised: 25 June 2012
Published: 06 September 2012
Issue Date: August 2012
DOI: https://doi.org/10.1140/epjst/e2012-01634-y

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Porting and scaling OpenACC applications on massively-parallel, GPU-accelerated supercomputers

Abstract

Access this article

Similar content being viewed by others

Containerization technologies: taxonomies, applications and challenges

FCC-hh: The Hadron Collider

WRF-MOSIT: a modular and cross-platform tool for configuring and installing the WRF model

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Porting and scaling OpenACC applications on massively-parallel, GPU-accelerated supercomputers

Abstract

Access this article

Similar content being viewed by others

Containerization technologies: taxonomies, applications and challenges

FCC-hh: The Hadron Collider

WRF-MOSIT: a modular and cross-platform tool for configuring and installing the WRF model

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation