Abstract
OpenMP [14] is the dominant programming model for shared-memory parallelism in C, C++ and Fortran due to its easy-to-use directive-based style, portability and broad support by compiler vendors. Compute-intensive application regions are increasingly offloaded to accelerator devices such as GPUs and DSPs, and these devices need a programming model with similar characteristics. This paper presents extensions to OpenMP that provide such a programming model. Our results demonstrate that a high-level programming model can deliver accelerated performance comparable to that of hand-coded CUDA implementations.
References
AMD: The AMD Fusion Family of APUs (March 2011), http://sites.amd.com/us/fusion
Ayguadé, E., et al.: Extending OpenMP to survive the heterogeneous multi-core era. International Journal of Parallel Programming 38, 440–459 (2010), http://dx.doi.org/10.1007/s10766-010-0135-4
Bailey, D.H., et al.: The NAS parallel benchmarks. International Journal of High Performance Computing Applications 5(3), 63–73 (1991)
CAPS: HMPP (November 2010), http://www.caps-entreprise.com
Clearspeed: Support (November 2010), http://support.clearspeed.com
Han, T.D., Abdelrahman, T.S.: hiCUDA: a high-level directive-based language for GPU programming. In: Proceedings of 2nd Workshop on General Purpose Processing on Graphics Processing Units, GPGPU-2, pp. 52–61. ACM, New York (2009), http://doi.acm.org/10.1145/1513895.1513902
Intel Corp.: Intel C++ Compiler 12.0 User and Reference Guides (March 2011), http://software.intel.com
Intel Corp.: Intel unveils new product plans for high-performance computing (March 2011), http://www.intel.com
Khronos Group: The OpenCL Specification, v. 1.1 (September 2010), http://www.khronos.org/registry/cl/
Lee, S., Eigenmann, R.: OpenMPC: Extended OpenMP Programming and Tuning for GPUs. In: Proceedings of the 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2010, pp. 1–11. IEEE Computer Society, Los Alamitos (2010), http://dx.doi.org/10.1109/SC.2010.36
MCA: The Multicore Association (2011), http://www.multicore-association.com
Nvidia Corp.: NVIDIA CUDA C Programming Guide, v. 3.2 (2010), http://developer.nvidia.com/object/gpucomputing.html
Nvidia Corp.: What is CUDA (February 2011), http://www.nvidia.com/object/what_is_cuda_new.html
OpenMP ARB: OpenMP Application Program Interface, v. 3.0 (May 2008), http://openmp.org/wp/openmp-specifications
PGI: Accelerator (November 2011), http://www.pgroup.com/resources/accel.htm
PGI: CUDA Fortran (March 2011), http://www.pgroup.com/resources/cudafortran.htm
Wang, P.H., et al.: EXOCHI: architecture and programming environment for a heterogeneous multi-core multithreaded system. In: Proceedings of the 2007 ACM SIGPLAN Conference on Programming Language Design and Implementation, pp. 156–166. ACM, New York (2007), http://doi.acm.org/10.1145/1250734.1250753
© 2011 Springer-Verlag Berlin Heidelberg
Cite this paper
Beyer, J.C., Stotzer, E.J., Hart, A., de Supinski, B.R. (2011). OpenMP for Accelerators. In: Chapman, B.M., Gropp, W.D., Kumaran, K., Müller, M.S. (eds) OpenMP in the Petascale Era. IWOMP 2011. Lecture Notes in Computer Science, vol 6665. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-21487-5_9
DOI: https://doi.org/10.1007/978-3-642-21487-5_9
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-21486-8
Online ISBN: 978-3-642-21487-5