Skip to main content

The Realm of Graphical Processing Unit (GPU) Computing

  • Chapter
  • First Online:
Topics in Parallel and Distributed Computing

Abstract

The goal of the chapter is to introduce the upper-level Computer Engineering/Computer Science undergraduate (UG) students to general-purpose graphical processing unit (GPGPU) computing. The specific focus of the chapter is on GPGPU computing using the Compute Unified Device Architecture (CUDA) C framework due to the following three reasons: (1) Nvidia GPUs are ubiquitous in high-performance computing, (2) CUDA is relatively easy to understand versus OpenCL, especially for UG students with limited heterogeneous device programming experience, and (3) CUDA experience simplifies learning OpenCL and OpenACC. The chapter consists of nine pedagogical sections with several active-learning exercises to effectively engage students with the text. The chapter opens with an introduction to GPGPU computing. The chapter sections include: (1) Data parallelism; (2) CUDA program structure; (3) CUDA compilation flow; (4) CUDA thread organization; (5) Kernel: Execution configuration and kernel structure; (6) CUDA memory organization; (7) CUDA optimizations; (8) Case study: Image convolution on GPUs; and (9) GPU computing: The future. The authors believe that the chapter layout facilitates effective student-learning by starting from the basics of GPGPU computing and then leading up to the advanced concepts. With this chapter, the authors aim to equip students with the necessary skills to undertake graduate-level courses on GPU programming and make a strong start with undergraduate research.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 44.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 59.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 59.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. CUDA zone. https://developer.nvidia.com/cuda-zone. Last Accessed 11 Feb. 2018

  2. cuBLAS. https://developer.nvidia.com/cublas. Last Accessed 11 Feb. 2018

  3. Nvidia cuDDN. GPU Accelerated Deep Learning. https://developer.nvidia.com/cudnn. Last Accessed 11 Feb. 2018

  4. OpenCL overview. https://www.khronos.org/opencl/. Last Accessed 11 Feb. 2018

  5. OpenACC. More sciene, less programming. https://www.openacc.org/. Last Accessed 11 Feb. 2018

  6. Thrust. https://developer.nvidia.com/thrust. https://developer.nvidia.com/cudnn. Last Accessed 11 Feb. 2018

  7. Nvidia. www.nvidia.com. Last Accessed 11 Feb. 2018

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Vivek K. Pallipuram .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer International Publishing AG, part of Springer Nature

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Pallipuram, V.K., Gao, J. (2018). The Realm of Graphical Processing Unit (GPU) Computing. In: Prasad, S., Gupta, A., Rosenberg, A., Sussman, A., Weems, C. (eds) Topics in Parallel and Distributed Computing. Springer, Cham. https://doi.org/10.1007/978-3-319-93109-8_8

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-93109-8_8

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-93108-1

  • Online ISBN: 978-3-319-93109-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics