Part of the book series: Lecture Notes in Computational Science and Engineering ((LNCSE,volume 51))

Summary

This chapter provides an introduction to the use of Graphics Processor Units (GPUs) as parallel computing devices. It describes the architecture, the available functionality and the programming model. Simple examples and references to freely available tools and resources motivate the reader to explore these new possibilities. An overview of the different applications of GPUs demonstrates their wide applicability, yet also highlights limitations of their use. Finally, a glimpse into the future of GPUs sketches the growing prospects of these inexpensive parallel computing devices.

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

Cite this paper

Rumpf, M., Strzodka, R. (2006). Graphics Processor Units: New Prospects for Parallel Computing. In: Bruaset, A.M., Tveito, A. (eds) Numerical Solution of Partial Differential Equations on Parallel Computers. Lecture Notes in Computational Science and Engineering, vol 51. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-31619-1_3
