Abstract
Significant manual design effort is required to implement a software routine on a GPU. This chapter presents an automated approach to partition a software application into kernels (which are executed in parallel) that can be run on the GPU. The software application should satisfy the constraint that it is executed multiple times on different data, and there exist no control dependencies between invocations.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Oink – A collaboration of C static analysis tools. http://www.cubewano.org/oink
Fisher, J.A., Ellis, J.R., Ruttenberg, J.C., Nicolau, A.: Parallel processing: A smart compiler and a dumb machine. SIGPLAN Notices 19(6), 37–47 (1984)
Govindaraju, N.K., Lloyd, B., Wang, W., Lin, M., Manocha, D.: Fast computation of database operations using graphics processors. In: SIGMOD ’04: Proceedings of the 2004 ACM SIGMOD International Conference on Management of Data, pp. 215–226 (2004)
He, B., Fang, W., Luo, Q., Govindaraju, N.K., Wang, T.: Mars: A mapreduce framework on graphics processors. In: PACT ’08: Proceedings of the 17th International Conference on Parallel Architectures and Compilation Techniques, pp. 260–269 (2008)
Karypis, G., Kumar, V.: A Software package for Partitioning Unstructured Graphs, Partitioning Meshes and Computing Fill-Reducing Orderings of Sparse Matrices. http://www-users.cs.umn.edu/~karypis/metis (1998)
Kuck, Lawrie, D., Cytron, R., Sameh, A., Gajski, D.: The architecture and programming of the Cedar System. Cedar Document no. 21, University of Illinois at Urbana-Champaign (1983)
Nagel, L.: SPICE: A computer program to simulate computer circuits. In: University of California, Berkeley UCB/ERL Memo M520 (1995)
Pharr, M., Fernando, R.: GPU Gems 2: Programming Techniques for High-Performance Graphics and General-Purpose Computation. Addison-Wesley Professional, Reading, MA (2005)
Sintorn, E., Assarsson, U.: Fast parallel GPU-sorting using a hybrid algorithm. Journal of Parallel and Distributed Computing 68(10), 1381–1388 (2008)
Wall, L., Schwartz, R.: Programming perl. O’Reilly and Associates, Inc., Sebastopol, CA (1992)
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
Copyright information
© 2010 Springer Science+Business Media, LLC
About this chapter
Cite this chapter
Gulati, K., Khatri, S.P. (2010). Automated Approach for Graphics Processor Based Software Acceleration. In: Hardware Acceleration of EDA Algorithms. Springer, Boston, MA. https://doi.org/10.1007/978-1-4419-0944-2_11
Download citation
DOI: https://doi.org/10.1007/978-1-4419-0944-2_11
Published:
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4419-0943-5
Online ISBN: 978-1-4419-0944-2
eBook Packages: EngineeringEngineering (R0)