Simulation and Application Performance Evaluation Using GPU Through CUDA C & Deep Learning in TensorFlow

Kumar, Ajeet; Khanna, Abhishek

doi:10.1007/978-981-10-8527-7_34

Part of the book series: Communications in Computer and Information Science (CCIS, volume 799)

Included in the following conference series:

International Conference on Recent Developments in Science, Engineering and Technology

Abstract

GPUs have recently attracted the attention of many application developers as commodity data-parallel coprocessors. The newest generations of GPU architecture provide easier programmability and increased generality while maintaining the tremendous memory bandwidth and computational power of traditional GPUs. This opportunity should redirect efforts in GPU research toward establishing principles and strategies that allow efficient mapping of computation to graphics hardware. This work presents the organization and features of the GeForce GTX 560 Ti processor, along with generalized optimization strategies. The key to performance on this platform is using massive multithreading to exploit the large number of cores and hide global memory latency. To achieve this, developers face the challenge of striking the right balance between each thread's resource usage and the number of simultaneously active threads. The resources to manage include the number of registers and the amount of on-chip memory used per thread, the number of threads per multiprocessor, and global memory bandwidth. The authors also obtain increased performance by reordering accesses to off-chip memory so that requests to the same or adjacent memory locations are combined, and by applying classical optimizations that reduce the number of executed operations. These strategies are applied across a variety of applications and domains and achieve application speedups between 10.5X and 14X. A similar result was achieved on a single-core GPU using a deep learning technique in the TensorFlow framework.
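
The trade-off between each thread's resource usage and the number of simultaneously active threads can be illustrated with a rough occupancy estimate. The sketch below is not the authors' code. The per-multiprocessor limits are assumed values for a compute capability 2.1 device such as the GeForce GTX 560 Ti, the names `SMLimits`, `resident_blocks`, and `occupancy` are hypothetical, and hardware allocation granularity is ignored, so real figures may be somewhat lower.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SMLimits:
    """Per-multiprocessor limits (assumed values for compute capability 2.1)."""
    max_threads: int = 1536        # resident threads per multiprocessor
    max_blocks: int = 8            # resident thread blocks per multiprocessor
    registers: int = 32768         # 32-bit registers per multiprocessor
    shared_mem_bytes: int = 49152  # on-chip shared memory per multiprocessor


def resident_blocks(limits, threads_per_block, regs_per_thread, smem_per_block):
    """Blocks that fit on one multiprocessor: the tightest resource limit wins."""
    by_threads = limits.max_threads // threads_per_block
    by_regs = limits.registers // (regs_per_thread * threads_per_block)
    # A kernel that uses no shared memory is not limited by it.
    if smem_per_block > 0:
        by_smem = limits.shared_mem_bytes // smem_per_block
    else:
        by_smem = limits.max_blocks
    return min(limits.max_blocks, by_threads, by_regs, by_smem)


def occupancy(limits, threads_per_block, regs_per_thread, smem_per_block):
    """Fraction of the multiprocessor's thread slots that are occupied."""
    blocks = resident_blocks(limits, threads_per_block,
                             regs_per_thread, smem_per_block)
    return blocks * threads_per_block / limits.max_threads


if __name__ == "__main__":
    sm = SMLimits()
    # Same block size and shared memory, increasing register use per thread.
    for regs in (20, 22, 32):
        print(regs, "registers/thread ->",
              round(occupancy(sm, 256, regs, 4096), 3))
```

Under these assumed limits, raising register use from 20 to 32 per thread with 256-thread blocks reduces the number of resident blocks from 6 to 4, which leaves fewer active threads available to hide global memory latency.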

Author information

Authors and Affiliations

Birla Institute of Technology and Science, Pilani, India
Ajeet Kumar
Maharaja Surajmal Institute of Technology, GGSIPU, Delhi, India
Abhishek Khanna

Corresponding author

Correspondence to Ajeet Kumar.

Editor information

Editors and Affiliations

University of Arkansas, Fayetteville, AR, USA
Brajendra Panda
GD Goenka University, Gurugram, Haryana, India
Sudeep Sharma
GD Goenka University, Gurugram, Haryana, India
Nihar Ranjan Roy

Cite this paper

Kumar, A., Khanna, A. (2018). Simulation and Application Performance Evaluation Using GPU Through CUDA C & Deep Learning in TensorFlow. In: Panda, B., Sharma, S., Roy, N. (eds) Data Science and Analytics. REDSET 2017. Communications in Computer and Information Science, vol 799. Springer, Singapore. https://doi.org/10.1007/978-981-10-8527-7_34

DOI: https://doi.org/10.1007/978-981-10-8527-7_34
Published: 08 March 2018
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-8526-0
Online ISBN: 978-981-10-8527-7
eBook Packages: Computer Science, Computer Science (R0)
