Abstract
General-Purpose computing on Graphics Processing Units (GPGPU) is an emerging field of research that allows software developers to apply the significant computing resources of GPUs to a wider range of applications. While traditional high-performance computing environments such as clusters, grids and supercomputers require significant architectural modifications to incorporate GPUs, volunteer computing grids already have these resources available, since most personal computers have GPUs for recreational use. Additionally, volunteer computing grids are upgraded gradually as volunteers upgrade their hardware, whereas clusters, grids and supercomputers are typically upgraded only when replaced by newer hardware. As such, MilkyWay@Home’s volunteer computing system is an excellent testbed for measuring the potential of large-scale distributed GPGPU computing across a large number of heterogeneous GPUs. This work discusses the implementation and optimization of the MilkyWay@Home client application for both Nvidia and ATI GPUs. A 17 times speedup was achieved for double-precision calculations on an Nvidia GeForce GTX 285 card, and a 109 times speedup for double-precision calculations on an ATI HD5870 card, compared to the CPU version running on one core of a 3.0 GHz AMD Phenom(tm) II X4 940. Single-precision calculations were also evaluated; they increased performance by a further 6.2 times on the ATI card and 7.8 times on the Nvidia card, but with some loss of accuracy. Modifications to the BOINC infrastructure that enable GPU discovery and utilization are also discussed. The resulting software enabled MilkyWay@Home to use GPU applications for a significant increase in computing power, approximately 216 teraflops at the time of this publication, which would place the combined power of these GPUs between the 11th and 12th fastest supercomputers in the world.
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Desell, T. et al. (2010). Accelerating the MilkyWay@Home Volunteer Computing Project with GPUs. In: Wyrzykowski, R., Dongarra, J., Karczewski, K., Wasniewski, J. (eds) Parallel Processing and Applied Mathematics. PPAM 2009. Lecture Notes in Computer Science, vol 6067. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-14390-8_29
DOI: https://doi.org/10.1007/978-3-642-14390-8_29
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-14389-2
Online ISBN: 978-3-642-14390-8
eBook Packages: Computer Science, Computer Science (R0)