Colloquium: Large scale simulations on GPU clusters

Bernaschi, Massimo; Bisson, Mauro; Fatica, Massimiliano

doi:10.1140/epjb/e2015-60180-8

Colloquium: Large scale simulations on GPU clusters

Colloquium
Published: 17 June 2015

Volume 88, article number 158, (2015)
Cite this article

The European Physical Journal B Aims and scope Submit manuscript

Massimo Bernaschi¹,
Mauro Bisson² &
Massimiliano Fatica²

164 Accesses
2 Citations
Explore all metrics

Abstract

Graphics processing units (GPU) are currently used as a cost-effective platform for computer simulations and big-data processing. Large scale applications require that multiple GPUs work together but the efficiency obtained with cluster of GPUs is, at times, sub-optimal because the GPU features are not exploited at their best. We describe how it is possible to achieve an excellent efficiency for applications in statistical mechanics, particle dynamics and networks analysis by using suitable memory access patterns and mechanisms like CUDA streams, profiling tools, etc. Similar concepts and techniques may be applied also to other problems like the solution of Partial Differential Equations.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

References

NVIDIA CUDA Compute Unified Device Architecture Programming Guide, http://www.nvidia.com/cuda
J. Glaser, T.D. Nguyen, J.A. Anderson, P. Lui, F. Spiga, J.A. Millan, D.C. Morse, S.C. Glotzer, Comput. Phys. Commun. 192, 97 (2015)
Article ADS Google Scholar
M. Bernaschi, G. Parisi, L. Parisi, Comput. Phys. Commun. 182, 6 (2011)
Article Google Scholar
T. Preis, P. Virnau, W. Paul, J. Schneider, J. Comput. Phys. 228, 4468 (2009)
Article MATH ADS Google Scholar
M. Weigel, Comput. Phys. Commun. 182, 1833 (2011)
Article ADS Google Scholar
M. Weigel, J. Comput. Phys. 231, 3064 (2012)
Article MATH ADS Google Scholar
M. Lulli, M. Bernaschi, G. Parisi, accepted in Comput. Phys. Commun.
M. Bernaschi, G. Amati, M. Bisson, S. Melchionna, S. Succi, Comput. Phys. Commun. 184, 2 (2012)
Google Scholar
M. Bisson, M. Bernaschi, S. Melchionna, Commun. Comput. Phys. 10, 1077 (2011)
Google Scholar
G. Karypis, V. Kumar, SIAM J. Sci. Comput. 20, 359 (1999)
Article MATH MathSciNet Google Scholar
C. Chevalier, F. Pellegrini, Parallel Comput. 34, 318 (2008)
Article MathSciNet Google Scholar
C. Begau, G. Sutmann, Comput. Phys. Commun. 190, 51 (2015)
Article ADS Google Scholar
D. Merrill, M. Garland, A. Grimshaw, Scalable gpu graph traversal, in Proceedings of the 17th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPoPP ’12 (ACM, New York, 2012), pp. 117–128
T. Hiragushi, D. Takahashi, in Algorithms and Architectures for Parallel Processing, Lect. Notes Computer Science (Springer, 2013), Vol. 8286, pp. 40–50
G. Karypis, V. Kumar, SIAM J. Sci. Comput. 20, 359 (1999)
Article MATH MathSciNet Google Scholar
M. Bernaschi, M. Bisson, E. Mastrostefano, submitted to IEEE Transactions on Distributed and Parallel Systems, arXiv:1408.1605 (2014)
N. Satish, C. Kim, J. Chhugani, P. Dubey, Large-scale Energy-efficient Graph Traversal: A Path to Efficient Data-intensive Supercomputing, in Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis (SC), 2012
F. Checconi, F. Petrini, J. Willcock, A. Lumsdaine, A.R. Choudhury, Y. Sabharwal, Breaking the speed and scalability barriers for graph exploration on distributed-memory machines, in Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis (SC), 2012

Download references

Author information

Authors and Affiliations

Istituto per le Applicazioni del Calcolo, National Research Council of Italy, Via dei Taurini 19, 00185, Roma, Italy
Massimo Bernaschi
NVIDIA Corporation, 2701 San Tomas Expressway, Santa Clara, CA, 95050, USA
Mauro Bisson & Massimiliano Fatica

Authors

Massimo Bernaschi
View author publications
You can also search for this author in PubMed Google Scholar
Mauro Bisson
View author publications
You can also search for this author in PubMed Google Scholar
Massimiliano Fatica
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Massimo Bernaschi.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Bernaschi, M., Bisson, M. & Fatica, M. Colloquium: Large scale simulations on GPU clusters. Eur. Phys. J. B 88, 158 (2015). https://doi.org/10.1140/epjb/e2015-60180-8

Download citation

Received: 06 March 2015
Revised: 11 May 2015
Published: 17 June 2015
DOI: https://doi.org/10.1140/epjb/e2015-60180-8

Keywords

Statistical and Nonlinear Physics

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Colloquium: Large scale simulations on GPU clusters

Abstract

Access this article

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation