Abstract
In this paper, we analyze the power consumption of different GPU-accelerated iterative solver implementations enhanced with energy-saving techniques. Specifically, while conducting kernel calls on the graphics accelerator, we manually set the host system to a power-efficient idle-wait status so as to leverage dynamic voltage and frequency control. While the usage of iterative refinement combined with mixed precision arithmetic often improves the execution time of an iterative solver on a graphics processor, this may not necessarily be true for the power consumption as well. To analyze the trade-off between computation time and power consumption we compare a plain GMRES solver and its preconditioned variant to the mixed-precision iterative refinement implementations based on the respective solvers. Benchmark experiments conclusively reveal how the usage of idle-wait during GPU-kernel calls effectively leverages the power-tools provided by hardware, and improves the energy performance of the algorithm.
Similar content being viewed by others
References
Lampe F (2010) Green-IT, Virtualisierung und Thin Clients: Mit neuen IT-Technologien Energieeffizienz erreichen, die Umwelt schonen und Kosten sparen. Vieweg/Teubner, Wiesbaten
Kogge P et al (2008) ExaScale computing study: technology challenges in achieving ExaScale systems
Dongarra J et al (2011) The international ExaScale software project roadmap. doi:10.1177/1094342010391989
Saad Y (2003) Iterative methods for sparse linear systems. Society for Industrial and Applied Mathematics, Philadelphia
Saad Y, Schultz MH (1986) Gmres: a generalized minimal residual algorithm for solving nonsymmetric linear systems. SIAM J Sci Stat Comput 7:856–869. Available: http://portal.acm.org/citation.cfm?id=14063.14074
Anzt H, Rocker B, Heuveline V (2010) Energy efficiency of mixed precision iterative refinement methods using hybrid hardware platforms. Comput Sci Res Dev 25(3):141–149
Anzt H, Castillo M, Fernández JC, Heuveline V, Mayo R, Quintana-Ortí ES, Rocker B (2011) Power consumption of mixed precision in the iterative solution of sparse linear systems. In: Proceedings of the 25th IEEE international parallel & distributed processing symposium
Anzt H, Aliaga J, Castillo M, Fernández JC, Mayo R, Quintana-Ortí ES (2011) Analysis and optimization of power consumption in the iterative solution of sparse linear systems on multi-core and many-core platforms. EMCL Preprint Series. Available: http://www.emcl.kit.edu/preprints/emcl-preprint-2011-05.pdf
NVIDIA CUDA CUBLAS library programming guide, 1st edn, NVIDIA Corporation (June 2007)
Bell N, Garland M (2009) Implementing sparse matrix-vector multiplication on throughput-oriented processors. In: Proceedings of the conference on high performance computing networking, storage and analysis, SC’09. ACM, New York, pp 18:1–18:11
Palma J, Daydé M, Marques O, Lopes J (eds) (2011) An error correction solver for linear systems: evaluation of mixed precision implementations. Lecture notes in computer science, vol 6449. Springer, Berlin
Dongarra JJ, Duff IS, Sorensen DC, van der Vorst HA (1998) Numerical linear algebra for high-performance computers. Society for Industrial and Applied Mathematics, Philadelphia
Göddeke D, Strzodka R, Turek S (2007) Performance and accuracy of hardware-oriented native-, emulated- and mixed-precision solvers in FEM simulations. Int J Parallel Emergent Distributed Syst 22(4):221–256
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Anzt, H., Castillo, M., Fernández, J.C. et al. Optimization of power consumption in the iterative solution of sparse linear systems on graphics processors. Comput Sci Res Dev 27, 299–307 (2012). https://doi.org/10.1007/s00450-011-0195-8
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00450-011-0195-8