Performance and Energy Analysis of the Iterative Solution of Sparse Linear Systems on Multicore and Manycore Architectures

  • José I. Aliaga
  • Hartwig AnztEmail author
  • Maribel Castillo
  • Juan C. Fernández
  • Germán León
  • Joaquín Pérez
  • Enrique S. Quintana-Ortí
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 8384)


In this paper we investigate the performance-energy balance of a variety of concurrent architectures, from general-purpose and digital signal multicore systems to graphics processors (GPUs), representative of current technology. This analysis employs the conjugate gradient method, an important algorithm for the iterative solution of linear systems that is basically composed of the sparse matrix-vector product and other (minor) vector kernels. To allow a fair comparison, we leverage simple implementations of the numerical methods and underlying kernels, and rely only on those optimizations applied by the target compiler.


Energy efficiency High-performance computing Sparse linear algebra Multicore processors Low-power processors GPUs 



This work was supported by the CICYT project TIN2011-23283 and FEDER, and by EU FET grant “EXA2GREEN” 318793.


  1. 1.
    CRESTA: collaborative research into Exascale systemware, tools and applications.
  2. 2.
    The Mont Blanc project.
  3. 3.
    Anzt, H., Heuveline, V., Aliaga, J., Castillo, M., Fernández, J., Mayo, R., Quintana-Ortí, E.S.: Analysis and optimization of power consumption in the iterative solution of sparse linear systems on multi-core and many-core platforms. In: Green Computing Conference and Workshops (IGCC), pp. 1–6 (2011)Google Scholar
  4. 4.
    Asanovic, K., et al.: The landscape of parallel computing research: a view from Berkeley. Technical Report UCB/EECS-2006-183, University of California at Berkeley, Electrical Engineering and Computer Sciences (2006)Google Scholar
  5. 5.
    Ashby, S., et al.: The opportunities and challenges of Exascale computing. Summary Report of the Advanced Scientific Computing Advisory Committee (ASCAC) Subcommittee, November 2010Google Scholar
  6. 6.
    Barrett, R., Berry, M., Chan, T.F., Demmel, J., Donato, J., Dongarra, J., Eijkhout, V., Pozo, R., Romine, C., der Vorst, H.V.: Templates for the Solution of Linear Systems: Building Blocks for Iterative Methods, 2nd edn. SIAM, Philadelphia (1994)CrossRefGoogle Scholar
  7. 7.
    Bekas, C., Curioni, A.: A new energy aware performance metric. Comput. Sci. Res. Dev. 25, 187–195 (2010). doi: 10.1007/s00450-010-0119-z CrossRefGoogle Scholar
  8. 8.
    Bell, N., Garland, M.: Efficient sparse matrix-vector multiplication on CUDA. NVIDIA Technical Report NVR-2008-004, NVIDIA Corporation, December 2008Google Scholar
  9. 9.
    Bergman, K., et al.: Exascale computing study: Technology challenges in achieving exascale systems. DARPA IPTO ExaScale Computing Study (2008)Google Scholar
  10. 10.
    Buluç, A., Williams, S., Oliker, L., Demmel, J.: Reduced-bandwidth multithreaded algorithms for sparse matrix-vector multiplication. In Proceedings of the IPDPS,  pp. 721–733 (2011)Google Scholar
  11. 11.
    Langville, A., Meyer, C.: Google’s PageRank and Beyond: The Science of Search Engine Rankings. Princeton University Press, Princeton (2009)Google Scholar
  12. 12.
    Saad, Y.: Iterative Methods for Sparse Linear Systems. Society for Industrial and Applied Mathematics, Philadelphia (2003)CrossRefzbMATHGoogle Scholar
  13. 13.
    Vázquez, F., Fernández, J.J., Garzón, E.M.: A new approach for sparse matrix vector product on nvidia gpus. Concurrency Comput. Pract. Experience 23(8), 815–826 (2011)CrossRefGoogle Scholar
  14. 14.
    Williams, S., Bell, N., Choi, J., Garland, M., Oliker, L., Vuduc, R.: Sparse matrix vector multiplication on multicore and accelerator systems. In: Kurzak, J., Bader, D.A., Dongarra, J. (eds.) Scientific Computing with Multicore Processors and Accelerators. CRC Press, Boca Raton (2010)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2014

Authors and Affiliations

  • José I. Aliaga
    • 1
  • Hartwig Anzt
    • 2
    Email author
  • Maribel Castillo
    • 1
  • Juan C. Fernández
    • 1
  • Germán León
    • 1
  • Joaquín Pérez
    • 1
  • Enrique S. Quintana-Ortí
    • 1
  1. 1.Dpto. de Ingeniería y Ciencia de ComputadoresUniversidad Jaume ICastellónSpain
  2. 2.Innovative Computing Lab (ICL)University of TennesseeKnoxvilleUSA

Personalised recommendations