Performance Tuning of Matrix Triple Products Based on Matrix Structure

Im, Eun-Jin; Bustany, Ismail; Ashcraft, Cleve; Demmel, James W.; Yelick, Katherine A.

doi:10.1007/11558958_89

Eun-Jin Im¹⁹,
Ismail Bustany²⁰,
Cleve Ashcraft²¹,
James W. Demmel²² &
…
Katherine A. Yelick²²

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 3732))

Included in the following conference series:

International Workshop on Applied Parallel Computing

1135 Accesses
1 Citations

Abstract

Sparse matrix computations arise in many scientific and engineering applications, but their performance is limited by the growing gap between processor and memory speed. In this paper, we present a case study of an important sparse matrix triple product problem that commonly arises in primal-dual optimization method.

Instead of a generic two-phase algorithm, we devise and implement a single pass algorithm that exploits the block diagonal structure of the matrix. Our algorithm uses fewer floating point operations and roughly half the memory of the two-phase algorithm. The speed-up of the one-phase scheme over the two-phase scheme is 2.04 on a 900 MHz Intel Itanium-2, 1.63 on an 1 GHz Power-4, and 1.99 on a 900 MHz Sun Ultra-3.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Goemans, M.X., Williamson, D.P.: The primal-dual method for approximation algorithms and its application to network design problems. In: Approximation Algorithms for NP-hard Problems, pp. 144–191. PWS Publishing Co., Boston (1996)
Google Scholar
Gilbert, J.R., Moler, C., Schreiber, R.: Sparse matrices in Matlab: Design and implementation. SIAM J. Matrix Analysis and Applications 13, 333–356 (1992)
Article MATH MathSciNet Google Scholar
Im, E., Yelick, K.A.: Optimizing sparse matrix computations for register reuse in SPARSITY. In: Alexandrov, V.N., Dongarra, J., Juliano, B.A., Renner, R.S., Tan, C.J.K. (eds.) ICCS-ComputSci 2001. LNCS, vol. 2073, pp. 127–136. Springer, Heidelberg (2001)
Chapter Google Scholar
Im, E., Yelick, K.A., Vuduc, R.: SPARSITY: Framework for Optimizing Sparse Matrix- Vector Multiply. International Journal of High Performance Computing Applications 18(1), 135–158 (2004)
Article Google Scholar
Vuduc, R., Gyulassy, A., Demmel, J.W., Yelick, K.A.: Memory Hierarchy Optimizations and Bounds for Sparse A ^TAx. In: Proceedings of the ICCS Workshop on Parallel Linear Algebra, Melbourne, Australia, June 2003. LNCS, vol. 2660, pp. 705–714. Springer, Heidelberg (2003)
Google Scholar

Download references

Author information

Authors and Affiliations

Kookmin University, Seoul, Korea
Eun-Jin Im
Barcelona Design Inc, USA
Ismail Bustany
Livermore Software Technology Corporation, USA
Cleve Ashcraft
U.C. Berkeley, USA
James W. Demmel & Katherine A. Yelick

Authors

Eun-Jin Im
View author publications
You can also search for this author in PubMed Google Scholar
Ismail Bustany
View author publications
You can also search for this author in PubMed Google Scholar
Cleve Ashcraft
View author publications
You can also search for this author in PubMed Google Scholar
James W. Demmel
View author publications
You can also search for this author in PubMed Google Scholar
Katherine A. Yelick
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Computer Science Department, University of Tennessee, 37996-3450, Knoxville, TN, USA
Jack Dongarra
Department of Informatics and Mathematical Modelling, Technical University of Denmark, DK-2800, Lyngby, Denmark
Kaj Madsen
Informatics & Mathematical Modeling, Technical University of Denmark, DK-2800, Lyngby, Denmark
Jerzy Waśniewski

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Im, EJ., Bustany, I., Ashcraft, C., Demmel, J.W., Yelick, K.A. (2006). Performance Tuning of Matrix Triple Products Based on Matrix Structure. In: Dongarra, J., Madsen, K., Waśniewski, J. (eds) Applied Parallel Computing. State of the Art in Scientific Computing. PARA 2004. Lecture Notes in Computer Science, vol 3732. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11558958_89

Download citation

DOI: https://doi.org/10.1007/11558958_89
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-29067-4
Online ISBN: 978-3-540-33498-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics