Statistical Models for Automatic Performance Tuning

Vuduc, Richard; Demmel, James W.; Bilmes, Jeff

doi:10.1007/3-540-45545-0_21

Richard Vuduc⁵,
James W. Demmel⁶ &
Jeff Bilmes⁷

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2073))

Included in the following conference series:

International Conference on Computational Science

2404 Accesses
10 Citations

Abstract

Achieving peak performance from library subroutines usually requires extensive, machine-dependent tuning by hand. Automatic tuning systems have emerged in response, and they typically operate, at compile-time, by (1) generating a large number of possible implementations of a subroutine, and (2) selecting a fast implementation by an exhaustive, empirical search. This paper applies statistical techniques to exploit the large amount of performance data collected during the search. First, we develop a heuristic for stopping an exhaustive compile-time search early if a near-optimal implementation is found. Second, we show how to construct run-time decision rules, based on run-time inputs, for selecting from among a subset of the best implementations. We apply our methods to actual performance data collected by the PHiPAC tuning system for matrix multiply on a variety of hardware platforms.

Download to read the full chapter text

Chapter PDF

Exploiting Historical Data: Pruning Autotuning Spaces and Estimating the Number of Tuning Steps

A multi-aspect online tuning framework for HPC applications

Article 16 May 2017

Michael Gerndt, Siegfried Benkner, … Anna Sikora

Prediction models for performance, power, and energy efficiency of software executed on heterogeneous hardware

Article 02 February 2018

Dénes Bán, Rudolf Ferenc, … Tibor Gyimóthy

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

J. Bilmes, K. Asanović, C. Chin, and J. Demmel. Optimizing matrix multiply using PHiPAC: a Portable, High-Performance, ANSI C coding methodology. In Proc. of the Int’l Conf. on Supercomputing, Vienna, Austria, July 1997.
Google Scholar
J. Bilmes, K. Asanović, J. Demmel, D. Lam, and C. Chin. The PHiPAC v1.0 matrix-multiply distribution. Technical Report UCB/CSD-98-1020, University of California, Berkeley, October 1998.
Google Scholar
Z. W. Birnbaum. Numerical tabulation of the distribution of Kolmogorov’s statistic for finite sample size. J. Am. Stat. Assoc., 47:425–441, September 1952.
Article MATH MathSciNet Google Scholar
E. Brewer. High-level optimization via automated statistical modeling. In Sym. Par. Alg. Arch., Santa Barbara, California, July 1995.
Google Scholar
J. Dongarra, J. D. Croz, I. Duff, and S. Hammarling. A set of level 3 basic linear algebra subprograms. ACM Trans. Math. Soft., 16(1):1–17, March 1990.
Article MATH Google Scholar
M. Frigo and S. Johnson. FFTW: An adaptive software architecture for the FFT. In Proc. of the Int’l Conf. on Acoustics, Speech, and Signal Processing, May 1998.
Google Scholar
G. Haentjens. An investigation of recursive FFT implementations. Master’s thesis, Carnegie Mellon University, 2000.
Google Scholar
E.-J. Im and K. Yelick. Optimizing sparse matrix vector multiplication on SMPs. In Proc. of the 9th SIAM Conf. on Parallel Processing for Sci. Comp., March 1999.
Google Scholar
M. I. Jordan. Why the logistic function? Technical Report 9503, MIT, 1995.
Google Scholar
T. Kisuki, P. M. Knijnenburg, M. F. O’Boyle, and H. Wijshoff. Iterative compilation in program optimization. In Proceedings of the 8th International Workshop on Compilers for Parallel Computers, pages 35–44, 2000.
Google Scholar
C. Lawson, R. Hanson, D. Kincaid, and F. Krogh. Basic linear algebra subprograms for Fortran usage. ACM Trans. Math. Soft., 5:308–323, 1979.
Article MATH Google Scholar
D. A. Schwartz, R. R. Judd, W. J. Harrod, and D. P. Manley. VSIPL 1.0 API, March 2000. http://www.vsipl.org.
B. Singer and M. Veloso. Learning to predict performance from formula modeling and training data. In Proc. of the 17th Int’l Conf. on Mach. Learn., 2000.
Google Scholar
S. S. Vadhiyar, G. E. Fagg, and J. Dongarra. Automatically tuned collective operations. In Proceedings of Supercomputing 2000, November 2000.
Google Scholar
V. N. Vapnik. Statistical Learning Theory. John Wiley and Sons, Inc., 1998.
Google Scholar
R. Vuduc, J. Demmel, and J. Bilmes. Statistical modeling of feedback data in an automatic tuning system. In MICRO-33: Third ACM Workshop on Feedback-Directed Dynamic Optimization, December 2000.
Google Scholar
C. Whaley and J. Dongarra. Automatically tuned linear algebra software. In Proc. of Supercomp., 1998.
Google Scholar

Download references

Author information

Authors and Affiliations

Computer Science Division, University of California at Berkeley, Berkeley, CA, 94720, USA
Richard Vuduc
Computer Science Division and Dept. of Mathematics, University of California at Berkeley, Berkeley, CA, 94720, USA
James W. Demmel
Dept. of Electrical Engineering, University of Washington, Seattle, WA, USA
Jeff Bilmes

Authors

Richard Vuduc
View author publications
You can also search for this author in PubMed Google Scholar
James W. Demmel
View author publications
You can also search for this author in PubMed Google Scholar
Jeff Bilmes
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

School of Computer Science, Cybernetics and Electronic Engineering, University of Reading, Whiteknights, P.O. Box 225, Reading, RG6 6AY, UK
Vassil N. Alexandrov
Innovative Computing Lab, Computer Science Department, University of Tennessee, 1122 Volunteer Blvd, Knoxville, TN, 37996-3450, USA
Jack J. Dongarra
Computer Science Department, California State University, Chico, CA, 95929-0410, USA
Benjoe A. Juliano & René S. Renner &
School of Computer Science, The Queen’s University of Belfast, Belfast, BT7 1NN, Northern Ireland, UK
C. J. Kenneth Tan

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Vuduc, R., Demmel, J.W., Bilmes, J. (2001). Statistical Models for Automatic Performance Tuning. In: Alexandrov, V.N., Dongarra, J.J., Juliano, B.A., Renner, R.S., Tan, C.J.K. (eds) Computational Science — ICCS 2001. ICCS 2001. Lecture Notes in Computer Science, vol 2073. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45545-0_21

Download citation

DOI: https://doi.org/10.1007/3-540-45545-0_21
Published: 17 July 2001
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-42232-7
Online ISBN: 978-3-540-45545-5
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics

Statistical Models for Automatic Performance Tuning

Abstract

Chapter PDF

Similar content being viewed by others

Exploiting Historical Data: Pruning Autotuning Spaces and Estimating the Number of Tuning Steps

A multi-aspect online tuning framework for HPC applications

Prediction models for performance, power, and energy efficiency of software executed on heterogeneous hardware

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Statistical Models for Automatic Performance Tuning

Abstract

Chapter PDF

Similar content being viewed by others

Exploiting Historical Data: Pruning Autotuning Spaces and Estimating the Number of Tuning Steps

A multi-aspect online tuning framework for HPC applications

Prediction models for performance, power, and energy efficiency of software executed on heterogeneous hardware

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation