AutoTune: A Plugin-Driven Approach to the Automatic Tuning of Parallel Applications

  • Conference paper
Applied Parallel and Scientific Computing (PARA 2012)

Abstract

Performance analysis and tuning are important steps in programming multicore- and manycore-based parallel architectures. While there are several tools to help developers analyze application performance, no tool provides recommendations about how to tune the code. The AutoTune project is extending Periscope, an automatic distributed performance analysis tool developed by Technische Universität München, with plugins for performance and energy efficiency tuning. The resulting Periscope Tuning Framework will be able to tune serial and parallel codes for multicore and manycore architectures and return tuning recommendations that can be integrated into the production version of the code. The whole tuning process, both performance analysis and tuning, will be performed automatically during a single run of the application.
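The plugin-driven tuning loop the abstract describes can be sketched as follows. This is a minimal illustration only: the class and function names (`Variant`, `ThreadCountPlugin`, `tune`) are hypothetical and do not reflect the actual Periscope Tuning Framework API. It assumes a plugin that defines a search space of code variants (here, thread counts) and a framework that measures each variant and returns the best one as a tuning recommendation.

```python
# Hypothetical sketch of a plugin-driven autotuning loop.
# Names and interfaces are illustrative, not the real Periscope API.
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass(frozen=True)
class Variant:
    """One point in a plugin's tuning search space."""
    params: Dict[str, int]

    def __hash__(self):
        return hash(tuple(sorted(self.params.items())))


class ThreadCountPlugin:
    """Toy tuning plugin: searches over thread counts for a parallel region."""
    def variants(self) -> List[Variant]:
        return [Variant({"threads": n}) for n in (1, 2, 4, 8)]


def tune(plugin, measure: Callable[[Variant], float]) -> Variant:
    """Measure every variant once and return the fastest as the recommendation."""
    timings = {v: measure(v) for v in plugin.variants()}
    return min(timings, key=timings.get)


# Toy cost model standing in for a real measured run: speedup saturates
# at 4 threads, after which overhead grows with the thread count.
def fake_runtime(v: Variant) -> float:
    n = v.params["threads"]
    return 10.0 / min(n, 4) + 0.1 * n


best = tune(ThreadCountPlugin(), fake_runtime)
print(best.params)  # the recommended tuning parameters
```

In the real framework, the measurement step would be driven by Periscope's performance analysis during a single application run rather than a synthetic cost function, and the winning variant would be reported to the developer as a tuning recommendation.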




Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Miceli, R. et al. (2013). AutoTune: A Plugin-Driven Approach to the Automatic Tuning of Parallel Applications. In: Manninen, P., Öster, P. (eds) Applied Parallel and Scientific Computing. PARA 2012. Lecture Notes in Computer Science, vol 7782. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-36803-5_24

  • DOI: https://doi.org/10.1007/978-3-642-36803-5_24

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-36802-8

  • Online ISBN: 978-3-642-36803-5

  • eBook Packages: Computer Science (R0)
