Abstract
Operation costs of high performance computers, like cooling and energy, drive HPC centers towards improving the efficient usage of their resources. Performance tuning through experts here is an indispensable ingredient to ensure efficient HPC operation. This ”brainware” component, in addition to software and hardware, is in fact crucial to ensure continued performance of codes in light of diversifying and changing hardware platforms. However, as tuning experts are a scarce and costly resource themselves, processes should be developed that ensure the quality of the performance tuning process. This is not to dampen human ingenuity, but to ensure that tuning effort time is limited to achieve a realistic substantial gain, and that code changes are accepted by users and made part of their code distribution. In this paper, we therefore formalize a service-based Performance Tuning Workflow to standardize the tuning process and to improve usage of tuning-expert time.
Keywords
- Code Change
- Tuning Process
- Performance Tune
- Improvement Report
- Tuning Activity
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
Download conference paper PDF
References
Bischof, C., an Mey, D., Iwainsky, C.: Brainware for Green HPC. In: Ludwig, T. (ed.) Proceedings EnA-HPC 2011 (2011) (to appear)
Behr, M., Arora, D., Benedict, N.A., O’Neill, J.J.: Intel compilers on linux clusters. Intel Developer Services online publication (October 2002)
Zeng, P., Sarholz, S., Iwainsky, C., Binninger, B., Peters, N., Herrmann, M.: Simulation of Primary Breakup for Diesel Spray with Phase Transition. In: Ropo, M., Westerholm, J., Dongarra, J. (eds.) PVM/MPI. LNCS, vol. 5759, pp. 313–320. Springer, Heidelberg (2009)
Altenfeld, R., Apel, M., an Mey, D., Böttger, B., Benke, S., Bischof, C.: Parallelising Computational Microstructure Simulations for Metallic Materials with OpenMP. In: Chapman, B.M., Gropp, W.D., Kumaran, K., Müller, M.S. (eds.) IWOMP 2011. LNCS, vol. 6665, pp. 1–11. Springer, Heidelberg (2011)
Geimer, M., Wolf, F., Wylie, B.J.N., Ábrahám, E., Becker, D., Mohr, B.: The Scalasca performance toolset architecture. Concurrency and Computation: Practice and Experience 22(6), 702–719 (2010)
Knüpfer, A., Brunst, H., Doleschal, J., Jurenz, M., Lieber, M., Mickler, H., Müller, M.S., Nagel, W.E.: The vampir performance analysis tool-set. In: Proceedings of the 2nd HLRS Parallel Tools Workshop, Stuttgart, Germany (July 2008)
Shende, S.S., Malony, A.D.: The tau parallel performance system. The International Journal of High Performance Computing Applications 20, 287–331 (2006)
GNU: gprof, http://sourceware.org/binutils/docs/gprof/
Intel: Intel©parallel amplifier (2011) http://software.intel.com/en-us/articles/intel-parallel-amplifier/
London, K., Moore, S., Mucci, P., Seymour, K., Luczak, R.: The papi cross-platform interface to hardware performance counters. In: Department of Defense Users Group Conference Proceedings, pp. 18–21 (2001)
Iwainsky, C., an Mey, D.: Comparing the Usability of Performance Analysis Tools. In: Cèsar, E., Alexander, M., Streit, A., Träff, J., Cèrin, C., Knüpfer, A., Kranzlmüller, D., Jha, S. (eds.) Euro-Par 2008 Workshops. LNCS, vol. 5415, pp. 315–325. Springer, Heidelberg (2009)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Iwainsky, C., Altenfeld, R., Mey, D.a., Bischof, C. (2012). Enhancing Brainware Productivity through a Performance Tuning Workflow. In: Alexander, M., et al. Euro-Par 2011: Parallel Processing Workshops. Euro-Par 2011. Lecture Notes in Computer Science, vol 7156. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-29740-3_23
Download citation
DOI: https://doi.org/10.1007/978-3-642-29740-3_23
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-29739-7
Online ISBN: 978-3-642-29740-3
eBook Packages: Computer ScienceComputer Science (R0)
