Abstract
This paper presents results on development of the auto-tuning framework named TuningGenie aimed at automating adjustment of parallel tasks to target platform. The framework works with source code of parallel software and performs source-to-source transformations by using facilities of a rule-based rewriting system. This approach significantly raises flexibility of proposed solution and enables changing computation structure of a tuned program in a declarative way. The framework’s architecture, lifecycle, offered toolset for optimization, demo examples and results of tuning of a computationally complex task of short-term meteorological forecasting are presented.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Amdahl, G.M.: Validity of the single processor approach to achieving large-scale computing capabilities. In: AFIPS Conference Proceedings, pp. 483–485. ACM, New York (1967)
Intel® Parallel Compilers. http://software.intel.com/en-us/articles/automatic-parallelization-with-intel-compilers
Oracle® Solaris Studio. http://www.oracle.com/technetwork/server-storage/solarisstudio/overview/index.html
Naono, K., Teranishi, K., Cavazos, J., Suda, R.: Software Automatic Tuning From Concepts to State-of-the-Art Results. Springer, Berlin (2010)
TermWare Framework. http://www.gradsoft.ua/products/termware_end.html
Doroshenko, A., Shevchenko, R.: A rewriting framework for rule-based programming dynamic applications. Fundamenta Informaticae 72(1), 95–108 (2006)
Eppstein, D., Galil, Z.: Parallel algorithmic techniques for combinatorial computation. Ann. Rev. Comput. Sci 3, 233–283 (1988)
Sedgewick, R., Wayne, K.: Algorithms, 4th edn. Addison-Wesley Professional, Boston (2011)
Prokop, H.: Cache-oblivious algorithms. In: Proceedings of 40th Annual Symposium on Foundations of Computer Science (FOCS ‘99), pp. 285–299. IEEE Press, New York (1999)
Andon, F.I., Tseitlin, G.E., Yatsenko, E.A.: Algebraic-Algorithmic Models and Methods of Parallel Programming. Akademperiodika, Kyiv (2007). (in Russian)
Ma, W., Krishnamoorthy, S., Agrawal, G.: Parameterized micro-benchmarking: an auto-tuning approach for complex applications. In: Proceedings of 9th Conference on Computing Frontiers, pp. 213–222. ACM, New York (2012)
TermWare Tutorial. http://www.gradsoft.ua/rus/Products/termware/docs/tutorial_eng.html
Prusov, V.A., Doroshenko, A.E., Chernysh, R.I., Guk, L.N.: Theoretical study of a numerical method to solve a diffusion-convection problem. Cybern. Syst. Anal. 44(2), 283–291 (2008)
Prusov, V.A., Doroshenko, A.E., Chernysh, R.I.: A method for numerical solution of a multidimensional convection-diffusion problem. Cybern. Syst. Anal. 45(1), 89–95 (2009)
Belov, P.N., Borisenkov, E.P., Panin, B.D.: Numerical Methods of Weather Prediction. Gidrometeoizdat, Leningrad (1989). (in Russian)
Database of Regional Meteorological Center Offenbach. ftp://ftp-outgoing.dwd.de
Karcher, T., Schaefer, C., Pankratius, V.: Auto-tuning support for manycore applications - perspectives for operating systems and compilers. Technical report, University of Karlsruhe, Karlsruhe, Germany (2009)
Whaley, R., Petitet, A., Dongarra, J.J.: Automated empirical optimizations of software and the ATLAS project. Parallel Comput. 27(1–2), 3–35 (2001)
Frigo, M., Johnson, S.: FFTW: an adaptive software architecture for the FF. Acoust. Speech Signal Process. 3, 1381–1384 (1998)
Irigoin, F., Jouvelot, P., Triolet, R.: Semantical interprocedural parallelization: an overview of the PIPS project. In: 5th International Conference on Supercomputing, pp. 244–251. ACM, New York (1991)
Katagiri, T., Kise, K., Honda, H., Yuba, T.: ABCLibScript: a directive to support specification of an auto-tuning facility for numerical software. J. Parallel Comput. 32(1), 92–112 (2006)
Schaefer, C.A., Pankratius, V., Tichy, W.F.: Atune-IL: an instrumentation language for auto-tuning parallel applications. In: Sips, H., Epema, D., Lin, H.-X. (eds.) Euro-Par 2009. LNCS, vol. 5704, pp. 9–20. Springer, Heidelberg (2009)
Parr, T.: A Functional language for generating structured text, May 2004
Parr, T.: The StringTemplate Homepage. http://www.stringtemplate.org/
Wirth, N.: Algorithms + Data Structures = Programs. Prentice Hall PTR, Upper Saddle River (1978)
Apache Ivy. http://ant.apache.org/ivy/
Apache Maven. http://maven.apache.org/
Apache Buildr. http://buildr.apache.org/
Kennedy, K., Allen, J.R.: Optimizing Compilers for Modern Architectures: a Dependence-based Approach. Morgan Kaufmann, San Francisco (2002)
Blair-Chappell, S., Stokes, A.: Parallel Programming with Intel Parallel Studio XE. Wiley, Hoboken (2012)
Chapman, B., Jost, G., Van Der Pas, R.: Using OpenMP: Portable Shared Memory Parallel Programming. The MIT Press, Cambridge (2007)
Amarasinghe, S.P., Anderson, J.M., Lam, M.S., Tseng, C.W.: An overview of the suif compiler for scalable parallel machines. In: PPSC (1995)
Huang, L., Eachempati, D., Hervey, M.W., Chapman, B.: Exploiting global optimizations for OpenMP programs in the OpenUH compiler. SIGPLAN Not. 44(4), 289–290 (2009)
Kusano, K., Satoh, S., Sato, Mitsuhisa: Performance evaluation of the omni openmp compiler. In: Valero, Mateo, Joe, Kazuyuki, Kitsuregawa, Masaru, Tanaka, Hatsukazu (eds.) ISHPC 2000. LNCS, vol. 1940, pp. 403–414. Springer, Heidelberg (2000)
Tinetti, F.G., Méndez, M.: Fortran legacy software: source code update and possible parallelisation issues. SIGPLAN Fortran Forum 31(1), 5–22 (2012)
Ventroux, N. et al.: SESAM/Par4All: a tool for joint exploration of mpsoc architectures and dynamic dataflow code generation. In: Proceedings of Workshop on Rapid Simulation and Performance Evaluation: Methods and Tools, pp. 9–16. ACM, New York (2012)
Tournavitis, G., Wang, Z., Franke, B., O’Boyle, M.F.P.: Towards a holistic approach to auto-parallelization: integrating profile-driven parallelism detection and machine-learning based mapping. ACM SIGPLAN Not. 44(6), 177–187 (2009)
Oancea, C.E., Rauchwerger, L.: Logical inference techniques for loop parallelization. In: 33rd ACM SIGPLAN Conference on Programming Language Design and Implementation, pp. 509–520. ACM, New York (2012)
Liao, S.-W. et al.: SUIF Explorer: an interactive and interprocedural parallelizer. In: 7th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, pp. 37–48. ACM, New York (1999)
Ishihara, M., Honda, H.: Development and implementation of an interactive parallelization assistance tool for OpenMP: iPat/OMP. IEICE Trans. Inf. Syst. 89(2), 399–407 (2006)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Ivanenko, P.A., Doroshenko, A.Y., Zhereb, K.A. (2014). TuningGenie: Auto-Tuning Framework Based on Rewriting Rules. In: Ermolayev, V., Mayr, H., Nikitchenko, M., Spivakovsky, A., Zholtkevych, G. (eds) Information and Communication Technologies in Education, Research, and Industrial Applications. ICTERI 2014. Communications in Computer and Information Science, vol 469. Springer, Cham. https://doi.org/10.1007/978-3-319-13206-8_7
Download citation
DOI: https://doi.org/10.1007/978-3-319-13206-8_7
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-13205-1
Online ISBN: 978-3-319-13206-8
eBook Packages: Computer ScienceComputer Science (R0)