Toward Optimizing Enzo, an AMR Cosmology Application
Enzo is a parallel hybrid SAMR / N-body code designed to simulate cosmological structure formation. This paper describes our approach to gathering and visualizing performance information from Enzo, which will be used to direct our subsequent modeling and optimization effort. Understanding the performance of AMR applications on distributed memory architectures is challenging, owing in part to the dynamic multilevel data structures and variety of communication patterns involved. To facilitate the task of measuring, modeling, and optimizing Enzo’s performance, we are developing the Enzo Performance Monitoring System (EPMS). We review some existing performance tools, describe the EPMS, and show some preliminary performance data obtained using the EPMS.
KeywordsLoad Balance Loaded Processor User Counter Storage Access High Performance Computing Application
Unable to display preview. Download preview PDF.
- 1.G. Bryan and M.L. Norman, A Hybrid AMR Application for Cosmology and Astrophysics, in Workshop on Structured Adaptive Mesh Refinement Grid Methods, N. Chrisochoides (ed), March 1997Google Scholar
- 3.Daniel A. Reed and Ruth A. Aydt, Tools for Performance Tuning and Debugging, in Sourcebook of Parallel Computing, J. Dongarra (ed), I. Foster, G. Fox (ed), W. Gropp, K. Kennedy, L. Torczon and A. White (ed), Morgan Kaufman Publishers, 2003.Google Scholar
- 4.O. Zaki, E. Lusk, W. Gropp, and D. Swider, Toward Scalable Performance Visualization with Jumpshot, Int’l J. of High Performance Computing Applications, 1999.Google Scholar
- 5.Luiz DeRose and Daniel A. Reed, SvPablo: A Multi-Language Architecture-Independent Performance Analysis System, Proceedings of the International Conference on Parallel Processing (ICPP’99), Fukushima, Japan, September 1999.Google Scholar
- 7.Z. Lan, V. Taylor, and G. Bryan, Dynamic Load Balancing of SAMR applications on Distributed Systems, Journal of Scientic Programming, Vol 10(4), pp. 319–328, 2002.Google Scholar
- 8.Lorie M. Liebrock, Using Problem Topology in Parallelization, Rice University Technical Report CRPC-TR94477-S, September 1994.Google Scholar