Apex-Map: A Synthetic Scalable Benchmark Probe to Explore Data Access Performance on Highly Parallel Systems
With the increasing gap between processor, memory, and interconnect speed, the performances of scientific applications on high performance computing systems have become dominated by the ability to move global data. However, many benchmarks in the field of high performance computing focus on measuring the achieved CPU speed in MFlop/s. In this paper, we introduced a novel benchmark, Apex-Map, which focuses on global data movement and measures how fast global data can be fed into computational units. Apex-Map is a parameterized synthetic performance probe and integrates concepts for temporal and spatial locality into its design. By measuring the Apex-Map performance for a whole range of temporal and spatial localities performance surfaces can be generated which can be used to study the characteristics of the computational platforms and which are useful for performance comparison. Results on a vector platform and two superscalar platforms clearly reflect the design differences between these two types of systems.
KeywordsSpatial Locality Temporal Locality Performance Surface Performance Ratio Memory Bandwidth
Unable to display preview. Download preview PDF.
- 2.STREAM: Sustainable Memory Bandwidth in High Performance Computers, http://www.cs.virginia.edu/stream/
- 3.HPC Challenge Benchmark, http://icl.cs.utk.edu/hpcc/
- 4.Apex-Map: Application Characterization-Memory Access Probe, http://ftg.lbl.gov
- 5.NAS Parallel Benchmarks, http://www.nas.nasa.gov/Software/NPB/
- 6.SPEC, http://www.spec.org/
- 7.Strohmaier, E., Shan, H.: Architecture Independent Performance Characterization and Benchmarking for Scientific Applications. In: International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems. Volendam, The Netherlands (October 2004)Google Scholar
- 8.Pallas MPI Benchmarks, http://www.pallas.com/e/products/pmb/