Scalable Detection of MPI-2 Remote Memory Access Inefficiency Patterns
- Cite this paper as:
- Hermanns MA., Geimer M., Mohr B., Wolf F. (2009) Scalable Detection of MPI-2 Remote Memory Access Inefficiency Patterns. In: Ropo M., Westerholm J., Dongarra J. (eds) Recent Advances in Parallel Virtual Machine and Message Passing Interface. EuroPVM/MPI 2009. Lecture Notes in Computer Science, vol 5759. Springer, Berlin, Heidelberg
Wait states in parallel applications can be identified by scanning event traces for characteristic patterns. In our earlier work, we have defined such patterns for mpi-2 one-sided communication, although still based on a trace-analysis scheme with limited scalability. Taking advantage of a new scalable trace-analysis approach based on a parallel replay, which was originally developed for mpi-1 point-to-point and collective communication, we show how wait states in one-sided communications can be detected in a more scalable fashion. We demonstrate the scalability of our method and its usefulness for the optimization cycle with applications running on up to 8,192 cores.
Keywordsmpi-2 remote memory access performance analysis scalability pattern search
Unable to display preview. Download preview PDF.