Profiling Non-numeric OpenSHMEM Applications with the TAU Performance System
- John LinfordAffiliated withParaTools Inc.
- , Tyler A. SimonAffiliated withUniversity of Maryland Baltimore CountyParaTools Inc.
- , Sameer ShendeAffiliated withParaTools Inc.University of Oregon
- , Allen D. MalonyAffiliated withParaTools Inc.University of Oregon
The recent development of a unified SHMEM framework, OpenSHMEM, has enabled further study in the porting and scaling of applications that can benefit from the SHMEM programming model. This paper focuses on non-numerical graph algorithms, which typically have a low FLOPS/byte ratio. An overview of the space and time complexity of Kruskal’s and Prim’s algorithms for generating a minimum spanning tree (MST) is presented, along with an implementation of Kruskal’s algorithm that uses OpenSHEM to generate the MST in parallel without intermediate communication. Additionally, a procedure for applying the TAU Performance System to OpenSHMEM applications to produce indepth performance profiles showing time spent in code regions, memory access patterns, and network load is presented. Performance evaluations from the Cray XK7 “Titan” system at Oak Ridge National Laboratory and a 48 core shared memory system at University of Maryland, Baltimore County are provided.
- Profiling Non-numeric OpenSHMEM Applications with the TAU Performance System
- Book Title
- OpenSHMEM and Related Technologies. Experiences, Implementations, and Tools
- Book Subtitle
- First Workshop, OpenSHMEM 2014, Annapolis, MD, USA, March 4-6, 2014. Proceedings
- pp 105-119
- Print ISBN
- Online ISBN
- Series Title
- Lecture Notes in Computer Science
- Series Volume
- Series ISSN
- Springer International Publishing
- Copyright Holder
- Springer International Publishing Switzerland
- Additional Links
- Industry Sectors
- eBook Packages
- Editor Affiliations
- 16. Oak Ridge National Laboratory
- Author Affiliations
- 18. ParaTools Inc., USA
- 17. University of Maryland Baltimore County, USA
- 19. University of Oregon, USA
To view the rest of this content please follow the download PDF link above.