A Dynamic Tracing Mechanism for Performance Analysis of OpenMP Applications

  • Jordi Caubet
  • Judit Gimenez
  • Jesus Labarta
  • Luiz DeRose
  • Jeffrey Vetter
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 2104)

Abstract

In this paper we present OMPtrace, a dynamic tracing mechanism that combines traditional tracing with dynamic instrumentation and access to hardware performance counters to create a powerful tool for performance analysis and optimization of OpenMP applications. Performance data collected with OMPtrace is used as input to the Paraver visualization tool for detailed analysis of the parallel behavior of the application. We demonstrate the usefulness of OMPtrace and the power of Paraver for tuning OpenMP applications with a case study running the US DOE ASCI Sweep3D benchmark on the IBM SP system at the Lawrence Livermore National Laboratory.

Keywords

Critical Section; Lawrence Livermore National Laboratory; Large Problem Size; Parallel Loop; Small Problem Size
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Copyright information

© Springer-Verlag Berlin Heidelberg 2001

Authors and Affiliations

  • Jordi Caubet (1)
  • Judit Gimenez (1)
  • Jesus Labarta (1)
  • Luiz DeRose (2)
  • Jeffrey Vetter (3)

  1. European Center for Parallelism of Barcelona, Department of Computer Architecture, Technical University of Catalonia, Barcelona, Spain
  2. Advanced Computing Technology Center, IBM T. J. Watson Research Center, Yorktown Heights, USA
  3. Center for Applied Scientific Computing, Lawrence Livermore National Laboratory, Livermore, USA
