Performance Analysis of Shared-Memory Parallel Applications Using Performance Properties

  • Karl Fürlinger
  • Michael Gerndt
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 3726)


Tuning parallel code can be a time-consuming and difficult task. We present our approach to automate the performance analysis of OpenMP applications that is based on the notion of performance properties. Properties are formally specified in the APART specification language (ASL) with respect to a specific data model. We describe a data model for summary (profiling) data of OpenMP applications and present performance properties based on this data model. We evaluate the usability of the properties on several example codes using our OpenMP profiler ompP to acquire the profiling data.


Performance Property Critical Section Application Developer Parallel Region Parallel Code 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Crovella, M.E., LeBlanc, T.J.: Parallel performance prediction using lost cycles analysis. In: Proceedings of the 1994 Conference on Supercomputing (SC 1994), pp. 600–609. ACM Press, New York (1994)CrossRefGoogle Scholar
  2. 2.
    Dorta, A.J., Rodríguez, C., de Sande, F., Gonzáles-Escribano, A.: The OpenMP source code repository. In: Proceedings of the 13th Euromicro Conference on Parallel, Distributed and Network-Based Processing (PDP 2005), February 2005, pp. 244–250 (2005)Google Scholar
  3. 3.
    Fahringer, T., Gerndt, M., Mohr, B., Wolf, F., Riley, G., Träff, J.L.: Knowledge specification for automatic performance analysis. APART technical report, revised edition. Technical Report FZJ-ZAM-IB-2001-08, Forschungszentrum Jülich (2001)Google Scholar
  4. 4.
    Fahringer, T., Gerndt, M., Riley, G.D., Träff, J.L.: Formalizing openMP performance properties with ASL. In: Valero, M., Joe, K., Kitsuregawa, M., Tanaka, H. (eds.) ISHPC 2000. LNCS, vol. 1940, pp. 428–439. Springer, Heidelberg (2000)CrossRefGoogle Scholar
  5. 5.
    Fahringer, T., Júnior, C.S.: Automatic search for performance problems in parallel and distributed programs by using multi-experiment analysis. In: Sahni, S.K., Prasanna, V.K., Shukla, U. (eds.) HiPC 2002. LNCS, vol. 2552, pp. 151–162. Springer, Heidelberg (2002)CrossRefGoogle Scholar
  6. 6.
    Fürlinger, K., Gerndt, M.: ompP: A profiling tool for OpenMP. In: Proceedings of the First International Workshop on OpenMP, IWOMP 2005 (2005) Accepted for publicationGoogle Scholar
  7. 7.
    Gerndt, M.: Specification of performance properties of hybrid programs on hitachi SR8000. Technical report, Lehrstuhl für Rechnertechnik und Rechnerorganisation, Institut für Informatik, Technische Universität München (2002)Google Scholar
  8. 8.
    Júnior, C.S., Fahringer, T., Geissler, M., Madsen, G., Moritsch, H.: On using aksum for semi-automatically searching of performance problems in parallel and distributed programs. In: Proceedings of the 11th Euromicro Conference on Parallel, Distributed and Network-Based Processing (PDP 2003), February 2003, pp. 385–392. IEEE Computer Society Press, Los Alamitos (2003)Google Scholar
  9. 9.
    Miller, B.P., Callaghan, M.D., Cargille, J.M., Hollingsworth, J.K., Irvin, R.B., Karavanic, K.L., Kunchithapadam, K., Newhall, T.: The Paradyn parallel performance measurement tool. IEEE Computer 28(11), 37–46 (1995)Google Scholar
  10. 10.
    Mohr, B., Malony, A.D., Hoppe, H.-C., Schlimbach, F., Haab, G., Hoeflinger, J., Shah, S.: A performance monitoring interface for OpenMP. In: Proceedings of the Fourth Workshop on OpenMP (EWOMP 2002) (September 2002)Google Scholar
  11. 11.
    Mohr, B., Malony, A.D., Shende, S.S., Wolf, F.: Towards a performance tool interface for OpenMP: An approach based on directive rewriting. In: Proceedings of the Third Workshop on OpenMP (EWOMP 2001) (September 2001)Google Scholar
  12. 12.
    Mohr, B., Träff, J.L.: Initial design of a test suite for automatic performance analysis tools. In: Proc. HIPS, pp. 77–86 (2003)Google Scholar
  13. 13.
    Morajko, A., Morajko, O., Jorba, J., Margalef, T.: Automatic performance analysis and dynamic tuning of distributed applications. Parallel Processing Letters 13(2), 169–187 (2003)CrossRefMathSciNetGoogle Scholar
  14. 14.
  15. 15.
    Roth, P.C., Arnold, D.C., Miller, B.P.: MRNet: A software-based multicast/reduction network for scalable tools. In: Proceedings of the 2003 Conference on Supercomputing (SC 2003) (November 2003)Google Scholar
  16. 16.
    Süß, M., Leopold, C.: A user’s experience with parallel sorting and openmp. In: Proceedings of the Sixth Workshop on OpenMP (EWOMP 2004) (October 2004)Google Scholar
  17. 17.
    Wolf, F., Mohr, B.: Automatic performance analysis of hybrid MPI/OpenMP applications. In: Proceedings of the 11th Euromicro Conference on Parallel, Distributed and Network-Based Processing (PDP 2003), February 2003, pp. 13–22. IEEE Computer Society Press, Los Alamitos (2003)CrossRefGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2005

Authors and Affiliations

  • Karl Fürlinger
    • 1
  • Michael Gerndt
    • 1
  1. 1.Institut für Informatik, Lehrstuhl für Rechnertechnik und RechnerorganisationTechnische Universität München 

Personalised recommendations