Improving the Scalability of Performance Evaluation Tools

  • Conference paper

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 7134)

Abstract

Performance evaluation tools play an important role in helping to understand application performance, diagnose performance problems, and guide tuning decisions on modern HPC systems. Tools to observe parallel performance must evolve to keep pace with the ever-increasing complexity of these systems. In this paper, we describe our experience in building novel tools and techniques in the TAU Performance System® to observe application performance effectively and efficiently at scale. We describe the extensions to TAU that contend with the large data volumes associated with increasing core counts. These changes include new instrumentation choices, efficient handling of disk I/O operations in the measurement layer, and strategies for visualization of performance data at scale in TAU’s analysis layer, among others. We also describe some techniques that allow us to fully characterize the performance of applications running on hundreds of thousands of cores.



Editor information

Kristján Jónasson

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Shende, S.S., Malony, A.D., Morris, A. (2012). Improving the Scalability of Performance Evaluation Tools. In: Jónasson, K. (eds) Applied Parallel and Scientific Computing. PARA 2010. Lecture Notes in Computer Science, vol 7134. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-28145-7_43

  • DOI: https://doi.org/10.1007/978-3-642-28145-7_43

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-28144-0

  • Online ISBN: 978-3-642-28145-7

  • eBook Packages: Computer Science, Computer Science (R0)
