Scalasca

Wolf, Felix

doi:10.1007/978-0-387-09766-4_61

Felix Wolf²

101 Accesses

Synonyms

The predecessor of Scalasca, from which Scalasca evolved, is known by the name of KOJAK.

Definition

Scalasca is an open-source software tool that supports the performance optimization of parallel programs by measuring and analyzing their runtime behavior. The analysis identifies potential performance bottlenecks – in particular those concerning communication and synchronization – and offers guidance in exploring their causes. Scalasca targets mainly scientific and engineering applications based on the programming interfaces MPI and OpenMP, including hybrid applications based on a combination of the two. The tool has been specifically designed for use on large-scale systems including IBM Blue Gene and Cray XT, but is also well suited for small- and medium-scale HPC platforms.

Discussion

Introduction

Driven by growing application requirements and accelerated by current trends in microprocessor design, the number of processor cores on modern supercomputers is expanding from...

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 1,600.00; Price excludes VAT (USA)

Hardcover Book: USD 1,799.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Bibliography

Becker D, Rabenseifner R, Wolf F, Linford J (2009) Scalable timestamp synchronization for event traces of message-passing applications. Parallel Comput 35(12):595–607
MathSciNet Google Scholar
Becker D, Wolf F, Frings W, Geimer M, Wylie BJN, Mohr B (2007) Automatic trace-based performance analysis of metacomputing applications. In: Proceedings of the international parallel and distributed processing symposium (IPDPS), Long Beach, CA, USA. IEEE Computer Society, Washington, DC
Google Scholar
Böhme D, Geimer M, Wolf F, Arnold L (2010) Identifying the root causes of wait states in large-scale parallel applications. In: Proceedings of the 39th international conference on parallel processing (ICPP), San Diego, CA, USA. IEEE Computer Society, Washington, DC, pp 90–100
Google Scholar
Frings W, Wolf F, Petkov V (2009) Scalable massively parallel I/O to task-local files. In: Proceedings of the ACM/IEEE conference on supercomputing (SC09), Portland, OR, USA, Nov 2009
Google Scholar
Geimer M, Wolf F, Wylie BJN, Ábrahám E, Becker D, Mohr B (2010) The Scalasca performance toolset architecture. Concurr Comput Pract Exper 22(6):702–719
Google Scholar
Geimer M, Wolf F, Wylie BJN, Mohr B (2009) A scalable tool architecture for diagnosing wait states in massively-parallel applications. Parallel Comput 35(7):375–388
Google Scholar
Gibbon P, Frings W, Dominiczak S, Mohr B (2006) Performance analysis and visualization of the n-body tree code PEPC on massively parallel computers. In: Proceedings of the conference on parallel computing (ParCo), Málaga, Spain, Sept 2005 (NIC series), vol 33. John von Neumann-Institut für Computing, Jülich, pp 367–374
Google Scholar
Hayes JC, Norman ML, Fiedler RA, Bordner JO, Li PS, Clark SE, ud-Doula A, Mac Low M-M (2006) Simulating radiating and magnetized flows in multiple dimensions with ZEUS-MP. Astrophys J Suppl 165(1):188–228
Google Scholar
Hermanns M-A, Geimer M, Mohr B, Wolf F (2009) Scalable detection of MPI-2 remote memory access inefficiency patterns. In: Proceedings of the 16th European PVM/MPI users’ group meeting (EuroPVM/MPI), Espoo, Finland. Lecture notes in computer science, vol 5759. Springer, Berlin, pp 31–41
Google Scholar
Hermanns M-A, Geimer M, Wolf F, Wylie BJN (2009) Verifying causality between distant performance phenomena in large-scale MPI applications. In Proceedings of the 17th Euromicro international conference on parallel, distributed, and network-based processing (PDP), Weimar, Germany. IEEE Computer Society, Washington, DC, pp 78–84
Google Scholar
Jülich Supercomputing Centre and German Research School for Simulation Sciences. Scalasca parallel performance analysis toolset documentation (performance properties). http://www.scalasca.org/download/documentation/
Meira W Jr, LeBlanc TJ, Poulos A (1996) Waiting time analysis and performance visualization in Carnival. In: Proceedings of the SIGMETRICS symposium on parallel and distributed tools (SPDT’96), Philadelphia, PA, USA. ACM
Google Scholar
Mohr B, Malony A, Shende S, Wolf F (2002) Design and prototype of a performance tool interface for OpenMP. J Supercomput 23(1):105–128
MATH Google Scholar
Song F, Wolf F, Bhatia N, Dongarra J, Moore S (2004) An algebra for cross-experiment performance analysis. In: Proceedings of the international conference on parallel processing (ICPP), Montreal, Canada. IEEE Computer Society, Washington, DC, pp 63–72
Google Scholar
Szebenyi Z, Gamblin T, Schulz M, de Supinski BR, Wolf F, Wylie BJN (2011) Reconciling sampling and direct instrumentation for unintrusive call-path profiling of MPI programs. In: Proceedings of the international parallel and distributed processing symposium (IPDPS), Anchorage, AK, USA. IEEE Computer Society, Washington, DC
Google Scholar
Szebenyi Z, Wolf F, Wylie BJN (2009) Space-efficient time-series call-path profiling of parallel applications. In: Proceedings of the ACM/IEEE conference on supercomputing (SC09), Portland, OR, USA, Nov 2009
Google Scholar
Szebenyi Z, Wylie BJN, Wolf F (2008) SCALASCA parallel performance analyses of SPEC MPI2007 applications. In: Proceedings of the 1st SPEC international performance evaluation workshop (SIPEW), Darmstadt, Germany. Lecture notes in computer science, vol 5119. Springer, Berlin, pp 99–123
Google Scholar
Szebenyi Z, Wylie BJN, Wolf F (2009) Scalasca parallel performance analyses of PEPC. In: Proceedings of the workshop on productivity and performance (PROPER) in conjunction with Euro-Par, Las Palmas de Gran Canaria, Spain, August 2008. Lecture notes in computer science, vol 5415. Springer, Berlin, pp 305–314
Google Scholar
Wolf F (2003) Automatic Performance Analysis on Parallel Computers with SMP Nodes. PhD thesis, RWTH Aachen, Forschungszentrum Jülich. ISBN 3-00-010003-2
Google Scholar
Wolf F, Mohr B (2001) Specifying performance properties of parallel applications using compound events. Parallel Distrib Comput Pract 4(3):301–317
Google Scholar
Wolf F, Mohr B (2003) Automatic performance analysis of hybrid MPI/OpenMP applications. J Syst Archit 49(10–11):421–439
Google Scholar
Wolf F, Mohr B, Dongarra J, Moore S (2007) Automatic analysis of inefficiency patterns in parallel applications. Concurr Comput Pract Exper 19(11):1481–1496
Google Scholar
Wylie BJN, Geimer M, Mohr B, Böhme D, Szebenyi Z, Wolf F (2010) Large-scale performance analysis of Sweep3D with the Scalasca toolset. Parallel Process Lett 20(4):397–414
MathSciNet Google Scholar
Wylie BJN, Geimer M, Wolf F (2008) Performance measurement and analysis of large-scale parallel applications on leadership computing systems. Sci Program 16(2–3):167–181
Google Scholar

Download references

Author information

Authors and Affiliations

German Research School for Simulation Sciences Jlich Supercomputing Centre RWTH Aachen University, 52056, Aachen, Germany
Felix Wolf (Prof.)

Authors

Felix Wolf
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

University of Illinois at Urbana-Champaign, Urbana, IL, USA
David Padua

Rights and permissions

Reprints and permissions

Copyright information

About this entry

Cite this entry

Wolf, F. (2011). Scalasca. In: Padua, D. (eds) Encyclopedia of Parallel Computing. Springer, Boston, MA. https://doi.org/10.1007/978-0-387-09766-4_61

Download citation

DOI: https://doi.org/10.1007/978-0-387-09766-4_61
Publisher Name: Springer, Boston, MA
Print ISBN: 978-0-387-09765-7
Online ISBN: 978-0-387-09766-4
eBook Packages: Computer ScienceReference Module Computer Science and Engineering

Publish with us

Policies and ethics