The Art of High Performance Computing for Computational Science, Vol. 1

Techniques of Speedup and Parallelization for General Purposes

  • Masaaki Geshi

Table of contents

  1. Front Matter
    Pages i-ix
  2. Takahiro Katagiri
    Pages 1-25
  3. Takahiro Katagiri
    Pages 27-44
  4. Takahiro Katagiri
    Pages 45-59
  5. Takahiro Katagiri
    Pages 61-68
  6. Daisuke Takahashi
    Pages 137-168
  7. Hiroshi Watanabe
    Pages 169-189
  8. Shin’chi Oishi, Yusuke Morikura, Kouta Sekine, Hisayasu Kuroda, Maho Nakata
    Pages 191-213
  9. Back Matter
    Pages 215-219

About this book


This book provides basic and practical techniques of parallel computing and related methods of numerical analysis for researchers who conduct numerical calculation and simulation. Although the techniques provided in this book are field-independent, these methods can be used in fields such as physics, chemistry, biology, earth sciences, space science, meteorology, disaster prevention, and manufacturing. In particular, those who develop software code in these areas will find this book useful. The contents are suitable for graduate students and researchers in computational science rather than novices at programming or informed experts in computer science.

Starting with an introduction to the recent trends in computer architecture and parallel processing, Chapter 1 explains the basic knowledge of speedup programs with simple examples of numerical computing. Chapters 2 – 4 detail the basics of parallel programming, the message passing interface (MPI), and OpenMP and discuss hybrid parallelization techniques. Showing an actual example of adaptation, Chapter 5 gives an overview of performance tuning and communication optimizations. To deal with dense matrix calculations, Chapter 6 details the basics and practice of linear algebra calculation libraries BLAS and LAPACK, including some examples that can be easily reproduced by readers using free software. Focusing on sparse matrix calculations, Chapter 7 explains high performance algorithms for numerical linear algebra. Chapter 8 introduces the fast Fourier transform in large-scale systems from the basics. Chapter 9 explains optimization and related topics such as debug methods and version control systems. Chapter 10 discusses techniques for increasing computation accuracy as an essential topic in numerical calculation.

This is the first of the two volumes that grew out of a series of lectures in the K computer project in Japan. The second volume will focus on advanced techniques and examples of applications in materials science.


High Performance Computing HPC Parallel Processing MPI computational science numerical analysis

Editors and affiliations

  • Masaaki Geshi
    • 1
  1. 1.Osaka UniversityToyonakaJapan

Bibliographic information