Using Blue Gene/P and GPUs to Accelerate Computations in the EULAG Model

  • Roman Wyrzykowski
  • Krzysztof Rojek
  • Łukasz Szustak
Part of the Lecture Notes in Computer Science book series (LNCS, volume 7116)


EULAG (Eulerian/semi-Lagrangian fluid solver) is an established computational model developed by the group headed by Piotr K. Smolarkiewicz for simulating thermo-fluid flows across a wide range of scales and physical scenarios. This paper presents perspectives of the EULAG parallelization based on the MPI, OpenMP, and OpenCL standards. We focus on development of computational kernels of the EULAG model. They consist of the most time-consuming calculations of the model, which are: laplacian algorithm (laplc) and multidimensional positive definite advection transport algorithm (MPDATA).

The first challenge of our work was parallelization of the laplc subroutine using MPI across nodes and OpenMP within nodes, on the BlueGene/P supercomputer located in the Bulgarian Supercomputing Center. The second challenge was to accelerate computations of the Eulag model using modern GPUs. We discuss the scalability issue for the OpenCL implementation of the linear part of MPDATA on ATI Radeon HD 5870 GPU with AMD Phenom II X4 CPU, and NVIDIA Tesla C1060 GPU with AMD Phenom II X4 CPU.


Global Memory EULAG Model Multicore Architecture Modern GPUs Peak Bandwidth 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    AMD Corporation: ATI Radeon HD 5870 Feature Summary,
  2. 2.
    Dokken, T., Hagen, T.R., Hjelmervik, J.M.: An Introduction to General-Purpose Computing on Programmable Graphics Hardware. In: Geometric Modelling, Numerical Simulation, and Optimization: Applied Mathematics at SINTEF, pp. 123–161. Springer, Heidelberg (2007)CrossRefGoogle Scholar
  3. 3.
    Eulag Research Model for Geophysical Flows,
  4. 4.
    IBM Blue Gene Team: Overview of the IBM Blue Gene/P project. IBM Journal of Research and Development 52, 199–220 (2008)Google Scholar
  5. 5.
    Khronos OpenCL Working Group: The OpenCL C++ Wrapper API,
  6. 6.
    Khronos OpenCL Working Group: The OpenCL Specification,
  7. 7.
    Lindholm, E., Nickolls, J., Oberman, S., Montrym, J.: NVIDIA Tesla: A Unified Graphics and Computing Architecture. IEEE Micro 28, 39–55 (2008)CrossRefGoogle Scholar
  8. 8.
    Smolarkiewicz, P., Szmelter, J.: MPDATA: An edge-based unstructured-grid formulation. Elsevier Journal of Computational Physics 206, 624–649 (2005)zbMATHCrossRefGoogle Scholar
  9. 9.
    Sviercoski, R., Winter, C., Warrick, A.: Analytical approximation for the generalized Laplace equation with step function coefficient. J. Appl. Math. 68, 1268–1281 (2008)MathSciNetzbMATHGoogle Scholar
  10. 10.
    Tsuchiyama, R., Nakamura, N., Iizuka, T., Asahara, A., Miki, S.: The OpenCL Programming Book. Fixstars Corporation (2010)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2012

Authors and Affiliations

  • Roman Wyrzykowski
    • 1
  • Krzysztof Rojek
    • 1
  • Łukasz Szustak
    • 1
  1. 1.Czestochowa University of TechnologyCzestochowaPoland

Personalised recommendations