Skip to main content
Log in

GPU-accelerated MART and concurrent cross-correlation for tomographic PIV

  • Research Article
  • Published:
Experiments in Fluids Aims and scope Submit manuscript

Abstract

This paper presents a novel Graphics Processing Unit (GPU)-accelerated method for large-scale data processing of tomographic particle image velocimetry. The multiplicative algebraic reconstruction technique (MART) is utilized to reconstruct three-dimensional (3D) particle fields, and cross-correlation with fast Fourier transform is used to generate the displacement vectors. The Compute Unified Device Architecture (CUDA) C programming model is used to port the velocity field reconstruction from CPU code to GPU code to improve efficiency. For similar reconstruction tasks, a particular thread grid hierarchy is designed to construct the corresponding computational kernel functions, and each task is launched in a single thread. A modified strategy of pixel batch processing is then used to manage the GPU memory access. Subsequently, the asynchronous stream concurrency is used to generate the velocity field with the GPU cuFFT library. A synthetic 3D experiment with a ring vortex is carried out to verify the accuracy and efficiency of the developed method. The parallel results agree well with the generated data and other research conclusions reported in the literature. The speed-up ratio by multi-core CPU (Intel® Xeon® Platinum 8168) parallel implementation with OpenMP converges to 2.5 × in MFG-MART and 3.0 × in cross-correlation. In contrast to a 24-core CPU implementation, a GPU (NVIDIA Tesla V100S, 32 GB) under maximum memory usage achieves an impressive speed-up ratio of over 20 × in parallel MFG-MART and 4 × in concurrent cross-correlation. The measurement of turbulent flow in a circular jet flow at Reynolds 3,000 is used to examine the efficiency promotion of the parallelized framework in real experimental settings. For the synthetic volume reconstruction of 700 × 700 × 140 voxels and cross-correlation with 413 voxels window in a 75% overlap, and the experimental volume reconstruction of 550 × 1100 × 550 voxels and cross-correlation with 323 voxels window in a 50% overlap, a frame of velocity field can be completed within 2 min in each domain.

Graphical abstract

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16

Similar content being viewed by others

Abbreviations

d :

Scalar velocity (mm/s)

D :

Diameter of circular nozzle (mm)

E(X j, Y j, Z j):

Light intensity distribution of voxels (−)

I(x i, y i):

Gray level of image pixels (−)

l :

Vortex width (mm)

N i :

Number of voxels associated with each line of sight (−)

Q :

Reconstruction quality factor (−)

R :

Distance to the voxel-center ring (voxels)

Re:

Reynolds number (−)

u, v, w :

Three Cartesian components of velocity (mm/s)

w i , j :

Weighting coefficient (−)

μ :

Scalar relaxation parameter, which must be ≤ 1 for MART

ωD/U 0 :

Normalized vorticity (−)

U 0 :

Free-stream or bulk velocity (m/s)

ART:

Algebraic reconstruction technique

CPU:

Central Processing Unit

CUDA:

Compute Unified Device Architecture

FFTW:

The Fastest Fourier Transform in the West

GPU:

Graphics Processing Unit

MART:

Multiplicative algebraic reconstruction technique

MKL:

Intel Math Kernel Library

Tomo-PIV:

Tomographic particle image velocimetry

References

Download references

Acknowledgements

The authors gratefully acknowledge the financial support from the National Natural Science Foundation of China (11725209, 12002208) and the Natural Science Foundation of Shanghai (20ZR1425700).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Chuangxin He.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Zeng, X., He, C. & Liu, Y. GPU-accelerated MART and concurrent cross-correlation for tomographic PIV. Exp Fluids 63, 91 (2022). https://doi.org/10.1007/s00348-022-03444-3

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s00348-022-03444-3

Navigation