Abstract
Numerical algorithms in the seismic industry are among the most challenging areas of High Performance Computing and require an ever growing number of compute power and main memory. The Global Address Space Programming Interface (GPI) provides a model to program distributed memory clusters based on RDMA transfers in a Partitioned Global Address Space (PGAS). Based on GPI a generic straight forward parallelization of an Anisotropic Diffusion Filter (ADF) is implemented as an example of an Explicit Finite Difference scheme. Key features of the implementation are a complete overlay of the computation with network data transfers, a dynamic load distribution scheme and the usage of one-sided communication patterns throughout the algorithm to orchestrate read and write accesses to the image data. Synchronization points between the compute nodes or barriers are completely avoided. Benchmarks on a cluster with 260 nodes and 1040 cores reveal a constant communication overhead of less than 6% of the total computation time. This figure is still met if the compute nodes in the cluster differ significantly in performance capacity.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Bruhn, A., Jakob, T., Fischer, M., Kohlberger, T., Weickert, J., Brüning, U., Schnörr, C.: High performance cluster computing with 3-D nonlinear diffusion filters. Real Time Imag. 10(1), 41–51 (2004). doi: 10.1016/j.rti.2003.12.002
Machado, R., Lojewski, C.: The Fraunhofer virtual machine: a communication library and runtime system based on the RDMA model. Comput. Sci. Res. Dev. 23(3–4), 125–132 (2009). doi: 10.1007/s00450-009-0088-2
Weickert, J.: Theoretical foundations of anisotropic diffusion in image processing. Computing, Suppl 11, 221–236 (1996)
Weickert, J.: Anisotropic diffusion in image processing. Teubner (1998)
Weickert, J.: Coherence-enhancing difusion filtering. Internationals Journal of Computer Vision 31(2/3), 111–127 (1999)
Acknowledgements
We like to thank Joachim Weickert, University of Saarbrücken for providing a single threaded reference implementation of the PSPro Edge- and Coherence-Enhancing Anisotropic Diffusion Filter.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kühn, M. (2011). Parallelization of an Edge- and Coherence-Enhancing Anisotropic Diffusion Filter with a Distributed Memory Approach Based on GPI. In: Bischof, C., Hegering, HG., Nagel, W., Wittum, G. (eds) Competence in High Performance Computing 2010. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-24025-6_9
Download citation
DOI: https://doi.org/10.1007/978-3-642-24025-6_9
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-24024-9
Online ISBN: 978-3-642-24025-6
eBook Packages: Computer ScienceComputer Science (R0)