Abstract
To achieve high parallel efficiency for the global MASNUM surface wave model, an irregular quasi-rectangular domain decomposition algorithm, together with the associated serialization of calculating points and data exchange schemes, is developed and implemented using the Message Passing Interface (MPI). The new parallel version of the surface wave model is tested on the Sunway BlueLight supercomputer at the National Supercomputing Center in Jinan. The tests cover four horizontal resolutions: 1°×1°, (1/2)°×(1/2)°, (1/4)°×(1/4)°, and (1/8)°×(1/8)°. They are performed without data input/output (IO), and the maximum number of processors used reaches 131072. The results show that the computing speed of the model increases with the number of processors at every resolution. When the number of processors is four times the base processor number, the parallel efficiency is greater than 80% at all resolutions. When the number of processors is eight times the base processor number, the parallel efficiency is greater than 80% for the 1°×1°, (1/2)°×(1/2)°, and (1/4)°×(1/4)° tests, and it is 62% for the (1/8)°×(1/8)° test using 131072 processors, which is nearly all of the processors of Sunway BlueLight. When the number of processors is 24 times the base processor number, the parallel efficiencies for the 1°×1°, (1/2)°×(1/2)°, and (1/4)°×(1/4)° tests are 72%, 62%, and 38%, respectively. The speedup and parallel efficiency indicate that the irregular quasi-rectangular domain decomposition and serialization schemes lead to high parallel efficiency and good scalability for a global numerical wave model.
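The abstract reports parallel efficiency relative to a base processor number but does not spell out the formula. The sketch below assumes the common relative definitions: speedup is the base wall-clock time divided by the time on more processors, and efficiency is that speedup divided by the processor ratio. The function names are hypothetical and are not taken from the MASNUM code; only the efficiencies (72%, 62%, 38% at 24 times the base processor number) come from the abstract.

```python
def relative_speedup(t_base: float, t_p: float) -> float:
    """Speedup relative to the base run: ratio of wall-clock times."""
    return t_base / t_p


def parallel_efficiency(t_base: float, t_p: float, n_base: int, n_p: int) -> float:
    """Relative efficiency: speedup divided by the processor ratio n_p / n_base.

    Assumed definition; the abstract does not state the formula it uses.
    """
    return relative_speedup(t_base, t_p) / (n_p / n_base)


def implied_speedup(efficiency: float, proc_ratio: float) -> float:
    """Invert the definition: speedup implied by an efficiency at a given ratio."""
    return efficiency * proc_ratio


# Efficiencies reported in the abstract at 24 times the base processor number.
reported_at_24x = {"1 deg": 0.72, "1/2 deg": 0.62, "1/4 deg": 0.38}

for resolution, eff in reported_at_24x.items():
    speedup = implied_speedup(eff, 24)
    print(f"{resolution}: implied relative speedup of about {speedup:.2f} on 24x the processors")
```

Under this assumed definition, the reported efficiencies correspond to relative speedups of roughly 17.3, 14.9, and 9.1 on 24 times as many processors, which shows how efficiency falls off faster at the finer of these three resolutions.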
Cite this article
Zhao, W., Song, Z., Qiao, F. et al. High efficient parallel numerical surface wave model based on an irregular quasi-rectangular domain decomposition scheme. Sci. China Earth Sci. 57, 1869–1878 (2014). https://doi.org/10.1007/s11430-014-4842-3
DOI: https://doi.org/10.1007/s11430-014-4842-3