Abstract.
An efficient parallel iterative method for the finite element method has been developed for symmetric multiprocessor (SMP) cluster architectures with vector processors, such as the Earth Simulator. The method is based on a three-level hybrid parallel programming model: message passing for inter-SMP-node communication, OpenMP loop directives for intra-SMP-node parallelization, and vectorization within each processing element (PE). Simple 3D linear elastic problems with more than 2.2 × 10^9 DOF have been solved using the 3 × 3 block ICCG(0) method with additive Schwarz domain decomposition and PDJDS/CM-RCM reordering on 176 nodes of the Earth Simulator, achieving a performance of 3.80 TFLOPS.
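The solver named in the abstract, conjugate gradients preconditioned by incomplete Cholesky factorization with zero fill-in (ICCG(0)) applied domain by domain, can be illustrated with a minimal serial sketch. This is not the paper's implementation: it assumes scalar entries instead of 3 × 3 blocks, non-overlapping domains (so the additive Schwarz step reduces to independent local solves), dense Python lists, and no PDJDS/CM-RCM reordering, message passing, OpenMP, or vectorization. The helper names (`poisson2d`, `ic0`, `localized_ic0`, `pcg`) and the 2D Poisson test matrix are hypothetical stand-ins chosen only for illustration.

```python
import math

def poisson2d(n):
    """Dense 5-point Laplacian on an n x n grid (stand-in SPD test matrix)."""
    size = n * n
    A = [[0.0] * size for _ in range(size)]
    for i in range(n):
        for j in range(n):
            k = i * n + j
            A[k][k] = 4.0
            for di, dj in ((-1, 0), (1, 0), (0, -1), (0, 1)):
                if 0 <= i + di < n and 0 <= j + dj < n:
                    A[k][(i + di) * n + (j + dj)] = -1.0
    return A

def ic0(A):
    """IC(0): Cholesky-like factor L restricted to the nonzero pattern of tril(A)."""
    n = len(A)
    L = [[A[i][j] if j <= i else 0.0 for j in range(n)] for i in range(n)]
    keep = [[L[i][j] != 0.0 for j in range(n)] for i in range(n)]
    for k in range(n):
        L[k][k] = math.sqrt(L[k][k])
        for i in range(k + 1, n):
            if keep[i][k]:
                L[i][k] /= L[k][k]
        for j in range(k + 1, n):
            if not keep[j][k]:
                continue
            for i in range(j, n):
                # Update only entries already in the pattern: no fill-in.
                if keep[i][j] and keep[i][k]:
                    L[i][j] -= L[i][k] * L[j][k]
    return L

def solve_llt(L, r):
    """Solve (L L^T) z = r by forward, then backward substitution."""
    n = len(L)
    y = [0.0] * n
    for i in range(n):
        y[i] = (r[i] - sum(L[i][j] * y[j] for j in range(i))) / L[i][i]
    z = [0.0] * n
    for i in reversed(range(n)):
        z[i] = (y[i] - sum(L[j][i] * z[j] for j in range(i + 1, n))) / L[i][i]
    return z

def localized_ic0(A, ndomains):
    """Factor each domain's diagonal block independently (assumes ndomains divides n)."""
    n = len(A)
    size = n // ndomains
    factors = []
    for d in range(ndomains):
        lo, hi = d * size, (d + 1) * size
        block = [row[lo:hi] for row in A[lo:hi]]
        factors.append((lo, hi, ic0(block)))

    def apply(r):
        z = [0.0] * n
        # Each domain solve touches only local data; in a distributed code
        # this loop would correspond to one domain per process.
        for lo, hi, L in factors:
            z[lo:hi] = solve_llt(L, r[lo:hi])
        return z

    return apply

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def matvec(A, x):
    return [dot(row, x) for row in A]

def pcg(A, b, apply_M, tol=1e-8, max_iter=500):
    """Preconditioned conjugate gradients; returns (solution, iterations used)."""
    x = [0.0] * len(b)
    r = list(b)
    z = apply_M(r)
    p = list(z)
    rz = dot(r, z)
    bnorm = math.sqrt(dot(b, b))
    for it in range(1, max_iter + 1):
        Ap = matvec(A, p)
        alpha = rz / dot(p, Ap)
        x = [xi + alpha * pi for xi, pi in zip(x, p)]
        r = [ri - alpha * qi for ri, qi in zip(r, Ap)]
        if math.sqrt(dot(r, r)) <= tol * bnorm:
            return x, it
        z = apply_M(r)
        rz_new = dot(r, z)
        beta = rz_new / rz
        p = [zi + beta * pi for zi, pi in zip(z, p)]
        rz = rz_new
    return x, max_iter
```

For example, `pcg(poisson2d(8), [1.0] * 64, localized_ic0(poisson2d(8), 4))` solves a 64-unknown problem split into four domains of two grid rows each. Because every domain is factored and solved without reference to its neighbours, the preconditioner needs no communication, which is the property that makes this style of localized preconditioning attractive on distributed-memory machines; the trade-off is that coupling across domain boundaries is ignored, so convergence generally degrades as the number of domains grows.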
© 2003 Springer-Verlag Berlin Heidelberg
Cite this paper
Nakajima, K. (2003). OpenMP/MPI Hybrid vs. Flat MPI on the Earth Simulator: Parallel Iterative Solvers for Finite Element Method. In: Veidenbaum, A., Joe, K., Amano, H., Aiso, H. (eds) High Performance Computing. ISHPC 2003. Lecture Notes in Computer Science, vol 2858. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-39707-6_44
DOI: https://doi.org/10.1007/978-3-540-39707-6_44
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-20359-9
Online ISBN: 978-3-540-39707-6
eBook Packages: Springer Book Archive