Computational Efficiency of Parallel Unstructured Finite Element Simulations

Neumann, Malte; Küttler, Ulrich; Tiyyagura, Sunil Reddy; Wall, Wolfgang A.; Ramm, Ekkehard

doi:10.1007/3-540-35074-8_7

Malte Neumann⁵,
Ulrich Küttler⁶,
Sunil Reddy Tiyyagura⁷,
Wolfgang A. Wall⁶ &
…
Ekkehard Ramm⁵

481 Accesses

Abstract

In this paper we address various efficiency aspects of finite element (FE) simulations on vector computers. Especially for the numerical simulation of large scale Computational Fluid Dynamics (CFD) and Fluid-Structure Interaction (FSI) problems efficiency and robustness of the algorithms are two key requirements.

In the first part of this paper a straightforward concept is described to increase the performance of the integration of finite elements in arbitrary, unstructured meshes by allowing for vectorization. In addition the effect of different programming languages and different array management techniques on the performance will be investigated.

Besides the element calculation, the solution of the linear system of equations takes a considerable part of computation time. Using the jagged diagonal format (JAD) for the sparse matrix, the average vector length can be increased. Block oriented computation schemes lead to considerably less indirect addressing and at the same time packaging more instructions. Thus, the overall performance of the iterative solver can be improved.

The last part discusses the input and output facility of parallel scientific software. Next to efficiency the crucial requirements for the IO subsystem in a parallel setting are scalability, flexibility and long term reliability.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Hardcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Behr, M., Pressel, D.M., Sturek, W.B.: Comments on CFD Code Performance on Scalable Architectures. Computer Methods in Applied Mechanics and Engineering 190 (2000) 263–277
Article MATH Google Scholar
Oliker, L., Canning, A., Carter, J., Shalf, J., Skinner, D., Ethier, S., Biswas, R., Djomehri, J., van der Wijngaart, R.: Evaluation of Cache-based Superscalar and Cacheless Vector Architectures for Scientific Computations. In: Proceedings of the ACM/IEEE Supercomputing Conference 2003, Phoenix, Arizona, USA. (2003)
Google Scholar
Veldhuizen, T.L.: Scientific Computing: C++ Versus Fortran: C++ has more than caught up. Dr. Dobb’s Journal of Software Tools 22 (1997) 34, 36–38, 91
Google Scholar
Veldhuizen, T.L., Jernigan, M.E.: Will C++ be Faster than Fortran? In: Proceedings of the 1st International Scientific Computing in Object-Oriented Parallel Environments (ISCOPE’97). Lecture Notes in Computer Science, Springer-Verlag (1997)
Google Scholar
Pohl, T., Deserno, F., Thürey, N., Rüde, U., Lammers, P., Wellein, G., Zeiser, T.: Performance Evaluation of Parallel Large-Scale Lattice Boltzmann Applications on Three Supercomputing Architectures. In: Proceedings of the ACM/IEEE Supercomputing Conference 2004, Pittsburgh, USA. (2004)
Google Scholar
Ethier, C., Steinman, D.: Exact Fully 3d Navier Stokes Solution for Benchmarking. International Journal for Numerical Methods in Fluids 19 (1994) 369–375
Article MATH Google Scholar
Wall, W.A.: Fluid-Struktur-Interaktion mit stabilisierten Finiten Elementen. phdthesis, Institut für Baustatik, Universität Stuttgart (1999)
Google Scholar
D’Azevedo, E.F., Fahey, M.R., Mills, R.T.: Vectorized Sparse Matrix Multiply for Compressed Row Storage Format. In: Proceedings of the 5th International Conference on Computational Science, Atlanta, USA. (2005)
Google Scholar
Tuminaro, R.S., Shadid, J.N., Hutchinson, S.A.: Parallel Sparse Matrix Vector Multiply Software for Matrices with Data Locality. Concurrency: Practice and Experience 10–3 (1998) 229–247
Article Google Scholar
Nakajima, K.: Parallel Iterative Solvers of GeoFEM with Selective Blocking Preconditioning for Nonlinear Contact Problems on the Earth Simulator. GeoFEM 2003-005, RIST/Tokyo (2003)
Google Scholar
National Center for Supercomputing Applications. University of Illinois: Hierarchical Data Format. http://hdf.ncsa.uiuc.edu (2005)
Google Scholar
Unidata Community: Network Common Data Form. http://my.unidata.ucar.edu/content/software/netcdf/index.html (2005)
Google Scholar
Hunt, A., Thomas, D.: The Pragmatic Programmer: From Journeyman to Master. Addison-Wesley, Reading, MA (2000)
Google Scholar
Aho, A.V., Sethi, R., Ullman, J.D.: Compilers. Addison-Wesley, Reading, MA (1986)
Google Scholar
Kennedy, J., Behr, M., Kalro, V., Tezduyar, T.: Implementation of implicit finite element methods for incompressible flows on the CM-5. Computer Methods in Applied Mechanics and Engineering 119 (1994) 95–111
Article MATH Google Scholar
Guo, M., Pan, Y.: Improving Communication Scheduling for Array Redistribution. Journal of Parallel and Distributed Computing (5)65 (2005) 553–563
Article MATH MathSciNet Google Scholar

Download references

Author information

Authors and Affiliations

Institute of Structural Mechanics, University of Stuttgart, Pfaffenwaldring 7, D-70550, Stuttgart, Germany
Malte Neumann & Ekkehard Ramm
Computational Mechanics, Technical University of Munich, Boltzmannstraße 15, D-85747, Garching, Germany
Ulrich Küttler (Chair) & Wolfgang A. Wall
High Performance Computing Center Stuttgart (HLRS), Nobelstraße 19, D-70569, Stuttgart, Germany
Sunil Reddy Tiyyagura

Authors

Malte Neumann
View author publications
You can also search for this author in PubMed Google Scholar
Ulrich Küttler
View author publications
You can also search for this author in PubMed Google Scholar
Sunil Reddy Tiyyagura
View author publications
You can also search for this author in PubMed Google Scholar
Wolfgang A. Wall
View author publications
You can also search for this author in PubMed Google Scholar
Ekkehard Ramm
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Höchstleistungsrechenzentrum Stuttgart (HLRS), Universität Stuttgart, Nobelstraße 19, 70569, Stuttgart, Germany
Michael Resch , Thomas Bönisch & Katharina Benkert , &
NEC High Performance, Europe GmbH, Prinzenallee 11, 40459, Düsseldorf, Germany
Wolfgang Bez
NEC Corporation, Nisshin-cho 1-10, 183-8501, Tokyo, Japan
Toshiyuki Furui
NEC Corporation, Shimonumabe 1753, 211-8666, Kanagawa, Japan
Yoshiki Seo

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Neumann, M., Küttler, U., Tiyyagura, S.R., Wall, W.A., Ramm, E. (2006). Computational Efficiency of Parallel Unstructured Finite Element Simulations. In: Resch, M., Bönisch, T., Benkert, K., Bez, W., Furui, T., Seo, Y. (eds) High Performance Computing on Vector Systems. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-35074-8_7

Download citation

DOI: https://doi.org/10.1007/3-540-35074-8_7
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-29124-4
Online ISBN: 978-3-540-35074-3
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)

Publish with us

Policies and ethics