Abstract
In this paper we address various efficiency aspects of finite element (FE) simulations on vector computers. Especially for the numerical simulation of large scale Computational Fluid Dynamics (CFD) and Fluid-Structure Interaction (FSI) problems efficiency and robustness of the algorithms are two key requirements.
In the first part of this paper a straightforward concept is described to increase the performance of the integration of finite elements in arbitrary, unstructured meshes by allowing for vectorization. In addition the effect of different programming languages and different array management techniques on the performance will be investigated.
Besides the element calculation, the solution of the linear system of equations takes a considerable part of computation time. Using the jagged diagonal format (JAD) for the sparse matrix, the average vector length can be increased. Block oriented computation schemes lead to considerably less indirect addressing and at the same time packaging more instructions. Thus, the overall performance of the iterative solver can be improved.
The last part discusses the input and output facility of parallel scientific software. Next to efficiency the crucial requirements for the IO subsystem in a parallel setting are scalability, flexibility and long term reliability.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Behr, M., Pressel, D.M., Sturek, W.B.: Comments on CFD Code Performance on Scalable Architectures. Computer Methods in Applied Mechanics and Engineering 190 (2000) 263–277
Oliker, L., Canning, A., Carter, J., Shalf, J., Skinner, D., Ethier, S., Biswas, R., Djomehri, J., van der Wijngaart, R.: Evaluation of Cache-based Superscalar and Cacheless Vector Architectures for Scientific Computations. In: Proceedings of the ACM/IEEE Supercomputing Conference 2003, Phoenix, Arizona, USA. (2003)
Veldhuizen, T.L.: Scientific Computing: C++ Versus Fortran: C++ has more than caught up. Dr. Dobb’s Journal of Software Tools 22 (1997) 34, 36–38, 91
Veldhuizen, T.L., Jernigan, M.E.: Will C++ be Faster than Fortran? In: Proceedings of the 1st International Scientific Computing in Object-Oriented Parallel Environments (ISCOPE’97). Lecture Notes in Computer Science, Springer-Verlag (1997)
Pohl, T., Deserno, F., Thürey, N., Rüde, U., Lammers, P., Wellein, G., Zeiser, T.: Performance Evaluation of Parallel Large-Scale Lattice Boltzmann Applications on Three Supercomputing Architectures. In: Proceedings of the ACM/IEEE Supercomputing Conference 2004, Pittsburgh, USA. (2004)
Ethier, C., Steinman, D.: Exact Fully 3d Navier Stokes Solution for Benchmarking. International Journal for Numerical Methods in Fluids 19 (1994) 369–375
Wall, W.A.: Fluid-Struktur-Interaktion mit stabilisierten Finiten Elementen. phdthesis, Institut für Baustatik, Universität Stuttgart (1999)
D’Azevedo, E.F., Fahey, M.R., Mills, R.T.: Vectorized Sparse Matrix Multiply for Compressed Row Storage Format. In: Proceedings of the 5th International Conference on Computational Science, Atlanta, USA. (2005)
Tuminaro, R.S., Shadid, J.N., Hutchinson, S.A.: Parallel Sparse Matrix Vector Multiply Software for Matrices with Data Locality. Concurrency: Practice and Experience 10–3 (1998) 229–247
Nakajima, K.: Parallel Iterative Solvers of GeoFEM with Selective Blocking Preconditioning for Nonlinear Contact Problems on the Earth Simulator. GeoFEM 2003-005, RIST/Tokyo (2003)
National Center for Supercomputing Applications. University of Illinois: Hierarchical Data Format. http://hdf.ncsa.uiuc.edu (2005)
Unidata Community: Network Common Data Form. http://my.unidata.ucar.edu/content/software/netcdf/index.html (2005)
Hunt, A., Thomas, D.: The Pragmatic Programmer: From Journeyman to Master. Addison-Wesley, Reading, MA (2000)
Aho, A.V., Sethi, R., Ullman, J.D.: Compilers. Addison-Wesley, Reading, MA (1986)
Kennedy, J., Behr, M., Kalro, V., Tezduyar, T.: Implementation of implicit finite element methods for incompressible flows on the CM-5. Computer Methods in Applied Mechanics and Engineering 119 (1994) 95–111
Guo, M., Pan, Y.: Improving Communication Scheduling for Array Redistribution. Journal of Parallel and Distributed Computing (5)65 (2005) 553–563
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Neumann, M., Küttler, U., Tiyyagura, S.R., Wall, W.A., Ramm, E. (2006). Computational Efficiency of Parallel Unstructured Finite Element Simulations. In: Resch, M., Bönisch, T., Benkert, K., Bez, W., Furui, T., Seo, Y. (eds) High Performance Computing on Vector Systems. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-35074-8_7
Download citation
DOI: https://doi.org/10.1007/3-540-35074-8_7
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-29124-4
Online ISBN: 978-3-540-35074-3
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)