Skip to main content

HICFD: Highly Efficient Implementation of CFD Codes for HPC Many-Core Architectures

  • Conference paper
  • First Online:
Competence in High Performance Computing 2010

Abstract

The objective of the German BMBF research project Highly Efficient Implementation of CFD Codes for HPC Many-Core Architectures (HICFD) is to develop new methods and tools for the analysis and optimization of the performance of parallel computational fluid dynamics (CFD) codes on high performance computer systems with many-core processors. In the work packages of the project it is investigated how the performance of parallel CFD codes written in C can be increased by the optimal use of all parallelism levels. On the highest level Message Passing Interface (MPI) is utilized. Furthermore, on the level of the many-core architecture, highly scaling, hybrid OpenMP/MPI methods are implemented. On the level of the processor cores the parallel Single Instruction Multiple Data (SIMD) units provided by modern CPUs are exploited.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Alrutz, T., Aumann, P., Basermann, A. et al.: HICFD – Hocheffiziente Implementierung von CFD-Codes für HPC-Many-Core-Architekturen. In: Mitteilungen – Gesellschaft für Informatik e. V., http://www.fg-pars.gi-ev.de/fileadmin/gliederungen/fb-ti/fg-pars/Workshops/PARS/2009/PARS-Mitteilungen_2009.pdf, Parallel-Algorithmen und Rechnerstrukturen, ISSN 0177 - 0454, pp. 27–35 (2009)

  2. Alrutz, T., Simmendinger, C., Gerhold, T: Efficiency enhancement of an unstructured CFD-Code on distributed computing systems. In: Parallel Computational Fluid Dynamics, Recent Advances and Future Directions, DEStech Publications, Inc., Lancaster, PA, USA (2010)

    Google Scholar 

  3. Basermann, A., Cortial-Goutaudier, F., Jaekel, U., Hachiya, K.: Parallel solution techniques for sparse linear systems in circuit simulation. In: Proceedings of the 4th International Workshop on Scientific Computing in Electrical Engineering, Series: Mathematics in Industry, ISBN 3-540-21372-4, Springer Berlin Heidelberg, Germany (2004)

    Google Scholar 

  4. Hohenauer, M., Engel, F., Leupers, R., Ascheid, G., Meyr, H.: A SIMD optimization framework for retargetable compilers. ACM Trans. Archit. Code Optim. 6(1), Article No. 2 (27 pages), (2009) doi: 10.1145/1509864.1509866

    Google Scholar 

  5. Karypis, G., Kumar, V.: ParMETIS: Parallel graph partitioning and sparse matrix ordering library. Tech. rep. # 97-060, University of Minnesota (1997)

    Google Scholar 

  6. Müller-Pfefferkorn, R., Nagel, W.E., Trenkler, B.: Optimizing Cache Access: A Tool for Source-to-Source Transformations and Real-Life Compiler Tests. In: Danelutto, M., Vanneschi, M., Laforenza, D. (eds.) Lecture Notes in Computer Science 3149, pp. 72–81. Springer, Heidelberg (2004)

    Google Scholar 

  7. Pokam, G., Bihan, S., Simonnet, J., Bodin, F.: SWARP: a retargetable preprocessor for multimedia instructions. Concurr. Comput.: Pract. Exper. 16(2–3), pp. 303–318, (2004) doi: 10.1002/cpe.v16:2/3

    Google Scholar 

  8. Saad,Y., Sosonkina, M.: Distributed Schur complement techniques for general sparse linear systems. SISC 21, 1337–1356 (1999)

    MathSciNet  Google Scholar 

  9. Simmendinger, C., Kügeler, E.: Hybrid Parallelization of a Turbomachinery CFD Code: Performance Enhancements on Multicore Architectures. In: Proceedings of the V European Conference on Computational Fluid Dynamics ECCOMAS CFD 2010, J.C.F. Pereira, A. Sequeira and J.M.C. Pereira (Eds), Lisbon, Portugal, 14–17 June 2010, CD-ROM, ISBN: 978-989-96778-1-4

    Google Scholar 

  10. Performance Application Programming Interface: http://icl.cs.utk.edu/papi/.Cited15Dec2010

  11. DLR Institute of Aerodynamics and Flow Technology: http://www.dlr.de/as/.Cited15Dec2010

  12. DLR Institute of Propulsion Technology, Numerical Methods: http://www.dlr.de/at/desktopdefault.aspx/tabid-1519/2123_read-3615/.Cited15Dec2010

  13. Vampir - Performance Optimization: http://www.vampir.eu/.Cited15Dec2010

  14. VampirTrace: http://tu-dresden.de/die_tu_dresden/zentrale_einrichtungen/zih/forschung/software_werkzeuge_zur_unterstuetzung_von_programmierung_und_optimierung/vampirtrace/.Cited5Oct2011

Download references

Acknowledgements

This work has been supported by the German Federal Ministry of Education and Research (BMBF) under grant 01IH08012 A.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Achim Basermann .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Basermann, A. et al. (2011). HICFD: Highly Efficient Implementation of CFD Codes for HPC Many-Core Architectures. In: Bischof, C., Hegering, HG., Nagel, W., Wittum, G. (eds) Competence in High Performance Computing 2010. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-24025-6_1

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-24025-6_1

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-24024-9

  • Online ISBN: 978-3-642-24025-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics