Abstract
We describe a performance study of a multi-zone application benchmark implemented with several OpenMP approaches that exploit multi-level parallelism and deal with unbalanced workloads. The multi-zone application was derived from the well-known NAS Parallel Benchmarks (NPB) suite and involves flow solvers on collections of loosely coupled discretization meshes. Parallel versions of this application have been developed using the Subteam concept and the Workqueuing model as extensions to the current OpenMP standard. We examine the performance impact of these extensions on a large shared-memory machine and compare them with hybrid and nested OpenMP programming models.
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Jin, H., Chapman, B., Huang, L. (2008). Performance Evaluation of a Multi-zone Application in Different OpenMP Approaches. In: Chapman, B., Zheng, W., Gao, G.R., Sato, M., Ayguadé, E., Wang, D. (eds) A Practical Programming Model for the Multi-Core Era. IWOMP 2007. Lecture Notes in Computer Science, vol 4935. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-69303-1_3
DOI: https://doi.org/10.1007/978-3-540-69303-1_3
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-69302-4
Online ISBN: 978-3-540-69303-1
eBook Packages: Computer Science, Computer Science (R0)