Fine-Tuning an OpenMP-Based TVD–Hopmoc Method Using Intel® Parallel Studio XE Tools on Intel® Xeon® Architectures

Cabral, Frederico L.; Osthoff, Carla; Souto, Roberto P.; Costa, Gabriel P.; Gonzaga de Oliveira, Sanderson L.; Brandão, Diego; Kischinhevsky, Mauricio

doi:10.1007/978-3-030-16205-4_15

Part of the book series: Communications in Computer and Information Science (CCIS, volume 979)

Included in the following conference series:

Latin American High Performance Computing Conference

Abstract

This paper is concerned with parallelizing the TVD–Hopmoc method for numerical time integration of evolutionary differential equations. Using Intel® Parallel Studio XE tools, we studied three OpenMP implementations of the TVD–Hopmoc method (naive, CoP and EWS-Sync), with executions performed on Intel® Xeon® Many Integrated Core Architecture and Scalable processor. Our implementation, named EWS-Sync, defines an array that represents threads and the scheme consists of synchronizing only adjacent threads. Moreover, this approach reduces the OpenMP scheduling time by employing an explicit work-sharing strategy. Instead of permitting the OpenMP API to perform thread scheduling implicitly, this implementation of the 1-D TVD–Hopmoc method partitions among threads the array that represents the computational mesh of the numerical method. Thereby, this scheme diminishes the OpenMP spin time by avoiding barriers using an explicit synchronization mechanism where a thread only waits for its two adjacent threads. Numerical simulations show that this approach achieves promising performance gains in shared memory for multi-core and many-core environments.
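The synchronization pattern described in the abstract can be illustrated with a minimal sketch. It is not the authors' EWS-Sync code: it uses Python threads rather than OpenMP, a generic three-point averaging stencil stands in for the TVD–Hopmoc update, and all names (`run_neighbor_sync`, `step_serial`, `done`, `bounds`) are hypothetical. It only shows the two ideas named above: the mesh array is partitioned explicitly among threads, and each thread waits for its two adjacent threads instead of a global barrier.

```python
import threading


def step_serial(u):
    """One explicit 3-point averaging step; the two end values stay fixed."""
    new = u[:]
    for i in range(1, len(u) - 1):
        new[i] = 0.25 * u[i - 1] + 0.5 * u[i] + 0.25 * u[i + 1]
    return new


def run_neighbor_sync(u0, n_steps, n_threads):
    """Advance u0 by n_steps, synchronizing only adjacent threads."""
    n = len(u0)
    interior = n - 2
    if not 1 <= n_threads <= interior:
        # Every thread needs at least one cell, otherwise the stencil
        # would reach past the adjacent thread's chunk.
        raise ValueError("need 1 <= n_threads <= number of interior points")

    bufs = [u0[:], u0[:]]      # double buffer; end values are fixed in both
    done = [0] * n_threads     # done[t] = number of steps thread t finished
    cond = threading.Condition()

    # Explicit work sharing: thread t owns cells bounds[t] .. bounds[t+1]-1.
    bounds = [1 + (interior * t) // n_threads for t in range(n_threads + 1)]

    def worker(t):
        lo, hi = bounds[t], bounds[t + 1]
        neighbors = [s for s in (t - 1, t + 1) if 0 <= s < n_threads]
        for k in range(n_steps):
            # Wait only for the adjacent threads to finish step k-1.
            with cond:
                cond.wait_for(lambda: all(done[s] >= k for s in neighbors))
            src, dst = bufs[k % 2], bufs[(k + 1) % 2]
            for i in range(lo, hi):
                dst[i] = 0.25 * src[i - 1] + 0.5 * src[i] + 0.25 * src[i + 1]
            # Publish progress so that waiting neighbors can continue.
            with cond:
                done[t] = k + 1
                cond.notify_all()

    threads = [threading.Thread(target=worker, args=(t,))
               for t in range(n_threads)]
    for th in threads:
        th.start()
    for th in threads:
        th.join()
    return bufs[n_steps % 2]
```

In this sketch a thread may run at most one time step ahead of its neighbors, which is what makes the double buffer safe without a barrier. A single shared condition variable is used for brevity; a per-thread flag or lock, closer in spirit to the array that represents threads, would avoid waking threads that are not adjacent.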

Acknowledgments

CNPq, CAPES, and FAPERJ supported this work. We would like to thank the Núcleo de Computação Científica at Universidade Estadual Paulista (NCC/UNESP) for letting us execute our simulations on its heterogeneous multi-core cluster. These resources were partially funded by Intel® through the projects entitled Intel Parallel Computing Center, Modern Code Partner, and Intel/Unesp Center of Excellence in Machine Learning.

Author information

Authors and Affiliations

Laboratório Nacional de Computação Científica - LNCC, Petrópolis-RJ, Brazil
Frederico L. Cabral, Carla Osthoff, Roberto P. Souto & Gabriel P. Costa
Universidade Federal de Lavras - UFLA, Lavras-MG, Brazil
Sanderson L. Gonzaga de Oliveira
Centro Federal de Educação Tecnológica Celso Suckow da Fonseca - CEFET-RJ, Rio de Janeiro, Brazil
Diego Brandão
Universidade Federal Fluminense - UFF, Niterói-RJ, Brazil
Mauricio Kischinhevsky

Corresponding author

Correspondence to Carla Osthoff.

Editor information

Editors and Affiliations

Instituto Tecnológico de Costa Rica, Centro Nacional de Alta Tecnología, Pavas, Costa Rica
Esteban Meneses
Universidad de los Andes, Bogotá, Colombia
Harold Castro
Universidad Industrial de Santander, Bucaramanga, Colombia
Carlos Jaime Barrios Hernández
Universidad de Antioquia, Medellín, Colombia
Raul Ramos-Pollan

About this paper

Cite this paper

Cabral, F.L. et al. (2019). Fine-Tuning an OpenMP-Based TVD–Hopmoc Method Using Intel® Parallel Studio XE Tools on Intel® Xeon® Architectures. In: Meneses, E., Castro, H., Barrios Hernández, C., Ramos-Pollan, R. (eds) High Performance Computing. CARLA 2018. Communications in Computer and Information Science, vol 979. Springer, Cham. https://doi.org/10.1007/978-3-030-16205-4_15

DOI: https://doi.org/10.1007/978-3-030-16205-4_15
Published: 31 March 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-16204-7
Online ISBN: 978-3-030-16205-4
eBook Packages: Computer Science; Computer Science (R0)