Abstract
Gaussian processes are a widely used alternative to neural networks for non-linear system identification. The method requires inverting a large covariance matrix. In this work, we introduce our new task-based asynchronous implementation, focusing on the most popular solver, the Cholesky decomposition. Our implementation is built on HPX and its asynchronous many-task runtime system, which allows us to investigate scaling on multi-core hardware and with GPU offloading. Furthermore, we compare our HPX implementation against a high-level reference implementation based on PETSc. We demonstrate that the HPX implementation's performance is directly tied to the chosen tile size. Compared to the PETSc reference, our task-based implementation is faster throughout the entire node-level strong scaling experiment on AMD EPYC Rome, showing better parallel efficiency.
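To make the central operation concrete, the following minimal Python/NumPy sketch shows how Gaussian process regression avoids explicitly inverting the covariance matrix by using a Cholesky factorization and triangular solves. This is a toy illustration of the mathematical problem only; the kernel choice, noise level, and all function names here are illustrative assumptions, not the authors' HPX or PETSc implementation.

```python
import numpy as np

def rbf_kernel(x1, x2, lengthscale=1.0):
    # Squared-exponential covariance between two sets of 1-D inputs.
    d = x1[:, None] - x2[None, :]
    return np.exp(-0.5 * (d / lengthscale) ** 2)

def gp_predict(x_train, y_train, x_test, noise=1e-2):
    # Covariance matrix of the noisy training observations.
    K = rbf_kernel(x_train, x_train) + noise * np.eye(len(x_train))
    # Cholesky factorization K = L L^T -- the O(n^3) step that a
    # task-based tiled implementation parallelizes.
    L = np.linalg.cholesky(K)
    # Solve K alpha = y with two triangular solves instead of forming K^-1.
    alpha = np.linalg.solve(L.T, np.linalg.solve(L, y_train))
    # Posterior mean at the test inputs.
    return rbf_kernel(x_test, x_train) @ alpha

x = np.linspace(0.0, 2.0 * np.pi, 50)
y = np.sin(x)
mean = gp_predict(x, y, np.array([np.pi / 2.0]))
print(mean)  # close to sin(pi/2) = 1
```

In a tiled formulation, the single `cholesky` call is replaced by a directed acyclic graph of POTRF, TRSM, SYRK, and GEMM tasks on matrix tiles, which is what enables the asynchronous execution and the tile-size sensitivity discussed in the abstract.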
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Strack, A., Pflüger, D. (2023). Scalability of Gaussian Processes Using Asynchronous Tasks: A Comparison Between HPX and PETSc. In: Diehl, P., Thoman, P., Kaiser, H., Kale, L. (eds) Asynchronous Many-Task Systems and Applications. WAMTA 2023. Lecture Notes in Computer Science, vol 13861. Springer, Cham. https://doi.org/10.1007/978-3-031-32316-4_5
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-32315-7
Online ISBN: 978-3-031-32316-4