StarPU: A Unified Platform for Task Scheduling on Heterogeneous Multicore Architectures

Augonnet, Cédric; Thibault, Samuel; Namyst, Raymond; Wacrenier, Pierre-André

doi:10.1007/978-3-642-03869-3_80

Cédric Augonnet¹⁷,
Samuel Thibault¹⁷,
Raymond Namyst¹⁷ &
…
Pierre-André Wacrenier¹⁷

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 5704))

Included in the following conference series:

European Conference on Parallel Processing

1978 Accesses
131 Citations

Abstract

In the field of HPC, the current hardware trend is to design multiprocessor architectures that feature heterogeneous technologies such as specialized coprocessors (e.g. Cell/BE SPUs) or data-parallel accelerators (e.g. GPGPUs).

Approaching the theoretical performance of these architectures is a complex issue. Indeed, substantial efforts have already been devoted to efficiently offload parts of the computations. However, designing an execution model that unifies all computing units and associated embedded memory remains a main challenge.

We have thus designed StarPU, an original runtime system providing a high-level, unified execution model tightly coupled with an expressive data management library. The main goal of StarPU is to provide numerical kernel designers with a convenient way to generate parallel tasks over heterogeneous hardware on the one hand, and easily develop and tune powerful scheduling algorithms on the other hand.

We have developed several strategies that can be selected seamlessly at run time, and we have demonstrated their efficiency by analyzing the impact of those scheduling policies on several classical linear algebra algorithms that take advantage of multiple cores and GPUs at the same time. In addition to substantial improvements regarding execution times, we obtained consistent superlinear parallelism by actually exploiting the heterogeneous nature of the machine.

Download to read the full chapter text

Chapter PDF

CellCilk: Extending Cilk for Heterogeneous Multicore Platforms

The LAMA Approach for Writing Portable Applications on Heterogenous Architectures

Accelerating Scientific Applications on Heterogeneous Systems with HybridOMP

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

Augonnet, C., Namyst, R.: A unified runtime system for heterogeneous multicore architectures. In: Euro-Par 2008 Workshops - Parallel Processing, Las Palmas de Gran Canaria, Spain (August 2008)
Google Scholar
Banino, C., Beaumont, O., Carter, L., Ferrante, J., Legrand, A., Robert, Y.: Scheduling strategies for master-slave tasking on heterogeneous processor platforms. IEEE Trans. Parallel Distrib. Syst. 15(4), 319–330 (2004)
Article Google Scholar
Barrachina, S., Castillo, M., Igual, F.D., Mayo, R., Quintana-Ort, E.S.: Solving Dense Linear Systems on Graphics Processors. Technical report, Universidad Jaime I, Spain (February 2008)
Google Scholar
Bellens, P., Perez, J.M., Badia, R.M., Labarta, J.: Cellss: a programming model for the cell be architecture. In: SC 2006: Proceedings of the 2006 ACM/IEEE conference on Supercomputing, p. 86. ACM, New York (2006)
Google Scholar
Buttari, A., Langou, J., Kurzak, J., Dongarra, J.: A class of parallel tiled linear algebra algorithms for multicore architectures (2007)
Google Scholar
Crawford, C.H., Henning, P., Kistler, M., Wright, C.: Accelerating computing with the cell broadband engine processor. In: CF 2008 (2008)
Google Scholar
Dolbeau, R., Bihan, S., Bodin, F.: HMPP: A hybrid multi-core parallel programming environment (2007)
Google Scholar
Duran, A., Perez, J.M., Ayguade, E., Badia, R., Labarta, J.: Extending the openmp tasking model to allow dependant tasks. In: IWOMP Proceedings (2008)
Google Scholar
Jiménez, V.J., Vilanova, L., Gelado, I., Gil, M., Fursin, G., Navarro, N.: Predictive runtime code scheduling for heterogeneous architectures. In: HiPEAC, pp. 19–33 (2009)
Google Scholar
Kunzman, D.: Charm++ on the Cell Processor. Master’s thesis, Dept. of Computer Science, University of Illinois (2006)
Google Scholar
McCool, M.D.: Data-parallel programming on the cell be and the gpu using the rapidmind development platform. In: GSPx Multicore Applications Conference (2006)
Google Scholar
Nijhuis, M., Bos, H., Bal, H.E., Augonnet, C.: Mapping and synchronizing streaming applications on cell processors. In: HiPEAC, pp. 216–230 (2009)
Google Scholar
Ohara, M., Inoue, H., Sohda, Y., Komatsu, H., Nakatani, T.: Mpi microtask for programming the cell broadband enginetm processor. IBM Syst. J. 45(1) (2006)
Google Scholar
Owens, J.D., Luebke, D., Govindaraju, N., Harris, M., Krüger, J., Lefohn, A.E., Purcell, T.J.: A survey of general-purpose computation on graphics hardware. Computer Graphics Forum 26(1), 80–113 (2007)
Article Google Scholar
Ramet, P., Roman, J.: Pastix: A parallel sparse direct solver based on a static scheduling for mixed 1d/2d block distributions. In: Proceedings of Irregular’2000, Cancun, Mexique, pp. 519–525. Springer, Heidelberg (2000)
Google Scholar
Wesolowski, L.: An application programming interface for general purpose graphics processing units in an asynchronous runtime system. Master’s thesis, Dept. of Computer Science, University of Illinois (2008)
Google Scholar
Whaley, R.C., Dongarra, J.: Automatically Tuned Linear Algebra Software. In: Ninth SIAM Conference on Parallel Processing for Scientific Computing (1999)
Google Scholar

Download references

Author information

Authors and Affiliations

University of Bordeaux – LaBRI – INRIA Bordeaux Sud-Ouest, France
Cédric Augonnet, Samuel Thibault, Raymond Namyst & Pierre-André Wacrenier

Authors

Cédric Augonnet
View author publications
You can also search for this author in PubMed Google Scholar
Samuel Thibault
View author publications
You can also search for this author in PubMed Google Scholar
Raymond Namyst
View author publications
You can also search for this author in PubMed Google Scholar
Pierre-André Wacrenier
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Software Technology, Delft University of Technology, Mekelweg 4, 2628, Delft, CD, The Netherlands
Henk Sips , Dick Epema & Hai-Xiang Lin , &

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Augonnet, C., Thibault, S., Namyst, R., Wacrenier, PA. (2009). StarPU: A Unified Platform for Task Scheduling on Heterogeneous Multicore Architectures. In: Sips, H., Epema, D., Lin, HX. (eds) Euro-Par 2009 Parallel Processing. Euro-Par 2009. Lecture Notes in Computer Science, vol 5704. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-03869-3_80

Download citation

DOI: https://doi.org/10.1007/978-3-642-03869-3_80
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-03868-6
Online ISBN: 978-3-642-03869-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

StarPU: A Unified Platform for Task Scheduling on Heterogeneous Multicore Architectures

Abstract

Chapter PDF

Similar content being viewed by others

CellCilk: Extending Cilk for Heterogeneous Multicore Platforms

The LAMA Approach for Writing Portable Applications on Heterogenous Architectures

Accelerating Scientific Applications on Heterogeneous Systems with HybridOMP

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

StarPU: A Unified Platform for Task Scheduling on Heterogeneous Multicore Architectures

Abstract

Chapter PDF

Similar content being viewed by others

CellCilk: Extending Cilk for Heterogeneous Multicore Platforms

The LAMA Approach for Writing Portable Applications on Heterogenous Architectures

Accelerating Scientific Applications on Heterogeneous Systems with HybridOMP

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation