Task Scheduling of Data-Parallel Applications on HSA Platform

Bao, Zhenshan; Chen, Chong; Zhang, Wenbo

doi:10.1007/978-981-13-2203-7_35

Zhenshan Bao¹⁴,
Chong Chen¹⁴ &
Wenbo Zhang¹⁴

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 901))

Included in the following conference series:

International Conference of Pioneering Computer Scientists, Engineers and Educators

1546 Accesses
2 Citations

Abstract

As CPU processing speed has slowed down year-on-year, heterogeneous “CPU-GPU” architectures combining multi-core CPU and GPU accelerators have become increasingly attractive. Under this backdrop, the Heterogeneous System Architecture (HSA) standard was released in 2012. New Accelerated Processing Unit (APU) architectures – AMD Kaveri and Carrizo – were released in 2014 and 2015 respectively, and are compliant with HSA. These architectures incorporate two technologies central to HSA, hUMA (heterogeneous Unified Memory Access) and hQ (heterogeneous Queuing). This paper realizes radix sort and matrix-vector multiplication – two data-parallel applications on Kaveri platform. By analyzing the performance, a dynamic task scheduling stratgy is proposed. The experimental results show that the running efficiency of algorithm can be greatly improved by using APU with reasonable task scheduling. In the same way, the other data-parallel algorithm would also be optimized on these heterogeneous multi-core architecture.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Rogers, P.: Heterogeneous system architecture overview. In: 25th IEEE Hot Chips Symposium, HCS 2013, pp. 7–48. IEEE, New York (2016)
Google Scholar
Heterogeneous System Architecture: A Technical Review. http://developer.amd.com/wordpress/media/2012/10/hsa10.pdf
Bouvier, D., Sander, B.: Applying AMD’s Kaveri APU for heterogeneous computing. In: 2014 IEEE Hot Chips 26 Symposium, vol. 30, no. 4, pp. 1–42 (2014)
Google Scholar
Krishnan, G., Bouvier, D., Zhang, L., et al.: Energy efficient graphics and multimedia in 28 nm Carrizo APU. In: 2015 IEEE Hot Chips 27 Symposium, HCS 2015, pp. 1–34. IEEE, New York (2015)
Google Scholar
Krishnan, G., Bouvier, D., Naffziger, S.: Energy-efficient graphics and multimedia in 28-nm carrizo accelerated processing unit. IEEE Micro 36(2), 22–33 (2016)
Article Google Scholar
Bao, Z.S., Chen, C., Zhang, W.B., et al.: Study on heterogeneous queuing. In: International Conference on Information Engineering and Communications Technology (IECT2016), Shanghai, China (2016)
Google Scholar
Ukidave, Y., Ziabari, A.K., Mistry, P., Schirner, G., Kaeli, D.: Quantifying the energy efficiency of FFT on heterogeneous platforms. In: Proceedings of the 2013 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS 2013), pp. 235–244 (2013)
Google Scholar
Franz, W., Thulasiraman, P., Thulasiram, R.K.: Optimization of an OpenCL-based multi-swarm PSO algorithm on an APU. In: Wyrzykowski, R., Dongarra, J., Karczewski, K., Waśniewski, J. (eds.) PPAM 2013. LNCS, vol. 8385, pp. 140–150. Springer, Heidelberg (2014). https://doi.org/10.1007/978-3-642-55195-6_13
Chapter Google Scholar
Che, S., Orr, M., Rodgers, G., et al.: Betweenness centrality in an HSA-enabled system. In: Proceedings of the ACM Workshop on High Performance Graph Processing, Co-located with HPDC 2016, pp. 35–38. ACM, New York (2016)
Google Scholar
Sun, Y.F., Gong, X., Ziabari, A.K., et al.: Hetero-mark, a benchmark suite for CPU-GPU collaborative computing. In: Proceedings of the 2016 IEEE International Symposium on Workload Characterization, pp. 13–22. IEEE, New York (2016)
Google Scholar
Calandra, H., Dolbeau, R., Fortin, P., Lamotte, J.-L., Said, I.: Evaluation of successive CPUs/APUs/GPUs based on an OpenCL finite difference stencil. In: 2013 21st Euromicro International Conference on Parallel, Distributed, and Network-Based Processing (PDP 2013), Belfast, United kingdom (2013)
Google Scholar
AMD: CLOC. https://github.com/HSAFoundation

Download references

Acknowledgement

This work was supported by the significant special project for Core electronic devices, high-end general chips and basic software products (2012ZX01039-004), and also supported by Beijing Key Laboratory on Integration and Analysis of Large Scale Stream Data.

Author information

Authors and Affiliations

Faculty of Information Technology, Beijing University of Technology, Beijing, 100124, China
Zhenshan Bao, Chong Chen & Wenbo Zhang

Authors

Zhenshan Bao
View author publications
You can also search for this author in PubMed Google Scholar
Chong Chen
View author publications
You can also search for this author in PubMed Google Scholar
Wenbo Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Chong Chen or Wenbo Zhang .

Editor information

Editors and Affiliations

Zhengzhou University, Zhengzhou, Henan, China
Qinglei Zhou
Zhengzhou University of Light Industry, Zhengzhou, Henan, China
Yong Gan
Northeast Forestry University, Harbin, China
Weipeng Jing
Harbin University of Science and Technology, Harbin, China
Xianhua Song
Zhengzhou Institute of Technology, Zhengzhou, China
Yan Wang
National Academy of Guo Ding Institute of Data Science, Beijing, China
Zeguang Lu

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Bao, Z., Chen, C., Zhang, W. (2018). Task Scheduling of Data-Parallel Applications on HSA Platform. In: Zhou, Q., Gan, Y., Jing, W., Song, X., Wang, Y., Lu, Z. (eds) Data Science. ICPCSEE 2018. Communications in Computer and Information Science, vol 901. Springer, Singapore. https://doi.org/10.1007/978-981-13-2203-7_35

Download citation

DOI: https://doi.org/10.1007/978-981-13-2203-7_35
Published: 09 September 2018
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-2202-0
Online ISBN: 978-981-13-2203-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics