Abstract
Efficiently executing the massively parallel applications has become an important goal of developing a modern high-performance multicore computer. In these parallel programs, the collective communication among these cores consume a large portion of inter-core communication. In order to prevent the collective communication from the performance bottleneck of the on-chip network, this paper proposed a new on-chip network, call Hierarchy Self Similar Cubic (HSSC), to reduce the latency of the collective communication on the multicore system. The corresponding transmission mechanisms and packet scheduling mechanism are proposed to analyze and grouping the packets, and determine a suitable transmission mechanism for each packet group on-the-fly. The experiments compare the performance of several on-chip networks. The advantages of proposed transmission mechanisms and packet scheduling mechanism are also discussed.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Dvorak, V., Jaros, J.: Optimizing collective communications on 2D-mesh and fat tree NoC. In: 2010 Ninth International Conference on Networks, pp. 22–27. IEEE, Menuires (2010)
Chu, S.L., Lee, G.S., Peng, Y.W.: Self similar cubic: a novel interconnection network for many-core architectures. In: 2012 Fifth International Symposium on Parallel Architectures, Algorithms and Programming, pp. 303–310. IEEE, Taipei (2012)
Moadeli, M., Vanderbauwhede, W.: A communication model of broadcast in wormhole-routed networks on-chip. In: 2009 International Conference on Advanced Information Networking and Applications, pp. 315–322. IEEE, Bradford (2009)
Ajima, Y., et al.: Tofu interconnect 2: system-on-chip integration of high-performance interconnect. In: Kunkel, J.M., Ludwig, T., Meuer, H.W. (eds.) ISC 2014. LNCS, vol. 8488, pp. 498–507. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-07518-1_35
Ma, S., Jerger, N. E., Wang, Z.: Supporting efficient collective communication in NoCs. In: IEEE International Symposium on High-Performance Comp Architecture, pp. 1–12. IEEE, New Orleans (2012)
Liu, M.H.: The mechanisms for improving performance of SSC on-chip network. Master Thesis, Department of Information & Computer Engineering, CYCU (2017)
Black, D.C., Donovan, J., Bunton, B., Keist, A.: SystemC: From the Ground Up. Springer, Boston (2009). https://doi.org/10.1007/978-0-387-69958-5
Trahay, F., Rue, F., Faverge, M., Ishikawa, Y., Namyst, R., Dongarra, J.: EZTrace: a generic framework for performance analysis. In: 2011 11th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing, pp. 618–619. IEEE, Newport Beach (2011)
Bailey, D.H.: NAS parallel benchmarks. In: Padua, D. (ed.) Encyclopedia of Parallel Computing, pp. 1254–1259. Springer, Boston (2011). https://doi.org/10.1007/978-0-387-09766-4_133
Snir, M., Gropp, W., Otto, S., Huss-Lederman, S., Dongarra, J., Walker, D.: MPI–The Complete Reference: The MPI Core. MIT Press, London (1998)
Acknowledgments
This work is supported in part by the Ministry of Science and Technology of Republic of China, Taiwan under Grant MOST 105-2221-E-033-047.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Chu, SL., Ho, WC., Jiang, YJ. (2020). Improving the Performance of Collective Communication for the On-Chip Network. In: Shen, H., Sang, Y. (eds) Parallel Architectures, Algorithms and Programming. PAAP 2019. Communications in Computer and Information Science, vol 1163. Springer, Singapore. https://doi.org/10.1007/978-981-15-2767-8_5
Download citation
DOI: https://doi.org/10.1007/978-981-15-2767-8_5
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-2766-1
Online ISBN: 978-981-15-2767-8
eBook Packages: Computer ScienceComputer Science (R0)