Developing Efficient Implementations of Shortest Paths and Page Rank Algorithms for NEC SX-Aurora TSUBASA Architecture

Afanasyev, I. V.; Voevodin, Vad. V.; Voevodin, Vl. V.; Komatsu, Kazuhiko; Kobayashi, Hiroaki

doi:10.1134/S1995080219110039

Published: 27 November 2019

Lobachevskii Journal of Mathematics, Volume 40, pages 1753–1762 (2019)

I. V. Afanasyev¹,
Vad. V. Voevodin¹,
Vl. V. Voevodin¹,
Kazuhiko Komatsu² &
Hiroaki Kobayashi²

Abstract

The main goal of this paper is to demonstrate that the newest generation of the NEC SX-Aurora TSUBASA architecture can perform large-scale graph processing extremely efficiently. The paper proposes approaches for developing high-performance vector-oriented implementations of the page rank and shortest paths algorithms, including a vectorised graph storage format, efficient vector-friendly graph traversals, optimised cache-aware memory accesses, and efficient load balancing. The developed implementations are optimised for the most important features and properties of the SX-Aurora architecture, which allows them to achieve up to 15 times better performance than optimised parallel implementations for Intel Skylake and up to 5 times better performance than the NVGRAPH library implementations for the Pascal GPU architecture.
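The abstract names the page rank algorithm and a vectorised graph storage format but gives no details on this page. Purely as an illustration, the following is a minimal sketch of page rank over a generic compressed sparse row (CSR) layout, in which the incoming neighbours of each vertex sit in one contiguous array. It is not the authors' storage format or implementation; the names `build_csr` and `page_rank`, the damping factor 0.85, and the fixed iteration count are assumptions made for the example.

```python
from typing import List, Tuple


def build_csr(num_vertices: int,
              edges: List[Tuple[int, int]]) -> Tuple[List[int], List[int]]:
    """Build CSR arrays (row_ptr, col_idx) over incoming edges.

    For a vertex v, col_idx[row_ptr[v]:row_ptr[v + 1]] holds the sources u
    of all edges u -> v, stored contiguously.
    """
    in_degree = [0] * num_vertices
    for _, dst in edges:
        in_degree[dst] += 1

    # Prefix sum of in-degrees gives the start offset of each vertex.
    row_ptr = [0] * (num_vertices + 1)
    for v in range(num_vertices):
        row_ptr[v + 1] = row_ptr[v] + in_degree[v]

    col_idx = [0] * len(edges)
    cursor = row_ptr[:-1].copy()  # next free slot for each vertex
    for src, dst in edges:
        col_idx[cursor[dst]] = src
        cursor[dst] += 1
    return row_ptr, col_idx


def page_rank(num_vertices: int,
              edges: List[Tuple[int, int]],
              damping: float = 0.85,
              iterations: int = 50) -> List[float]:
    """Pull-style page rank: each vertex gathers rank from its in-neighbours."""
    out_degree = [0] * num_vertices
    for src, _ in edges:
        out_degree[src] += 1

    row_ptr, col_idx = build_csr(num_vertices, edges)
    rank = [1.0 / num_vertices] * num_vertices

    for _ in range(iterations):
        # Rank held by vertices without outgoing edges is spread uniformly.
        dangling = sum(rank[v] for v in range(num_vertices)
                       if out_degree[v] == 0)
        base = (1.0 - damping) / num_vertices + damping * dangling / num_vertices

        new_rank = [0.0] * num_vertices
        for v in range(num_vertices):
            acc = 0.0
            # Contiguous scan over the in-neighbours of v.
            for k in range(row_ptr[v], row_ptr[v + 1]):
                u = col_idx[k]
                acc += rank[u] / out_degree[u]
            new_rank[v] = base + damping * acc
        rank = new_rank
    return rank


if __name__ == "__main__":
    # A directed 3-cycle: by symmetry every vertex ends with the same rank.
    print(page_rank(3, [(0, 1), (1, 2), (2, 0)]))
```

The pull formulation is chosen here because each vertex writes only its own entry of `new_rank`, so the inner loop is a gather followed by a reduction. Whether this matches the traversal scheme used in the paper cannot be determined from the abstract alone.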

Funding

This project was partially supported by the JSPS Bilateral Joint Research Projects program, entitled “Theory and Practice of Vector Data Processing at Extreme Scale: Back to the Future”. The reported study was supported by the Russian Foundation for Basic Research, project no. 18-57-50005.

Author information

Authors and Affiliations

Research Computing Center of Moscow State University, Moscow, 119234, Russia
I. V. Afanasyev, Vad. V. Voevodin & Vl. V. Voevodin
Tohoku University, Sendai, Miyagi, 980-8579, Japan
Kazuhiko Komatsu & Hiroaki Kobayashi

Corresponding authors

Correspondence to I. V. Afanasyev, Vad. V. Voevodin, Vl. V. Voevodin, Kazuhiko Komatsu or Hiroaki Kobayashi.

Additional information

Submitted by E. E. Tyrtyshnikov

About this article

Cite this article

Afanasyev, I.V., Voevodin, V.V., Voevodin, V.V. et al. Developing Efficient Implementations of Shortest Paths and Page Rank Algorithms for NEC SX-Aurora TSUBASA Architecture. Lobachevskii J Math 40, 1753–1762 (2019). https://doi.org/10.1134/S1995080219110039

Received: 13 June 2019
Revised: 26 June 2019
Accepted: 17 July 2019
Published: 27 November 2019
Issue Date: November 2019
DOI: https://doi.org/10.1134/S1995080219110039
