Layout transformation support for the disk resident arrays framework

Krishnamoorthy, Sriram; Baumgartner, Gerald; Lam, Chi-Chung; Nieplocha, Jarek; Sadayappan, P.

doi:10.1007/s11227-006-7955-4

Layout transformation support for the disk resident arrays framework

Published: May 2006

Volume 36, pages 153–170, (2006)
Cite this article

The Journal of Supercomputing Aims and scope Submit manuscript

Sriram Krishnamoorthy¹,
Gerald Baumgartner²,
Chi-Chung Lam¹,
Jarek Nieplocha³ &
…
P. Sadayappan¹

2 Citations
Explore all metrics

Abstract

The Global Arrays (GA) toolkit provides a shared-memory programming model in which data locality is explicitly managed by the programmer. It inter-operates with MPI and supports a variety of language bindings. The Disk Resident Arrays (DRA) model extends the GA programming model to secondary storage. GA and DRA together provide a convenient programming model that encourages locality-aware programming by the user, while presenting a high-level abstraction. High performance depends on the appropriate distribution of the data in the disk-resident arrays. In this paper, we discuss the addition of layout transformation support to DRA. The implementation of an efficient parallel layout transformation algorithm is done on top of existing GA/DRA functions; thus GA/DRA is itself used in implementing the enhanced DRA functionality. Experimental performance data is provided that demonstrates the effectiveness of the new layout transformation functionality.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

MongoDB Vs PostgreSQL: A comparative study on performance aspects

Article Open access 05 June 2020

Efficient High-Level Programming in Plain Java

Article 05 December 2022

A Modern Primer on Processing in Memory

References

Anderson GL (1980) A stepwise approach to computing the multidimensional fast fourier transform of large arrays. IEEE Transactions on Acoustics and Speech Signal Processing 28(3):280–284
Article MATH Google Scholar
Bailey DH (1990) FFTs in external or hierarchical memory. Journal of Supercomputing 4(1):23–35
Article Google Scholar
Baumgartner G, Bernholdt DE, Cociorva D, Harrison R, Hirata S, Lam C, Nooijen M, Pitzer R, Ramanujam J, Sadayappan P (2003) A high-level approach to synthesis of high-performance codes for quantum chemistry. In: Proceedings of Supercomputing 2002
Chen Y, Foster I, Nieplocha J, Winslett W (1997) Optimizing collective I/O performance on parallel computers: A multisystem study. In: 11th ACM Intl. Conf. on Supercomputing
Cociorva D, Baumgartner G, Lam C, Sadayappan P, Ramanujam J, Nooijen M, Bernholdt D, Harrison R (2002) Space-time trade-off optimization for a class of electronic structure calculations. In: Proc. of ACM SIGPLAN 2002 Conference on Programming Language Design and Implementation (PLDI)
Cociorva D, Gao X, Krishnan S, Baumgartner G, Lam C, Sadayappan P, Ramanujam J (2003) Global communication optimization for tensor contraction expressions under memory constraints. In: Proc. of 17th International Parallel & Distributed Processing Symposium (IPDPS)
Cociorva D, Wilkins J, Baumgartner G, Sadayappan P, Ramanujam J, Nooijen M, Bernholdt DE, Harrison R (2001) Towards automatic synthesis of high-performance codes for electronic structure calculations: Data locality optimization. In: Proc. of the Intl. Conf. on High Performance Computing
Eklundh JO (1972) A fast computer method for matrix transposing. IEEE Transactions on Computers 20(7):801–803
MathSciNet Google Scholar
The Panda Project: Data Management for High-Performance Scientific Computation. http://drl.cs.uiuc.edu/panda/
Foster I, Nieplocha J (2001) Disk Resident Arrays: An array-oriented I/O library for out-of-core computations. In: Rajkumar Buyya, Hai Jin, and Toni Cortes (eds.) Disk arrays and parallel I/O: Theory and practice. IEEE Computer Society Press
Kaushik SD, Huang C-H, Johnson RW, Sadayappan P, Johnson JR (1993) Efficient transposition algorithms for large matrices. In: Proceedings of the 1993 ACM/IEEE conference on Supercomputing ACM Press, pp. 656–665.
Kazhiyur-Mannar R, Wenger R, Crawfis R, Dey TK (2003) Adaptive resolution isosurface construction in three and four dimensions. Technical Report OSU-CISRC-7/03–TR38, Dept. of Computer and Information Science, The Ohio State University
Krishnamoorthy S, Baumgartner G, Cociorva D, Lam C, Sadayappan P (2003) Efficient parallel out-of-core matrix transposition. In: Proceedings of the International Conference on Cluster Computing. IEEE Computer Society Press
Krishnamoorthy S, Baumgartner G, Cociorva D, Lam C, Sadayappan P (2003) On efficient out-of-core matrix transposition. Technical Report OSU-CISRC-9/03-T52, Dept. of Computer and Information Science, The Ohio State University
Krishnan S, Krishnamoorthy S, Baumgartner G, Cociorva D, Lam C, Sadayappan P, Ramanujam J, Bernholdt DE, Choppella V (2003) Data locality optimization for synthesis of efficient out-of-core algoritms. In: Proc. of the Intl. Conf. on High Performance Computing
Krishnan S, Krishnamoorthy S, Baumgartner G, Lam C, Ramanujam J, Choppella V, Sadayappan P (2004) Efficient synthesis of out-of-core algorithms using a nonlinear optimization solver. In: Proc. of 18th International Parallel & Distributed Processing Symposium (IPDPS)
Mirin AA, Cohen RH, Curtis BC, Dannevik WP, Dimits AM, Duchaineau MA, Eliason DE, Schikore DR, Anderson SE, Porter DH, Woodward PR, Shieh LJ, White SW (1999) Very high resolution simulation of compressible turbulence on the IBM-SP system. In: Proceedings of the 1999 ACM/IEEE Conference on Supercomputing (CDROM) 70. ACM Press
Nieplocha J, Foster I (1996) Disk Resident Arrays: An array-oriented I/O library for out-of-core computations. In: Proceedings of the Sixth Symposium on the Frontiers of Massively Parallel Computation. IEEE Computer Society Press, pp. 196–204.
Nieplocha J, Harrison RJ, Littlefield RJ (1994) Global Arrays: A portable programming model for distributed memory computers. In: Supercomputing, pp. 340–349.
Nieplocha J, Harrison RJ, Littlefield RJ (1996) Global Arrays: A nonuniform memory access programming model for high-performance computers. The Journal of Supercomputing 10(2):169–189
Article Google Scholar
NWChem. http://www.emsl.pnl.gov/docs/nwchem/nwchem.html
Kent E Seamons and Marianne Winslett (1996) Multidimensional array I/O in Panda 1.0. The Journal of Supercomputing 10(2):191–211
Article Google Scholar
Jinwoo Suh, Prasanna VK (2002) An efficient algorithm for out-of-core matrix transposition. IEEE Transactions on Computers 51(4):420–438
Article Google Scholar
Synthesis of High-Performance Algorithms for Electronic Structure Calculations. http://www.cse.ohio-state.edu/~saday/TCE/index.html

Download references

Author information

Authors and Affiliations

Department of Computer Science and Engineering, The Ohio State University, Columbus, OH, 43210, USA
Sriram Krishnamoorthy, Chi-Chung Lam & P. Sadayappan
Department of Computer Science, Louisiana State University, Baton Rouge, LA, 70810, USA
Gerald Baumgartner
Computational Sciences and Mathematics, Pacific Northwest National Laboratory, Richland, WA, 99352, USA
Jarek Nieplocha

Authors

Sriram Krishnamoorthy
View author publications
You can also search for this author in PubMed Google Scholar
Gerald Baumgartner
View author publications
You can also search for this author in PubMed Google Scholar
Chi-Chung Lam
View author publications
You can also search for this author in PubMed Google Scholar
Jarek Nieplocha
View author publications
You can also search for this author in PubMed Google Scholar
P. Sadayappan
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Sriram Krishnamoorthy.

Additional information

This work was supported in part through funding from the U.S. Department of Energy and the National Science Foundation (award 0121676).

Rights and permissions

Reprints and permissions

About this article

Cite this article

Krishnamoorthy, S., Baumgartner, G., Lam, CC. et al. Layout transformation support for the disk resident arrays framework. J Supercomput 36, 153–170 (2006). https://doi.org/10.1007/s11227-006-7955-4

Download citation

Issue Date: May 2006
DOI: https://doi.org/10.1007/s11227-006-7955-4

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Layout transformation support for the disk resident arrays framework

Abstract

Access this article

Similar content being viewed by others

MongoDB Vs PostgreSQL: A comparative study on performance aspects

Efficient High-Level Programming in Plain Java

A Modern Primer on Processing in Memory

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Layout transformation support for the disk resident arrays framework

Abstract

Access this article

Similar content being viewed by others

MongoDB Vs PostgreSQL: A comparative study on performance aspects

Efficient High-Level Programming in Plain Java

A Modern Primer on Processing in Memory

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation