Solving alignment using elementary linear algebra
Data and computation alignment is an important part of compiling sequential programs to architectures with non-uniform memory access times. In this paper, we show that elementary matrix methods can be used to determine communication-free alignment of code and data. We also solve the problem of replicating read-only data to eliminate communication. Our matrix-based approach leads to algorithms which are simpler and faster than existing algorithms for the alignment problem.
KeywordsNull Space Array Element Alignment Problem Virtual Processor Equational Constraint
Unable to display preview. Download preview PDF.
- [AL93]Jennifer M. Anderson and Monica S. Lam. Global optimizations for parallelism and locality on scalable parallel machines. ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI), pages 112–125, June 1993.Google Scholar
- [Ban93]U. Banerjee. Loop transformations for restructuring compilers. Kluwer Publishing, 1993.Google Scholar
- [CGS93]Siddartha Chatterjee, John Gilbert, and Robert Schreiber. The alignment-distribution graph. In U. Banerjee, D. Gelernter, A. Nicolau, and D. Padua, editors, Languages and Compilers for Parallel Computing. Sixth International Workshop., number 768 in LNCS. Springer-Verlag, 1993.Google Scholar
- [GGST92]Siddartha Chatterjee, John Gilbert, Robert Schreiber, and Shang-Hua Teng. Optimal evaluation of array expressions on massively parallel machines. Technical Report CSL-92-11, XEROX PARC, December 1992.Google Scholar
- [Edm67]Jack Edmonds. Systems of distinct representatives and linear algebra. Journal of research of national bureau of standards (Sect. B), 71(4):241–245, 1967.Google Scholar
- [Fea92]Paul Feautrier. Toward automatic distribution. Technical Report 92.95, IBP/MASI, December 1992.Google Scholar
- [GVL89]Gene H. Golub and Charles F. Van Loan. Matrix Computations. The John Hopkins University Press, second edition, 1989.Google Scholar
- [HS91]C.-H. Huang and P. Sadayappan. Communication-free hyperplane partitioning of nested loops. In U. Banerjee, D. Gelernter, A. Nicolau, and D. Padua, editors, Languages and Compilers for Parallel Computing. Fourth International Workshop. Santa Clara, CA., number 589 in LNCS, pages 186–200. Springer-Verlag, August 1991.Google Scholar
- [KLD92]Kathleen Knobe, Joan D. Lucas, and William J. Dally. Dynamic alignment on distributed memory systems. In Proceedings of the Third Workshop on Compilers for Parallel Computers, July 1992.Google Scholar
- [KN90]Kathleen Knobe and Venkataraman Natarajan. Data optimization: minimizing residual interprocessor motion on SIMD machines. In Proceedings of the 3rd Symposium on the Frontiers of Massively Parallel Computation — Frontiers '90, pages 416–423, October 1990.Google Scholar
- [LC89]Jingke Li and Marina Chen. Index domain alignment: minimizing cost of cross-referencing between distributed arrays. Technical Report YALEU/DCS/TR-725, Department of Computer Science, Yale University, September 1989.Google Scholar
- [LP92]W. Li and K. Pingali. Access normalization: loop restructuring for NUMA compilers. In Proceedings of the 5th International Conference on Architectural Support for Programming Languages and Operating Systems, October 1992.Google Scholar