Architecture-independent locality analysis and efficient PRAM simulations
We introduced an approach to parallel computing which unites the automatic and direct programming paradigms within the core BSPlib environment. Our PRAM simulator and companion C++ macro-based language[LS96] is scalable and portable; it has been tested on the IBM SP2, Cray T3D, SGI Power challenge, and a cluster of Sun Workstations. Directly-programmed solutions to regular problems inevitably can obtain greater performance than our simulator. Our approach wins for irregular problems with poor locality.
- [GV94]A. V. Gerbessiotis and L. G. Valiant. Direct bulk-synchronous parallel algorithms. Journal of Parallel and Distributed Computing, 22:251–267, 1994.Google Scholar
- [LS96]D. S. Lecomber and K. R. Sujithan. Transgressing the boundaries: Unified scalable parallel programming. Technical Report PRG-TR-20-96, Oxford University Computing Laboratory, November 1996.Google Scholar