Theory, practice, and a tool for BSP performance prediction
The Bulk Synchronous Parallel (BSP) model provides a theoretical framework to accurately predict the execution time of parallel programs. In this paper we describe a BSP programming library that has been developed and contrast two approaches to analysing performance: (1) a pencil and paper method; (2) a profiling tool that analyses trace information generated during program execution. These approaches are evaluated on an industrial application code that solves fluid dynamics equations around a complex aircraft geometry on IBM SP2 and SGI Power Challenge machines. We show how the profiling tool can be used to explore the communication patterns of the CFD code and accurately predict the performance of the application on any parallel machine.
Unable to display preview. Download preview PDF.
- 1.D. Burgess, P. Crumpton, and M. Giles. A parallel framework for unstructured grid solvers. In Computational Fluid Dynamics ECCOMAS'94, pages 391–396. John Wiley & Sons, 1994.Google Scholar
- 2.P. Crumpton and M. Giles. Implicit time accurate solutions on unstructured dynamic grids. AIAA Paper 95–1671, 1995.Google Scholar
- 3.M. W. Goudreau, J. M. D. Hill, K. Lang, B. McColl, S. B. Rao, D. C. Stefanescu, T. Suel, and T. Tsantilas. A proposal for the BSP Worldwide standard library, April 1996. See www.bsp-worldwide.org for more details.Google Scholar
- 4.M. W. Goudreau, K. Lang, S. B. Rao, T. Suel, and T. Tsantilas. Towards efficiency and portability: Programming with the BSP model. In Proc. 8th Annual ACM Symposium on Parallel Algorithms and Architectures, 1996.Google Scholar
- 6.W. F. McColl. Scalable computing. In J. van Leeuwen, editor, Computer Science Today: Recent Trends and Developments, number 1000 in Lecture notes in Computer Science, pages 46–61. Springer-Verlag, 1995.Google Scholar
- 7.R. Miller. Two approaches to architecture-independent parallel computation. D.Phil thesis, Oxford University, Michaelmas Term 1994.Google Scholar