Exploiting OpenMP to Provide Scalable SMP BLAS and LAPACK Routines
- Cliff Addison, Fujitsu European Centre for Information Technology Ltd.
The present Fujitsu PRIMEPOWER 2000 system can have up to 128 processors in an SMP node. It is therefore desirable to provide users of this system with high-performance parallel BLAS and LAPACK routines that scale to as many processors as possible. It is also desirable that users can obtain some level of parallel performance merely by relinking their codes with the SMP Math Libraries. This talk outlines the major design decisions taken in providing OpenMP versions of BLAS and LAPACK routines to users, discusses some of the algorithmic issues that have been addressed, and discusses some of the shortcomings of OpenMP for this task.
A good deal has been learned about exploiting OpenMP in this ongoing activity, and the talk will attempt to identify what worked and what did not. For instance, while OpenMP does not support recursion, some of the basic ideas behind recursive linear algebra algorithms can be exploited to overlap sequential operations with parallel ones. As another example, the overheads of dynamic scheduling tended to outweigh the better load balancing that such a schedule provides, so static cyclic loop scheduling was more effective.
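The abstract gives no code, so the following is only an illustrative sketch of what static cyclic loop scheduling looks like in OpenMP, not the routine used in the Fujitsu libraries. It assumes a hypothetical function `upper_tri_matvec` that computes y = Ux for an upper triangular matrix held in full row-major storage. The work per row shrinks as the row index grows, so a plain block partition would leave the last threads nearly idle. `schedule(static, 1)` instead deals rows to threads round-robin, with the assignment fixed at loop entry.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical example routine (not from the talk): y = U * x, where U is an
 * n-by-n upper triangular matrix stored row-major in a full n*n array.
 * Row i touches n - i entries, so iterations have unequal cost. */
void upper_tri_matvec(long n, const double *u, const double *x, double *y)
{
    assert(n >= 0 && u != NULL && x != NULL && y != NULL);

    /* schedule(static, 1): chunks of one iteration are assigned to threads
     * in round-robin order (a cyclic distribution). The mapping is decided
     * up front, so there is no run-time work queue as with
     * schedule(dynamic). Without OpenMP enabled, the pragma is ignored and
     * the loop simply runs serially. */
    #pragma omp parallel for schedule(static, 1)
    for (long i = 0; i < n; ++i) {
        double sum = 0.0;
        for (long j = i; j < n; ++j) {
            sum += u[i * n + j] * x[j];
        }
        y[i] = sum;  /* each iteration writes a distinct element of y */
    }
}
```

With a cyclic distribution each thread receives a mix of long and short rows, which approximates the load balance of a dynamic schedule without its per-chunk scheduling overhead. Whether that trade-off pays off in a given routine depends on the problem size and the machine.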
- Book Title
- Computational Science — ICCS 2001
- Book Subtitle
- International Conference San Francisco, CA, USA, May 28–30, 2001 Proceedings, Part I
- p 3
- Series Title
- Lecture Notes in Computer Science
- Springer Berlin Heidelberg
- Copyright Holder
- Springer-Verlag Berlin Heidelberg
- Editor Affiliations
- 1. School of Computer Science, Cybernetics and Electronic Engineering, University of Reading
- 2. Innovative Computing Lab, Computer Science Department, University of Tennessee
- 3. Computer Science Department, California State University
- 4. School of Computer Science, The Queen’s University of Belfast
- Cliff Addison (5)
- Author Affiliations
- 5. Fujitsu European Centre for Information Technology Ltd., 2 Longwalk Road, Stockley Park, Uxbridge, Middlesex, England, UB11 1AB