Abstract
The Liverpool Single-transputer library [10] was ported to the i860 as part of the Esprit Genesis project, P2702. The library contains approximately 250 routines, including the BLAS (a well-known set of subroutines providing functions commonly used in numerical computing; see [8], [3], [2], [1]) and a set of vector routines known as the FLO routines.
This strategy is expensive to implement on new processors, especially highly structured ones such as the i860. The software was therefore designed with a philosophy similar to the one the hardware designers applied to the processor: it was coded in modular form, built on a few carefully optimised core routines.
The results were encouraging: the number of assembler routines was significantly reduced and large speed-ups were obtained over Fortran.
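The modular approach described above can be illustrated with a minimal sketch. It is not the library's actual code: the choice of an axpy-style operation as the core kernel and all routine names (`core_axpy`, `vadd`, `vsub`, `gemv`) are assumptions made for illustration only. The point is that only the core kernel would need hand-optimised assembler, while the higher-level routines are thin wrappers around it.

```python
# Hypothetical sketch: the routine names and the choice of core kernel are
# assumptions for illustration, not taken from the paper or the library.

def core_axpy(n, alpha, x, y):
    """Core kernel: y[i] += alpha * x[i] for the first n elements.

    Stands in for the one routine that would be hand-optimised in assembler.
    """
    for i in range(n):
        y[i] += alpha * x[i]
    return y


def vadd(n, x, y):
    """Vector add, y := y + x, expressed through the core kernel."""
    return core_axpy(n, 1.0, x, y)


def vsub(n, x, y):
    """Vector subtract, y := y - x, expressed through the core kernel."""
    return core_axpy(n, -1.0, x, y)


def gemv(m, n, alpha, a, x, y):
    """Matrix-vector update, y := y + alpha * A x.

    A is stored as a list of n columns, each of length m, so every column
    contributes one call to the core kernel.
    """
    for j in range(n):
        core_axpy(m, alpha * x[j], a[j], y)
    return y
```

In a layout like this, porting to a new processor would mean re-optimising `core_axpy` alone; the wrappers stay unchanged.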
Thanks to Cliff Addison and Mike Delves of CMSR for help and comments on this paper. Partial funding for this work was provided by the Esprit Genesis project, P2702.
This work was carried out while the author was at N.A. Software, Liverpool.
References
D.S. Dodson, R.G. Grimes, and J.G. Lewis. Sparse extensions to the Fortran basic linear algebra subprograms. ETA-TR-63, Aug. 1987.
J.J. Dongarra, J. Du Croz, I. Duff, and S. Hammarling. A set of level 3 basic linear algebra subprograms. ANL-MCS-TM 88 (Revision 1), 1988.
J.J. Dongarra, J. Du Croz, S. Hammarling, and R.J. Hanson. An extended set of Fortran basic linear algebra subprograms. ACM Trans. Math. Softw., 14(1):1–18, Mar. 1988.
J.J. Dongarra, P. Mayes, and G. Radicati di Brozolo. The IBM RISC System/6000 and linear algebra operations. Technical report, LAPACK Report Ref. No. 28, Dec. 1990.
R.W. Hockney and C.R. Jesshope. Parallel Computers. Adam Hilger, second edition, 1988.
Intel. i860 Microprocessor Reference Manual. Intel, 1990.
M.S. Lam, E.E. Rothberg, and M.E. Wolf. The cache performance and optimisations of blocked algorithms. In Fourth International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS IV), Palo Alto, California, 1991.
C.L. Lawson, R.J. Hanson, D.R. Kincaid, and F.T. Krogh. Basic linear algebra subprograms for Fortran usage. ACM Trans. Math. Softw., 5(3):308–323, Sep. 1979.
N.A. Software Limited. Liverpool Single-Processor Assembler Library Manual. N.A. Software Limited, Merseyside Innovation Centre, 131 Mount Pleasant, Liverpool, L3 5TF, July 1991.
D.C.B. Watson, R. Wilkinson, P.G.N. Howard, and C.J. Willis. Machine code implementations of basic vector subroutines for the T800. In L. Freeman and C. Phillips, editors, Applications of Transputers 2, pages 541–546. IOS Press, Amsterdam, 1990.
© 1992 Springer-Verlag Berlin Heidelberg
Cite this paper
Wilkinson, B., Mulholland, L.S. (1992). An implementation of the BLAS on the i860: A RISC approach to software for RISC devices. In: Bougé, L., Cosnard, M., Robert, Y., Trystram, D. (eds) Parallel Processing: CONPAR 92—VAPP V. VAPP CONPAR 1992. Lecture Notes in Computer Science, vol 634. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-55895-0_423
DOI: https://doi.org/10.1007/3-540-55895-0_423
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-55895-8
Online ISBN: 978-3-540-47306-0
eBook Packages: Springer Book Archive