Abstract
The performance of a vector unit mainly depends on its start-up time and the time required for a single vector element. For vector units with a fast and limited vector memory, the achievable peak-performance is influenced by the start-up time and the vector memory size. Programming is also important to the achievable peak-performance if complicated vector expressions are evaluated. A technique known as ’strip mining’ frequently proves to be suitable. By this method, long vectors are decomposed into subvectors and the whole expression is evaluated for the current subvector before moving on to the next subvector. If many vector registers are allocated to the vector memory, the utilizable maximum vector length is considerably reduced which may have a negative influence on peak-performance. In a simple model, conditions for vector register allocation are investigated. Let M be the vector memory size, μ the ratio of the main memory access time to the vector memory access time and S the start-up time for an evaluation. A vector register should be allocated to the vector memory if the number of separate transport instructions avoidable by this allocation is greater than S/(μM). For a translation of very long expressions, a subdivision might be useful. A heuristic method is developed which subdivides long expressions into appropriate parts.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
J. R. Allen and K. Kennedy, Vector Register Allocation, Tech. Report, Rice University, Houston, Texas, April 1986.
W. K. Giloi and H. Mühlenbein, Rationale and Concepts for the Suprenum Supercomputer Architecture, Proc. Intern. Conf. on Parallel Processing, St. Charles, Illinois, 1986.
R. W. Hockney and C. R. Jesshope, Parallel Computers, Adam Hilger Ltd, Bristol, 1981.
H. Kammer, The SUPRENUM Vector floating point unit, Proc. 2nd Intern. SUPRENUM workshop, (submitt. to Parallel Computing).
N. Kroll, Mathematische Beschreibung, Konvergenzbeschleunigung und Vektorisierung des Programms FL022 zur Berechnung transonischer Strömungen um Tragflügel endlicher Spannweite, report IB129-83/26, DFVLR, Braunschweig, 1983.
K. Stüben and U. Trottenberg, Multigrid Methods: Fundamental Algorithms, Model Problem Analysis and Applications, in Hackbusch and Trottenberg (eds.), Multigrid Methods, Lecture Notes in Mathematics 960, Springer, Berlin, 1982.
H. P. Zima, H.-J. Bast, M. Gerndt and P. J. Hoppen, SUPERB: The SUPRENUM Paral-lelizer Bonn, Research Report 861203, Institut für Informatik III, Universität Bonn, West Germany, 1986.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1988 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Mierendorff, H. (1988). Optimizing the Peak-Performance of Vector Units with Dynamically Allocatable Vector Registers. In: Kastens, U., Rammig, F.J. (eds) Architektur und Betrieb von Rechensystemen. Informatik-Fachberichte, vol 168. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-73451-9_3
Download citation
DOI: https://doi.org/10.1007/978-3-642-73451-9_3
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-18994-7
Online ISBN: 978-3-642-73451-9
eBook Packages: Springer Book Archive