Skip to main content

Optimizing the Peak-Performance of Vector Units with Dynamically Allocatable Vector Registers

  • Conference paper
Book cover Architektur und Betrieb von Rechensystemen

Part of the book series: Informatik-Fachberichte ((INFORMATIK,volume 168))

  • 42 Accesses

Abstract

The performance of a vector unit mainly depends on its start-up time and the time required for a single vector element. For vector units with a fast and limited vector memory, the achievable peak-performance is influenced by the start-up time and the vector memory size. Programming is also important to the achievable peak-performance if complicated vector expressions are evaluated. A technique known as ’strip mining’ frequently proves to be suitable. By this method, long vectors are decomposed into subvectors and the whole expression is evaluated for the current subvector before moving on to the next subvector. If many vector registers are allocated to the vector memory, the utilizable maximum vector length is considerably reduced which may have a negative influence on peak-performance. In a simple model, conditions for vector register allocation are investigated. Let M be the vector memory size, μ the ratio of the main memory access time to the vector memory access time and S the start-up time for an evaluation. A vector register should be allocated to the vector memory if the number of separate transport instructions avoidable by this allocation is greater than S/(μM). For a translation of very long expressions, a subdivision might be useful. A heuristic method is developed which subdivides long expressions into appropriate parts.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 54.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 69.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. J. R. Allen and K. Kennedy, Vector Register Allocation, Tech. Report, Rice University, Houston, Texas, April 1986.

    Google Scholar 

  2. W. K. Giloi and H. Mühlenbein, Rationale and Concepts for the Suprenum Supercomputer Architecture, Proc. Intern. Conf. on Parallel Processing, St. Charles, Illinois, 1986.

    Google Scholar 

  3. R. W. Hockney and C. R. Jesshope, Parallel Computers, Adam Hilger Ltd, Bristol, 1981.

    MATH  Google Scholar 

  4. H. Kammer, The SUPRENUM Vector floating point unit, Proc. 2nd Intern. SUPRENUM workshop, (submitt. to Parallel Computing).

    Google Scholar 

  5. N. Kroll, Mathematische Beschreibung, Konvergenzbeschleunigung und Vektorisierung des Programms FL022 zur Berechnung transonischer Strömungen um Tragflügel endlicher Spannweite, report IB129-83/26, DFVLR, Braunschweig, 1983.

    Google Scholar 

  6. K. Stüben and U. Trottenberg, Multigrid Methods: Fundamental Algorithms, Model Problem Analysis and Applications, in Hackbusch and Trottenberg (eds.), Multigrid Methods, Lecture Notes in Mathematics 960, Springer, Berlin, 1982.

    Google Scholar 

  7. H. P. Zima, H.-J. Bast, M. Gerndt and P. J. Hoppen, SUPERB: The SUPRENUM Paral-lelizer Bonn, Research Report 861203, Institut für Informatik III, Universität Bonn, West Germany, 1986.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 1988 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Mierendorff, H. (1988). Optimizing the Peak-Performance of Vector Units with Dynamically Allocatable Vector Registers. In: Kastens, U., Rammig, F.J. (eds) Architektur und Betrieb von Rechensystemen. Informatik-Fachberichte, vol 168. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-73451-9_3

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-73451-9_3

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-18994-7

  • Online ISBN: 978-3-642-73451-9

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics