Integrated I-cache Way Predictor and Branch Target Buffer to Reduce Energy Consumption

Tang, Weiyu; Veidenbaum, Alexander; Nicolau, Alexandru; Gupta, Rajesh

doi:10.1007/3-540-47847-7_12

Weiyu Tang⁶,
Alexander Veidenbaum⁶,
Alexandru Nicolau⁶ &
…
Rajesh Gupta⁶

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2327))

Included in the following conference series:

International Symposium on High Performance Computing

883 Accesses
4 Citations
1 Altmetric

Abstract

In this paper, we present a Branch Target Buffer (BTB) design for energy savings in set-associative instruction caches. We extend the functionality of a BTB by caching way predictions in addition to branch target addresses. Way prediction and branch target prediction are done in parallel. Instruction cache energy savings are achieved by accessing one cache way if the way prediction for a fetch is available. To increase the number of way predictions for higher energy savings, we modify the BTB management policy to allocate entries for non-branch instructions. Furthermore, we propose to partition a BTB into ways for branch instructions and ways for non-branch instructions to reduce the BTB energy as well.

We evaluate the effectiveness of our BTB design and management policies with SPEC95 benchmarks. The best BTB configuration shows a 74% energy savings on average in a 4-way set-associative instruction cache and the performance degradation is only 0.1&. When the instruction cache energy and the BTB energy are considered together, the average energy-delay product reduction is 65%.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

K.B. Normoyle et al. UltraSparc-IIi: expanding the boundaries of system on a chip. IEEE Trans. Micro, 18(2):14–24, 1998.
Article Google Scholar
Advanced Micro Devices, Inc. AMD athlon processor architecture, 2000. White paper.
Google Scholar
D. H. Albonesi. Selective cache ways: on-demand cache resource allocation. In Int’l Symp. Microarchitecture, pages 248–259, 1999.
Google Scholar
N. Bellas, I. Hajj, and C. Polychronopoulos. Using dynamic cache management techniques to reduce energy in a high-performance processor. In Int’l Symp. on Low Power Electronics and Design, pages 64–69, 1999.
Google Scholar
D. Brooks, V. Tiwari, and M. Martonosi. Wattch: a framework for architectural-level power analysis and optimizations. In Int’l Symp. Computer Architecture, pages 83–94, 2000.
Google Scholar
D. Burger and T. Austin. The simplescalar toolset, version 2.0. Technical Report TR-97-1342, University of Wisconsin-Madison, 1997.
Google Scholar
M. Check and T. Slegel. Custom S/390 G5 and G6 microprocessors. IBM Journal of Research and Development, 43(5/6):671–680, 1999.
Article Google Scholar
G. Hinton et al. The microarchitecture of the pentium 4 processor. Intel Technology Journal, Q1, 2001.
Google Scholar
K. Ghose and M. Kamble. Reducing power in superscalar processor caches using subbanking, multiple line buffers and bit-line segmentation. In Int’l Symp. on Low Power Electronics and Design, pages 70–75, 1999.
Google Scholar
K. Inoue, T. Ishihara, and K. Murakami. Way-predicting set-associative cache for high performance and low energy consumption. In Int’l Symp. on Low Power Electronics and Design, pages 273–275, 1999.
Google Scholar
J. Montanaro et al. A 160-MHz, 32-b, 0.5-W CMOS RISC microprocessor. IEEE Journal of Solid-State Circuits, 32(11):1703–14, 1996.
Article Google Scholar
R. Kessler. The Alpha 21264 microprocessor. IEEE Micro, 19(2):24–36, 1999.
Article MathSciNet Google Scholar
J. Kin, M. Gupta, and W. Mangione-Smith. The filter cache: An energy efficient memory structure. In Int’l Symp. Microarchitecture, pages 184–193, 1997.
Google Scholar
S. Manne, A. Klauser, and D. Grunwald. Pipeline gating: speculation control for energy reduction. In Int’l Symp. Computer Architecture, pages 132–141, 1998.
Google Scholar
E. Musoll. Predicting the usefulness of a block result: a micro-architectural technique for high-performance low-power processors. In Int’l Symp. Microarchitecture, pages 238–247, 1999.
Google Scholar
C. Perleberg and A. Smith. Branch target buffer design and optimization. IEEE Trans. Computers, 42(4):396–412, 1993.
Article Google Scholar
S. Wilton and N. Jouppi. An enhanced access and cycle time model for on-chip caches. Technical Report 93/5, Digital Western Research Laboratory, 1994.
Google Scholar

Download references

Author information

Authors and Affiliations

University of California at Irvine, Irvine, CA, 92697-3425, USA
Weiyu Tang, Alexander Veidenbaum, Alexandru Nicolau & Rajesh Gupta

Authors

Weiyu Tang
View author publications
You can also search for this author in PubMed Google Scholar
Alexander Veidenbaum
View author publications
You can also search for this author in PubMed Google Scholar
Alexandru Nicolau
View author publications
You can also search for this author in PubMed Google Scholar
Rajesh Gupta
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Institute of Software Science, University of Vienna, Liechtensteinstr. 22, 1090, Vienna, Austria
Hans P. Zima
Department of Information and Computer Science, Nara Women’s University, Kitauoyanishimachi, Nara City, 630-8506, Japan
Kazuki Joe
Institute of Information Science and Electronics, University of Tsukuba, Tenno-dai 1-1-1, Tsukuba, Ibaraki, 305-8577, Japan
Mitsuhisa Sato
Internet Systems Research Laboratories, NEC Corporation, 4-1-1, Miyazaki, Miyamae, Kawasaki, Kanagawa, 216-8555, Japan
Yoshiki Seo
Kyoto University, Yoshidahonmachi, Sakyo-ku, Kyoto, 606-8501, Japan
Masaaki Shimasaki

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Tang, W., Veidenbaum, A., Nicolau, A., Gupta, R. (2002). Integrated I-cache Way Predictor and Branch Target Buffer to Reduce Energy Consumption. In: Zima, H.P., Joe, K., Sato, M., Seo, Y., Shimasaki, M. (eds) High Performance Computing. ISHPC 2002. Lecture Notes in Computer Science, vol 2327. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-47847-7_12

Download citation

DOI: https://doi.org/10.1007/3-540-47847-7_12
Published: 29 April 2002
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-43674-4
Online ISBN: 978-3-540-47847-8
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics