Abstract
Modern computing architectures use cache memory as the buffer between high speed computing units and low latency main memory. Higher capacity caches are thought to be critical for deep neural network processors, which handle large amounts of data. However, as cache memory capacity increases, it occupies large die area that can otherwise be used for computing units. This is the inherent trade off between memory capacity and performance. In this work, we present a deep neural network processing chip, with a near-memory computing architecture. We eliminate the SRAM cache and use DRAM only as on-chip memory, delivering high performance and high memory capacity.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Lapedus, M.: Memory Issues for AI Edge Chips, 23 March 2020. https://semiengineering.com/memory-issues-for-ai-edge-chips/
Khan, S.M., Mann, A.: AI Chips: What They Are and Why They Matter, April 2020. https://cset.georgetown.edu/research/why-ai-chips-matter
Joel, H.: How L1 and L2 CPU Caches Work, and Why They’re an Essential Part of Modern Chips. Extreme Tech, 14 April 2020
Microsoft, Turing-NLG: A 17-billion-parameter language model by Microsoft. https://www.microsoft.com/en-us/research/blog/turing-nlg-a-17-billion-parameter-language-model-by-microsoft/
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Tam, E. et al. (2021). DRAM-Based Processor for Deep Neural Networks Without SRAM Cache. In: Arai, K. (eds) Intelligent Computing. Lecture Notes in Networks and Systems, vol 284. Springer, Cham. https://doi.org/10.1007/978-3-030-80126-7_52
Download citation
DOI: https://doi.org/10.1007/978-3-030-80126-7_52
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-80125-0
Online ISBN: 978-3-030-80126-7
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)