Abstract
The SIMD DSP is highly efficient for embedded applications whose parallel data are aligned. However, there are many unaligned and irregular data accesses in typical embedded algorithms such as FFT, FIR. The vectorization of these kinds of algorithms will need many additional shuffle instruction operations in the SIMD architecture with alignment restriction, which greatly decreases the computation efficiency with the increasing SIMD width. This paper proposes an efficient vector memory unit (VMU) with 16 memory blocks on a 16-way SIMD DSP, M-DSP. Each memory block contains four groups of multi-bank memory structure with most-lowest-bit interleaved addressing and affords double bandwidth as needed to reduce the parallel vector access conflicts. A high-bandwidth data shuffle unit capable of dual vector accesses alignment is carried out in the vector access pipelining, which not only efficiently supports the unaligned access but also the special vector access patterns for FFT. The experimental results have shown that the VMU could afford conflict-free parallel accesses between DMA and vector Load/Stores operations with no more than 10% area overhead, and M-DSP achieves an ideal accelerate rate for FFT and FIR algorithms.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Keller, R.M.: Look-ahead microprocessors. ACM Computing surveys 7(4), 177–195 (1975)
Khailany, B., Dally, W.J., Chang, A., Kapasi, U.J., Namkoong, J., Towles, B.: VLSI design and verification of the Imagine microprocessor. In: Proceedings of the IEEE International Conference on Computer Design, pp. 289–296 (September 2002)
Woh, M., Seo, S., Mahlke, S., et al.: AnySP:Anytime Anywhere Anyway Signal Processing. In: Proceedings of the 36th Annual International Symposium on Computer Architecture, Austin, Texas, USA, pp. 128–139 (June 2009)
Rowen, C., Nicolaescu, D., Ravindran, R., et al.: The World’s Fastest DSP core: Breaking the 100 GMAC/s Barrier. In: Proceedings of the 23rd Hot Chips Conference. Memorial Auditorium, Standford University (August 2011)
Chang, H., Sung, W.: Efficent vectorization of SIMD programs with non-aligned and irregular data access hardware. In: CASES 2008, pp. 167–175 (2008)
Sheng, L.: Researches on On-chip Parallel Data Access Techniques for SIMDDSPswith Very Wide Data Path. PhD Thesis, NUDT, Hunan, China (April 4, 2012)
Berkel, K., Heinle, F., et al.: Vector processing as an enabler for software-defined radio in handheld devices. EURASIP Journal on Applied Signal Processing 16, 2613–2625 (2005)
Khailany, B., Dally, W.J., et al.: Imagine: media processing with streams. IEEE Micro 3 (2001)
Seiler, L., et al.: Larrabee: A many-core x86 architecture for visual computing. In: SIGGRAPH 2008, New York, NY, USA, pp. 1–15 (2008)
Pajuelo, A., Gonzalez, A., Valero, M.: Speculative dynamic vectorization. In: Proceedings of the 29th Ann. Int’l Symp. Computer Architecture, pp. 271–280 (2002)
Liu, Z., Chen, Y.-Y., Chen, H.-Y.: A Vectorization of FIR Filter Supporting Arbitrary Coefficients Length and Data Types. Aca Electronica Sinica 41(2), 346–351 (2013)
Rodriguez V, P.: A radix-2 FFT algorithm for modern single instruction multiple data (SIMD) architectures. In: Proc. 2002 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2002), vol. 3, pp. 3220–3223 (2002)
Gou, C., Kuzmanov, G., Gaydadjiev, G.N.: SAMS: Single-Affiliation Multiple-Stride Parallel Memory Scheme. In: Proceedings of the Workshop on Memory Access on Future Processors: a Solved Problem, Ischia, Italy, pp. 350–368 (May 2008)
Texas Instruments, C64x Fixed-Point DSPs Bench-marks, http://www.ti.com/lsds/ti/dsp/c6000_dsp/c64x/benchmarks.page
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Chen, H., Liu, Z., Liu, S., Ma, S. (2015). An Efficient Vector Memory Unit for SIMD DSP. In: Xu, W., Xiao, L., Li, J., Zhang, C., Zhu, Z. (eds) Computer Engineering and Technology. NCCET 2014. Communications in Computer and Information Science, vol 491. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-45815-0_1
Download citation
DOI: https://doi.org/10.1007/978-3-662-45815-0_1
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-45814-3
Online ISBN: 978-3-662-45815-0
eBook Packages: Computer ScienceComputer Science (R0)