An Efficient FIR Filter Structure Based on Technology-Optimized Multiply-Adder Unit Targeting LUT-Based FPGAs

Khurshid, Burhan; Mir, Roohie Naaz

doi:10.1007/s00034-016-0312-9

An Efficient FIR Filter Structure Based on Technology-Optimized Multiply-Adder Unit Targeting LUT-Based FPGAs

Published: 16 April 2016

Volume 36, pages 600–639, (2017)
Cite this article

Circuits, Systems, and Signal Processing Aims and scope Submit manuscript

Burhan Khurshid¹ &
Roohie Naaz Mir¹

426 Accesses
4 Citations
Explore all metrics

Abstract

Finite impulse response (FIR) filter is a fundamental element in digital signal processing (DSP) systems. Traditional implementations have been using application specific integrated circuits (ASICs) or DSP processors. However, the increase in logic capacity and versatility of the field programmable gate array (FPGA) platforms has made it possible to realize numerically intense algorithms and thus provide a complete system-on-chip (SoC) solution in a single package. With modern FPGAs fast moving from prototype designing to low and medium volume productions, it becomes imperative to consider architectural optimizations that are specific to FPGAs only. In this paper, for the first time, we attempt to optimize filtering structures by considering the technology-dependent approaches. We present a general procedure that can be used to efficiently map the Boolean networks onto FPGA fabric. Based on this procedure, a technology optimal realization of the multiply-adder unit, which is a fundamental functional unit in FIR filters is presented. Since no such implementation has been reported, we have compared our implementations against the various technology-independent optimizations that have been detailed in prior literature. Further, to give an idea about the performance speed-up achieved with our implementations, we have compared our results against the FIR structures based on the multiply-adder IP v 2.0, inherent in Xilinx FPGAs. A distinctive feature of our implementation is that a simultaneous speed-up is achieved in all three parameters (area, speed and power). This is in contrast to the technology-independent implementations where there is always a performance trade-off between different parameters.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

LUT based realization of fixed-point multipliers targeting state-of-art FPGAs

Article 10 March 2017

Technology-Optimized Fixed-Point Bit-Parallel Multipliers for FPGAs

Article 24 October 2016

A Fabric Component Based Approach to the Architecture and Design Automation of High-Performance Integer Arithmetic Circuits on FPGA

References

A. Abedelgwad, High speed and area efficient multiply Accumulate (MAC) Unit for Digital Signal Processing Applications (IEEE International Symposium on Circuits and Systems, ISCAS, 2007)
L. Aksoy, P. Flores, J. Monteiro, A tutorial on multiplier-less design of FIR filters: algorithms and architectures. Circ. Syst. Signal Process. 33, 1689–1719 (2014)
Article Google Scholar
J.H. Anderson, Q. Wang, Area-efficient FPGA logic elements: architecture and synthesis, 16th Asia and South Pacific Design Automation conference (ASP-DAC), (2011)
A. Antonion, Digital Filters: Analysis, Design and Application (McGraw Hill, New York, 1993)
Google Scholar
J.E. Carletta, M.D. Rayman, Practical Considerations in the Synthesis of High Performance Digital Filters for Implementation on FPGAs. FPL 2002, LNCS 2438, pp. 886–896, Springer, (2002)
K. Chapman, Constant Coefficient Multipliers for the XC4000E. Xilinx Technical Report (1996)
C. Cheng, K.K. Parhi, Hardware efficient fast parallel FIR filter structures based on iterated short convolution. IEEE Trans. Circ. Syst. I Reg. Pap. 51(8), 1492–1500 (2004)
Article MathSciNet Google Scholar
J. Chou, S. Mohanakrishnan, J.B. Evans, FPGA Implementation of Digital Filters, in Proceedings 4th International Conference on Signal Processing Applications and Technology, (1993)
L. Deng, K. Sobti, Y. Zhang, C. Chakarbarti, Accurate area, time and power models for FPGA based implementations. J. Signal Process. Syst. 63, 39–50 (2011)
Article Google Scholar
A. Fayed, A merged Multiplier Accumulator for High Speed Signal Processing Applications. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), (2002)
S. Gao, D. Al-Khalili, N. Chabini, Implementation of Large Size Multiplier using Ternary Adders and Higher Order Compressors. International Conference on Microelectronics, (2009)
G.R. Goslin, A Guide to using FPGAs for application-specific DSP performance, in Xilinx application note, (1998)
S. Haykin, Adaptive Filter Theory (Prentice Hall, Upper Saddle River, 1991)
MATH Google Scholar
M.K. Jaiswal, R.C.C. Cheung, Area-efficient architectures for double precision multiplier on FPGA, with run-time-reconfigurable dual single precision support. Microelectron. J. 44, 421–430 (2013)
Article Google Scholar
H.M. Kamboh, S.A. Khan, An Algorithmic Transformation for FPGA Implementation of High Throughput Filters, in Proceedings of the 7th International Conference on Emerging Technologies, (2011)
S. Kestur, J.D. Davis, O. Williams, BLAS Comparison on FPGA, CPU and GPU, in IEEE Annual Symposium on VLSI, pp. 288–293, (2010)
P. Kolling, K. Abbot, FPGA Implementation of High Performance FIR filter, Proceedings IEEE International Symposium on Circuits and Systems, ISCAS, (1997)
M. Kumm, M. Hardieck, J. Wilkomm, P. Zipf, U.M. Baese, Multiple Constant Multiplication with Ternary Adders, \(23^{\text{ rd }}\) International conference on Field Programmable Logic and Applications, (2013)
A. Ling, D.P. Singh, S.D. Brown, FPGA Technology Mapping: A Study of Optimality, in IEEE Proceedings Design Automation Conference, pp. 427–432, (2005)
P.K. Meher, Low-Latency Hardware-Efficient Memory-Based Design for Large Order FIR Digital Filters, 6th International Conference on Information, Communication and Signal Processing, pp. 10–13 (2007)
P.K. Meher, S. Chanderasekaran, A. Amira, FPGA Realization of FIR Filters by Efficient and Flexible Systolization using Distributed Arithmetic. IEEE Transactions on Signal Processing, vol. 56, No. 7, (2008)
A. Mirshekari, M. Mosleh, Hardware Implementation of a Fast FIR Filter with Residue Number System, 2nd International conference on Industrial Mechatronics and Automation, pp 312–315, 30–31 (2010)
N. Mu, G. Liu, Study on the FPGA Implementation Algorithm of Effective FIR Filter Based on Remainder Theorem, 2nd International Conference on Consumer Electronics, Communication and Networks, pp. 21–23 (2012)
R. Naseer, M. Balakrishnan, A. Kumar, Direct Mapping of RTL Structures onto LUT-Based FPGAs, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, Vol. 17, No. 7, (1998)
M. Nikolic, M. Lutovac, Sharpening of the multistage modified comb filters. Serbian J. Electric. Eng. 8(3), 281–291 (2011)
Article Google Scholar
B. Parhami, Computer Arithmetic Algorithms and Hardware Designs (Oxford University Press, New York, 2000)
Google Scholar
K.K. Parhi, Chapter 13, Bit-Level Arithmetic Architectures. VLSI Digital Signal Processing Systems Design and Implementation (Wiley, 1999)
V.D. Pavlovic, N. Doncov, D. Ciric, 1D and 2D Economical FIR filters generated by Chebyshev polynomials of the first kind. Int. J. Electronic. (2013)
A. Perwaiz, S.A. Khan, H.M. Kamboh, Optimization for quantization & embedded resources on FPGA. Proc. Int. Conf. Emerg. Technol. (2011)
J.G. Proakis, D.G. Manolakis, Digital Signal Processing: Principles, Algorithms and Applications (Prentice Hall, Upper Saddle River, 1996)
Google Scholar
P. Shi, Y.J. Yu, Design of linear phase FIR filters with high probability of achieving minimum number of adders. IEEE Circ. Syst. Soc. 58(1), 126–136 (2011)
MathSciNet Google Scholar
G. Stitt, F. Vahid, S. Nematbakhsh, Energy savings and speed-ups from partitioning critical software loops to hardware in embedded systems. ACM Trans. Embed. Comput. Syst. 3(1), 218–232 (2004)
Article Google Scholar
R. Tessier, W. Burleson, Reconfigurable computing for DSP: a survey. J. VLSI Signal Process. 28, 7–27 (2001)
Article MATH Google Scholar
T.J. Todman, G.A. Constantinides, S.J.E. Wilton, O. Mencer, W. Luk, P.Y.K. Cheung, Reconfigurable Computing: Architecture and Design Methods. in IEEE Proceedings. Computer Digital Technology, vol. 152, No. 2, (2005)
Y.C. Tsao, K. Choi, Area efficient parallel FIR digital filter structures for symmetric convolutions based on fast FIR algorithm. IEEE Trans. VLSI Syst. 20(2), 366–371 (2010)
Article Google Scholar
Y.C. Tsao, K. Choi, Area efficient VLSI implementation for parallel linear-phase FIR digital filters of odd length based on fast FIR algorithm. IEEE Trans. Circ. Syst.-II Express Briefs 59(6), 371–375 (2012)
Article Google Scholar
Y.C. Tsao, K. Choi, Hardware-Efficient VLSI Implementation for 3-Parallel Linear-Phase FIR Digital Filters of Odd Length. IEEE International Symposium on Circuits and Systems, Seoul, (2012)
R. Uma, J. Ponnian, Systolic FIR Filter Design with Various Parallel Prefix Adders in FPGA: Performance Analysis. International Symposium on Electronic System Design, Kolkatta, 19–22 (2012)
G. Wei, W.F. Ying, The Implementation of FIR Low-Pass Filter Based on FPGA and DA. 4th International Conference on Intelligent Control and Information Processing, Beijing, China, (2013)
M.J. Wirthlin, Constant coefficient multiplication using look-up tables. J. VLSI Signal Process. 36, 7–15 (2004)
Article Google Scholar
M.J. Wirthlin, B. McMurtrey, Efficient Constant Coefficient Multiplication Using Advanced FPGA Architectures. International Conference on Field Programmable Logic and Applications (FPL), (2001)
R. Woods, J. McAllister, G. Lightbody, Y. Yi, FPGA-based Implementation of Signal Processing Systems, (Wiley, New York, 2008)
Book Google Scholar
R. Woods, S. Ludwig, J. Heron, D. Trainor, S. Gehring, FPGA Synthesis on the XC6200 using IRIS and Trianus/Hades. Proceedings 5th IEEE Symposium on FPGA based Custom Computing Machines, pp. 155–164, (1997)
http://www.xilinx.com
Xilinx, Spartan-6 Family Overview. DS160 (v 2.0) (October 25, 2011). www.xilinx.com
Xilinx, “Virtex-5 Family Overview,” DS100 (v 5.0) (Feb. 6, 2009). www.xilinx.com
Xilinx, Virtex-6 Libraries Guide for HDL Designs, UG623 (v 12.3) (September 21, 2010). www.xilinx.com
X. Yuan, T. Ying, G. Chunpeng, Improved Design of Multiplier in the Digital Filter. International Conference on Computer and Communication Technologies in Agriculture Engineering, (2010)
L. Zhao, W.H. Bi, F. Liu, Design of digital FIR band pass filter using distributed algorithm based on FPGA. Electron. Meas. Technol. 30(7), 101–104 (2010)
Google Scholar
Y. Zhon, P. Shi, Distributed Arithmetic for FIR Implementation on FPGA. International Conference on Multimedia Technology, Hangzhou, China, (2011)

Download references

Author information

Authors and Affiliations

Department of Computer Science and Engineering, National Institute of Technology Srinagar, Srinagar, J&K, India
Burhan Khurshid & Roohie Naaz Mir

Authors

Burhan Khurshid
View author publications
You can also search for this author in PubMed Google Scholar
Roohie Naaz Mir
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Burhan Khurshid.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Khurshid, B., Mir, R.N. An Efficient FIR Filter Structure Based on Technology-Optimized Multiply-Adder Unit Targeting LUT-Based FPGAs. Circuits Syst Signal Process 36, 600–639 (2017). https://doi.org/10.1007/s00034-016-0312-9

Download citation

Received: 30 March 2015
Revised: 29 March 2016
Accepted: 31 March 2016
Published: 16 April 2016
Issue Date: February 2017
DOI: https://doi.org/10.1007/s00034-016-0312-9

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

An Efficient FIR Filter Structure Based on Technology-Optimized Multiply-Adder Unit Targeting LUT-Based FPGAs

Abstract

Access this article

Similar content being viewed by others

LUT based realization of fixed-point multipliers targeting state-of-art FPGAs

Technology-Optimized Fixed-Point Bit-Parallel Multipliers for FPGAs

A Fabric Component Based Approach to the Architecture and Design Automation of High-Performance Integer Arithmetic Circuits on FPGA

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

An Efficient FIR Filter Structure Based on Technology-Optimized Multiply-Adder Unit Targeting LUT-Based FPGAs

Abstract

Access this article

Similar content being viewed by others

LUT based realization of fixed-point multipliers targeting state-of-art FPGAs

Technology-Optimized Fixed-Point Bit-Parallel Multipliers for FPGAs

A Fabric Component Based Approach to the Architecture and Design Automation of High-Performance Integer Arithmetic Circuits on FPGA

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation