Skip to main content
Log in

Design optimization of the quantization and a pipelined 2D-DCT for real-time applications

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

The Discrete Cosine Transform (DCT) is one of the most widely used techniques for image compression. Several algorithms are proposed to implement the DCT-2D. The scaled SDCT algorithm is an optimization of the DCT-1D, which consists in gathering all the multiplications at the end. In this paper, in addition to the hardware implementation on an FPGA, an extended optimization has been performed by merging the multiplications in the quantization block without having an impact on the image quality. A simplified quantization has been performed also to keep higher the performances of the all chain. Tests using MATLAB environment have shown that our proposed approach produces images with nearly the same quality of the ones obtained using the JPEG standard. FPGA-based implementations of this proposed approach is presented and compared to other state of the art techniques. The target is an an Altera Cyclone II FPGA using the Quartus synthesis tool. Results show that our approach outperforms the other ones in terms of processing-speed, used resources and power consumption. A comparison has been done between this architecture and a distributed arithmetic based architecture.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7

Similar content being viewed by others

References

  1. Agostini L, Bampi S (2001) Pipelined Fast 2-D DCT Architecture for JPEG image compression. Proceedings of the 14th Annual Symposium on Integrated Circuits and Systems Design, Pirenopolis, Brazil. IEEE Computer Society, pp 226–231

  2. Andraka R (1998) A survey of CORDIC algorithms for FPGA based computers. Proceedings of the ACM/SIGDA sixth international symposium on Field programmable gate arrays, pp 191–200

  3. Arai Y, Agui T, Nakajima M (1988) A fast DCT-SQ scheme for images. Trans IEICE E71:1095–1097

    Google Scholar 

  4. Belkouch S, El Aakif M, Ouahman AA, Hassani MM (2010) Improved implementation of a modified discrete cosine transform on low-cost FPGA. 5th International Symposium on I/V Communications and Mobile Network (ISVC), pp 1–4

  5. Bougzeel S and Omair Ahmed M and Swamy MNS (2009) A fast 8 × 8 for image compressing. Proc. of the International Conference on Microelectronics, pp 74–77

  6. Chen J, Liu KJR (1998) A complete pipelined parallel CORDIC architecture for motion estimation. IEEE Transactions on Circuits and Systems II: Analog and Digital Signal Processing 45:653–660

    Article  Google Scholar 

  7. Chen WH, Smith C, Fralick S (1977) A fast computational algorithm for the discrete cosine transform. IEEE Trans Commun 25:1004–1009

    Article  MATH  Google Scholar 

  8. Chiu CT, Liu KJR (1992) Real-time parallel and fully pipelined twodimensional DCT lattice structures with application to HDTV systems. IEEE Trans Circ Syst Video Tech 2:25–37

    Article  Google Scholar 

  9. El Aakif M, Belkouch S, Chabini N, Hassani, MM (2011) Low power and fast DCT architecture using multiplier-less method. Faible Tens Faible Consomm (FTFC) pp 63–66

  10. Feig E, Winograd S (1992) On the multiplicative complexity of discrete cosine transform. IEEE Trans Inform Theor 38:1387–1391

    Article  MathSciNet  MATH  Google Scholar 

  11. Haggag MN, El-Sharkawy M and Fahmy G (2010) Efficient fast multiplication-free integer transformation for the 2-D DCT H.265 Standard. IEEE 17th International Conference on Image Processing (ICIP), pp 3769–3772

  12. http://www.altera.com/devices/fpga/cyclone2/cy2-index.jsp

  13. Huang J, Parris M, Lee J, De Mara RF (2009) Scalable FPGA-based architecture for DCT computation using dynamic partial reconfiguration. ACM Trans Embed Comput Syst 9(1):1–18

    Article  Google Scholar 

  14. Ismail Y, McNeely J, Shaaban M, Al Najjar M, Bayoumi MA (2010) A fast discrete transform architecture for frequency domain motion estimation. 17th IEEE International Conference on Image Processing (ICIP), pp 1249–1252

  15. Kassem A, Hamad M, Haidamous E (2009) Image compression on FPGA using DCT. Intern Conf Advance Comput Tools Engineer Appl ACTEA pp 320–323

  16. Kusuma ED, Widodo TS (2010) FPGA implementation of pipelined 2D-DCT and quantization architecture for JPEG image compression. Intern Symp Inform Tech (ITSim) pp 1–6

  17. Le Gall D (1991) MPEG: a video compression standard for multimedia applications. Commun ACM 34:46–58

    Article  Google Scholar 

  18. Linzer E and Feig E (1991) New scaled DCT algorithms for fused multiply/add architectures. in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process, pp 2201–2204

  19. Loeffler C, Lightenberg A and Moschytz GS (1989) Practical Fast 1-D DCT algorithms with 11-multiplications. Proc. of ICASSP, Glagow, UK vol. 2, pp 988–991

  20. Peled A, Liu B (1974) A new hardware realization of digital filters. IEEE Trans Acoust Speech Signal Process ASSP-22:456–462

    Article  Google Scholar 

  21. Rao KR and Yip P (1990) Discrete cosine transform: algorithms, advantages, applications. Academic Press

  22. Tumeo A, Monchiero M, Palermo G, Ferrandi F, Sciuto D (2007) A Pipelined Fast 2D-DCT Accelerator for FPGA-based SoCs. ISVLSI ’07. IEEE Comp Soc Ann Symp VLSI pp 331–336

  23. Wahid KA, Khan AW, Vassil SD, Graham AJ (2007) On the error-free realization of a Scaled DCT Algorithm and Its VLSI Implementation. IEEE Trans Circ Syst Exp Briefs 54(8):700–704

    Article  Google Scholar 

  24. Wallace GK (1991) The JPEG still picture compression standard. Commun ACM 34:30–44

    Article  Google Scholar 

  25. Wang Z (1984) Fast algorithms for the discrete w transform and for the discrete fourier transform. IEEE Trans Acoust Speech Signal Process 32:803–816

    Article  MathSciNet  MATH  Google Scholar 

  26. Wang Z, Bovik AC, Sheikh HR, Simoncelli EP (2004) Image quality assessment: from error visibility to structural similarity. IEEE Trans Image Process 13(4):600–612

    Article  Google Scholar 

  27. White SA (1989) Applications of distributed arithmetic to digital sequence processing: a tutorial review. IEEE ASSP Mag 6:5–19

    Article  Google Scholar 

  28. Wu Z, Sha J, Wang Z, Li L, Gao M (2009) An improved scaled DCT architecture. IEEE Trans Consum Electron 55–2:685–689

    Article  Google Scholar 

  29. Xiuhua J, Caiming Z, Xuefen Z (2011) An efficient joint implementation of three stages for fast computation of color space conversation in image coding/decoding. Multimed Tool Appl pp 1–15

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Anas Hatim.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Hatim, A., Belkouch, S., El Aakif, M. et al. Design optimization of the quantization and a pipelined 2D-DCT for real-time applications. Multimed Tools Appl 67, 667–685 (2013). https://doi.org/10.1007/s11042-012-1043-y

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-012-1043-y

Keywords

Navigation