Skip to main content

Advertisement

Log in

Metaheuristic-based vector quantization approach: a new paradigm for neural network-based video compression

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Video compression has great significance in the communication of motion pictures. Video compression techniques try to remove the different types of redundancy within or between video sequences. In the temporal domain, the video compression techniques remove the redundancies between the highly correlated consequence frames of the video. In the spatial domain, the video compression techniques remove the redundancies between the highly correlated consequence pixels (samples) in the same frame. Evolving neural-networks based video coding research efforts are focused on improving existing video codecs by performing better predictions that are incorporated within the same codec framework or holistic methods of end-to-end video compression schemes. Current neural network-based video compression adapts static codebook to achieve compression that leads to learning inability from new samples. This paper proposes a modified video compression model that adapts the genetic algorithm to build an optimal codebook for adaptive vector quantization that is used as an activation function inside the neural network’s hidden layer. Background subtraction algorithm is employed to extract motion objects within frames to generate the context-based initial codebook. Furthermore, Differential Pulse Code Modulation (DPCM) is utilized for lossless compression of significant wavelet coefficients; whereas low energy coefficients are lossy compressed using Learning Vector Quantization (LVQ) neural networks. Finally, Run Length Encoding (RLE) is engaged to encode the quantized coefficients to achieve a higher compression ratio. Experiments have proven the system’s ability to achieve higher compression ratio with acceptable efficiency measured by PSNR.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5

Similar content being viewed by others

References

  1. Atheeshwar M, Mahesh K (2014) Efficient and robust video compression using Huffman coding. Int J Adv Res Eng Technol 2(8):5–8

  2. Bernatin T, Sundari G (2014) Video Compression based on Hybrid Transform and Quantization with Huffman Coding for Video Codec. In: Proceedings of the IEEE International Conference on Control, Instrumentation, Communication and Computational Technologies, IEEE, India, pp452–456 Jul 2014

  3. Boufares O, Aloui N, Cherif A (2016) Adaptive threshold for background subtraction in moving object detection using stationary wavelet transforms 2D. Int J Adv Comput Sci Appl 7(8):29–36

    Google Scholar 

  4. Chavan PU, Chavan PP, Dandawate YH (2009) Codebook Optimization in Vector Quantization using Genetic Algorithm. In: Proceedings of the 2nd IEEE International Conference on Computer and Electrical Engineering, vol 1. IEEE, Dubai, pp 280–283 Dec 2009

  5. Chen T, Liu H, Shen Q, Yue T, Cao X, Ma Z (2017) Deepcoder: A Deep Neural Network Based Video Compression. In: Proceedings of the IEEE Visual Communications and Image Processing (VCIP), IEEE pp1–4 Dec 2017

  6. Choi Y, Jun D, Cheong W, Kim B (2019) Design of Efficient Perspective Affine Motion Estimation/Compensation for Versatile Video Coding (VVC) Standard. Electronics 8(9):1–15

    Google Scholar 

  7. Duch W, Kacprzyk J, Zadrożny S (2005) Artificial Neural Networks: Formal Models and Their Applications. In: Proceedings of the 15th International Conference on Science & Business Media. Springer, Poland, pp 11–15 Aug 2005

  8. Elmolla AM, Salama GI, Elbayoumy AD (2015) A novel video compression scheme based on fast Curvelet transform. Int J Comput Sci Telecommun 6(3):7–10

    Google Scholar 

  9. AM Elsayad (2016). “Classification of breast Cancer database using learning vector quantization neural networks”, Technical Report, Saudi Association of Health Informatics, Saudi Arabia, Jul 2016

  10. Epitropakis M, Plagianakos V, Vrahatis M (2008) Balancing the Exploration and Exploitation Capabilities of the Differential Evolution Algorithm. In: Proceedings of the IEEE International Congress on Evolutionary Computation, IEEE, Hong Kong, pp 2686–2693 Jun 2008

  11. Esakkirajan S, Veerakumar T, Navaneethan P (2009) Adaptive Vector Quantization based Video Compression Scheme. In: Proceedings of the IEEE International Conference on Signal Processing and Communication Technologies, IEEE, India, pp 40–43 Mar 2009

  12. Feng H, Tang M, Qi J (2011) A Back-propagation neural network based on a hybrid genetic algorithm and particle swarm optimization for image compression. Int Congress Image Sig Process 3:1315–1131

    Google Scholar 

  13. George NP, Anitha J (2015) Motion Estimation in Video Compression based on Artificial Bee Colony. In: Proceedings of the 2nd IEEE International Conference on Electronics and Communication Systems, IEEE, India, pp 730–733Feb 2015

  14. Goswami K, Lee D, Kim J, Jeong S, Kim H, Kim B (2017) Two-Step Rate Distortion Optimization Algorithm for High Efficiency Video Coding. J Multimed Inf Syst 4(4):311–316

    Google Scholar 

  15. Guo J, Chao H (2017) Building an End-to-End Spatial-Temporal Convolutional Network for Video Super-Resolution. In: Proceedings of the International Conference on Artificial Intelligence Thirty-First, China, pp 4053–4060 Feb 2017

  16. Ida MP (2006) Fundamental data compression, 1st edn. Elsevier chapter 3, pp.49–65, Nov 2006

  17. Inoue K, Hasegawa T, Mori N, Matsumoto K (2015) Analyzing Exploration Exploitation Trade-off by Means of P-I Similarity Index and Dictyostelium based Genetic Algorithm. In: Proceedings of the IEEE International Congress on Evolutionary Computation (CEC), IEEE, Japan pp 2548–2555 May 2015

  18. Kanithi A (2011) Study of spatial and transform domain filters for efficient noise reduction, Master Thesis, National Institute of Technology, Rourkela, India

  19. Kim B (2008) Fast Selective Intra-Mode Search Algorithm Based on Adaptive Thresholding Scheme For H. 264/AVC Encoding. IEEE Trans Circ Syst Video Technol 18(1):127–133

    Article  Google Scholar 

  20. Kim B (2008) Novel Inter-Mode Decision Algorithm Based on Macro block (MB) Tracking for The P-Slice In H. 264/AVC Video Coding. IEEE Trans Circ Syst Video Technol 18(2):273–281

    Article  Google Scholar 

  21. Kim B (2017) Fast coding unit (CU) determination algorithm for high-efficiency video coding (HEVC) in smart surveillance application. J Supercomput 73(3):1063–1084

    Article  Google Scholar 

  22. Kim H, Lee J, Kim C, Kim B (2012) Zoom Motion Estimation Using Block-Based Fast Local Area Scaling. IEEE Trans Circ Syst Video Technol 22(9):1280–1291

    Article  Google Scholar 

  23. Knop M, Cierniak R, Shah N (2014) Video Compression Algorithm based on Neural Network Structures. In: Proceedings of the International Conference on Artificial Intelligence and Soft Computing, Springer, Cham, pp 715–724 India

  24. Knop M, Kapuściński T, Mleczko WK, Angryk R (2016) Neural Video Compression based on RBM Scene Change Detection Algorithm. In: Proceedings of the International Conference on Artificial Intelligence and Soft Computing, pp 660–669 Cambodia

  25. Kumar G, Sharma S, Malik H (2016) Learning Vector Quantization Neural Network based External Fault Diagnosis Model for Three Phase Induction Motor using Current Signature Analysis. In: Proceedings of 6th the International Conference on Advances in Computing & Communications, India, Procedia Computer Science. 93(1):1010–1016

  26. Lee J, Ebrahimi T (2012) Perceptual video compression: a survey. IEEE Trans Sel Top Sig Process 6(6):684–697

    Article  Google Scholar 

  27. Lin H, Lianga S (2014) Discrete wavelet transform based noise removal and feature extraction for ECG signals. Int J Innov Res Bio-Med Eng 35(6):351–361

    Google Scholar 

  28. Lu T, Chang Y (July 2010) A survey of VQ codebook generation. Int J Inf Hiding Multimed Signal Process 1(3):190–203

    Google Scholar 

  29. Lu G, Ouyang W, Xu D, Zhang X, Cai C, Gao Z (2019) Dvc: An End-To-End Deep Video Compression Framework. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 11006–11015 Jun 2019

  30. Ma X, Pan Z, Hu S, Wang L (2014) Enhanced side match vector quantization based on constructing complementary state codebook. IET Image Process 9(4):290–299

    Article  Google Scholar 

  31. Ma S, Zhang X, Jia C, Zhao Z, Wang S, Wanga S (June 2020) Image And Video Compression With Neural Networks: A Review. IEEE Trans Circ Syst Video Technol 30(Issue 6):1683–1698

    Article  Google Scholar 

  32. Mirzaei B, Nezamabadi H, Abbasi-Moghada D (2014) An Effective Codebook Initialization Technique for LBG Algorithm using Subtractive Clustering. In: Proceedings of the IEEE International Conference on Intelligent Systems, IEEE, Iran pp 1–5 Feb 2014

  33. Mittal S, Vetter JS (2015) A survey of architectural approaches for data compression in cache and Main memory systems. IEEE Trans Parallel Distrib Syst 27(5):1524–1536

    Article  Google Scholar 

  34. Nithin S, Suresh LP (2016) Video Coding on Fast Curvelet Transform and Burrows Wheeler Transform (BCH). In: Proceedings of the IEEE International Conference on Circuit, Power and Computing Technologies, IEEE, India pp 1–5 Mar 2016

  35. Patel BK, Agrawal S (October 2013) Image compression techniques using artificial neural network. Int J Adv Res Comput Eng Technol 2(10):1–5

    Google Scholar 

  36. Ponlatha S, Sabeenian R (December 2013) Comparison of video compression standards. Int J Comput Electron Eng 5(6):549–549

    Article  Google Scholar 

  37. Rubina I (2015) Novel Method for Fast 3D DCT for Video Compression. In: Proceedings of the International Conference on Creativity in Intelligent Technologies and Data Science, vol 535. Communications, Russia, pp 674–685

  38. Singh MP, Arya KV, Sharma K (2009) Video Compression using Self Organizing Map and Pattern Storage using Hopfield Neural Network. In: Proceedings of the International Conference on Industrial and Information Systems, IEEE, Sri Lanka, pp 272–278 Dec 2009

  39. Singh AV, Murthy KS (2013) Neuro-Curvelet Model for Efficient Image Compression using Vector Quantization. In: Proceedings of the International Conference on VLSI Communication Advanced Devices Signals and Systems and Networking, Lecture Notes in Electrical Engineering, vol 258. Springer, India, pp 179–185

  40. Sivanandam SN, Deepa SN (2007) Introduction to genetic algorithms. Springer Science & Business Media Chapter 2, pp 15–37

  41. Sun H, Lam K-Y, Chung S-L, Dong W, Gu M, Sun J (2005) Efficient vector quantization using genetic algorithm. Neural Comput Applic 14(3):203–211

    Article  Google Scholar 

  42. Suri A, Goraya A (2014) Hybrid approach for video compression using ant Colony optimization and modified fast Haar wavelet transform. Int J Comput Appl 97(17):26–30

    Google Scholar 

  43. Tomar R, Jain K (2015) Lossless Image Compression using Differential Pulse Code Modulation and its Application. In: Proceedings of the IEEE International Conference on Computational Intelligence and Communication Networks, IEEE, India, pp 397–400 Dec 2015

  44. Montgomery C, Rosedale T. Video test media (n.d.) https://media.xiph.org/video/derf/. (Accessed 03 Oct. 2016).

  45. Wang W, Yang S, Tung C (2005) Codebook Design for Vector Quantization using genetic algorithm. Int J Electron Bus 3(2):83–89

    Google Scholar 

  46. Wei J (2015) Application of Hybrid Back Propagation Neural Network in Image Compression. In: Proceedings of the IEEE International Conference on Intelligent Computation Technology and Automation, Nanchang, IEEE, China, pp 209–212 Jun 2015

  47. Zhang L, Wang SQ, Fang ZJ, Shu ZH, Zhang WR (2008) MCTF-Based Curvelet Video Compression Algorithm. In: International Conference on Natural Computation, vol 5. IEEE, China, pp 559–562 Oct 2008

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Saad M. Darwish.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Darwish, S.M., Almajtomi, A.A.J. Metaheuristic-based vector quantization approach: a new paradigm for neural network-based video compression. Multimed Tools Appl 80, 7367–7396 (2021). https://doi.org/10.1007/s11042-020-10003-7

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-020-10003-7

Keywords

Navigation