Metaheuristic-based vector quantization approach: a new paradigm for neural network-based video compression

Darwish, Saad M.; Almajtomi, Ahmed A. J.

doi:10.1007/s11042-020-10003-7

Metaheuristic-based vector quantization approach: a new paradigm for neural network-based video compression

Published: 28 October 2020

Volume 80, pages 7367–7396, (2021)
Cite this article

Multimedia Tools and Applications Aims and scope Submit manuscript

302 Accesses
7 Citations
Explore all metrics

Abstract

Video compression has great significance in the communication of motion pictures. Video compression techniques try to remove the different types of redundancy within or between video sequences. In the temporal domain, the video compression techniques remove the redundancies between the highly correlated consequence frames of the video. In the spatial domain, the video compression techniques remove the redundancies between the highly correlated consequence pixels (samples) in the same frame. Evolving neural-networks based video coding research efforts are focused on improving existing video codecs by performing better predictions that are incorporated within the same codec framework or holistic methods of end-to-end video compression schemes. Current neural network-based video compression adapts static codebook to achieve compression that leads to learning inability from new samples. This paper proposes a modified video compression model that adapts the genetic algorithm to build an optimal codebook for adaptive vector quantization that is used as an activation function inside the neural network’s hidden layer. Background subtraction algorithm is employed to extract motion objects within frames to generate the context-based initial codebook. Furthermore, Differential Pulse Code Modulation (DPCM) is utilized for lossless compression of significant wavelet coefficients; whereas low energy coefficients are lossy compressed using Learning Vector Quantization (LVQ) neural networks. Finally, Run Length Encoding (RLE) is engaged to encode the quantized coefficients to achieve a higher compression ratio. Experiments have proven the system’s ability to achieve higher compression ratio with acceptable efficiency measured by PSNR.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A survey of the recent architectures of deep convolutional neural networks

Article 21 April 2020

Video summarization using deep learning techniques: a detailed analysis and investigation

Article 15 March 2023

A review of convolutional neural network architectures and their optimizations

Article 22 June 2022

References

Atheeshwar M, Mahesh K (2014) Efficient and robust video compression using Huffman coding. Int J Adv Res Eng Technol 2(8):5–8
Bernatin T, Sundari G (2014) Video Compression based on Hybrid Transform and Quantization with Huffman Coding for Video Codec. In: Proceedings of the IEEE International Conference on Control, Instrumentation, Communication and Computational Technologies, IEEE, India, pp452–456 Jul 2014
Boufares O, Aloui N, Cherif A (2016) Adaptive threshold for background subtraction in moving object detection using stationary wavelet transforms 2D. Int J Adv Comput Sci Appl 7(8):29–36
Google Scholar
Chavan PU, Chavan PP, Dandawate YH (2009) Codebook Optimization in Vector Quantization using Genetic Algorithm. In: Proceedings of the 2nd IEEE International Conference on Computer and Electrical Engineering, vol 1. IEEE, Dubai, pp 280–283 Dec 2009
Chen T, Liu H, Shen Q, Yue T, Cao X, Ma Z (2017) Deepcoder: A Deep Neural Network Based Video Compression. In: Proceedings of the IEEE Visual Communications and Image Processing (VCIP), IEEE pp1–4 Dec 2017
Choi Y, Jun D, Cheong W, Kim B (2019) Design of Efficient Perspective Affine Motion Estimation/Compensation for Versatile Video Coding (VVC) Standard. Electronics 8(9):1–15
Google Scholar
Duch W, Kacprzyk J, Zadrożny S (2005) Artificial Neural Networks: Formal Models and Their Applications. In: Proceedings of the 15th International Conference on Science & Business Media. Springer, Poland, pp 11–15 Aug 2005
Elmolla AM, Salama GI, Elbayoumy AD (2015) A novel video compression scheme based on fast Curvelet transform. Int J Comput Sci Telecommun 6(3):7–10
Google Scholar
AM Elsayad (2016). “Classification of breast Cancer database using learning vector quantization neural networks”, Technical Report, Saudi Association of Health Informatics, Saudi Arabia, Jul 2016
Epitropakis M, Plagianakos V, Vrahatis M (2008) Balancing the Exploration and Exploitation Capabilities of the Differential Evolution Algorithm. In: Proceedings of the IEEE International Congress on Evolutionary Computation, IEEE, Hong Kong, pp 2686–2693 Jun 2008
Esakkirajan S, Veerakumar T, Navaneethan P (2009) Adaptive Vector Quantization based Video Compression Scheme. In: Proceedings of the IEEE International Conference on Signal Processing and Communication Technologies, IEEE, India, pp 40–43 Mar 2009
Feng H, Tang M, Qi J (2011) A Back-propagation neural network based on a hybrid genetic algorithm and particle swarm optimization for image compression. Int Congress Image Sig Process 3:1315–1131
Google Scholar
George NP, Anitha J (2015) Motion Estimation in Video Compression based on Artificial Bee Colony. In: Proceedings of the 2nd IEEE International Conference on Electronics and Communication Systems, IEEE, India, pp 730–733Feb 2015
Goswami K, Lee D, Kim J, Jeong S, Kim H, Kim B (2017) Two-Step Rate Distortion Optimization Algorithm for High Efficiency Video Coding. J Multimed Inf Syst 4(4):311–316
Google Scholar
Guo J, Chao H (2017) Building an End-to-End Spatial-Temporal Convolutional Network for Video Super-Resolution. In: Proceedings of the International Conference on Artificial Intelligence Thirty-First, China, pp 4053–4060 Feb 2017
Ida MP (2006) Fundamental data compression, 1st edn. Elsevier chapter 3, pp.49–65, Nov 2006
Inoue K, Hasegawa T, Mori N, Matsumoto K (2015) Analyzing Exploration Exploitation Trade-off by Means of P-I Similarity Index and Dictyostelium based Genetic Algorithm. In: Proceedings of the IEEE International Congress on Evolutionary Computation (CEC), IEEE, Japan pp 2548–2555 May 2015
Kanithi A (2011) Study of spatial and transform domain filters for efficient noise reduction, Master Thesis, National Institute of Technology, Rourkela, India
Kim B (2008) Fast Selective Intra-Mode Search Algorithm Based on Adaptive Thresholding Scheme For H. 264/AVC Encoding. IEEE Trans Circ Syst Video Technol 18(1):127–133
Article Google Scholar
Kim B (2008) Novel Inter-Mode Decision Algorithm Based on Macro block (MB) Tracking for The P-Slice In H. 264/AVC Video Coding. IEEE Trans Circ Syst Video Technol 18(2):273–281
Article Google Scholar
Kim B (2017) Fast coding unit (CU) determination algorithm for high-efficiency video coding (HEVC) in smart surveillance application. J Supercomput 73(3):1063–1084
Article Google Scholar
Kim H, Lee J, Kim C, Kim B (2012) Zoom Motion Estimation Using Block-Based Fast Local Area Scaling. IEEE Trans Circ Syst Video Technol 22(9):1280–1291
Article Google Scholar
Knop M, Cierniak R, Shah N (2014) Video Compression Algorithm based on Neural Network Structures. In: Proceedings of the International Conference on Artificial Intelligence and Soft Computing, Springer, Cham, pp 715–724 India
Knop M, Kapuściński T, Mleczko WK, Angryk R (2016) Neural Video Compression based on RBM Scene Change Detection Algorithm. In: Proceedings of the International Conference on Artificial Intelligence and Soft Computing, pp 660–669 Cambodia
Kumar G, Sharma S, Malik H (2016) Learning Vector Quantization Neural Network based External Fault Diagnosis Model for Three Phase Induction Motor using Current Signature Analysis. In: Proceedings of 6th the International Conference on Advances in Computing & Communications, India, Procedia Computer Science. 93(1):1010–1016
Lee J, Ebrahimi T (2012) Perceptual video compression: a survey. IEEE Trans Sel Top Sig Process 6(6):684–697
Article Google Scholar
Lin H, Lianga S (2014) Discrete wavelet transform based noise removal and feature extraction for ECG signals. Int J Innov Res Bio-Med Eng 35(6):351–361
Google Scholar
Lu T, Chang Y (July 2010) A survey of VQ codebook generation. Int J Inf Hiding Multimed Signal Process 1(3):190–203
Google Scholar
Lu G, Ouyang W, Xu D, Zhang X, Cai C, Gao Z (2019) Dvc: An End-To-End Deep Video Compression Framework. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 11006–11015 Jun 2019
Ma X, Pan Z, Hu S, Wang L (2014) Enhanced side match vector quantization based on constructing complementary state codebook. IET Image Process 9(4):290–299
Article Google Scholar
Ma S, Zhang X, Jia C, Zhao Z, Wang S, Wanga S (June 2020) Image And Video Compression With Neural Networks: A Review. IEEE Trans Circ Syst Video Technol 30(Issue 6):1683–1698
Article Google Scholar
Mirzaei B, Nezamabadi H, Abbasi-Moghada D (2014) An Effective Codebook Initialization Technique for LBG Algorithm using Subtractive Clustering. In: Proceedings of the IEEE International Conference on Intelligent Systems, IEEE, Iran pp 1–5 Feb 2014
Mittal S, Vetter JS (2015) A survey of architectural approaches for data compression in cache and Main memory systems. IEEE Trans Parallel Distrib Syst 27(5):1524–1536
Article Google Scholar
Nithin S, Suresh LP (2016) Video Coding on Fast Curvelet Transform and Burrows Wheeler Transform (BCH). In: Proceedings of the IEEE International Conference on Circuit, Power and Computing Technologies, IEEE, India pp 1–5 Mar 2016
Patel BK, Agrawal S (October 2013) Image compression techniques using artificial neural network. Int J Adv Res Comput Eng Technol 2(10):1–5
Google Scholar
Ponlatha S, Sabeenian R (December 2013) Comparison of video compression standards. Int J Comput Electron Eng 5(6):549–549
Article Google Scholar
Rubina I (2015) Novel Method for Fast 3D DCT for Video Compression. In: Proceedings of the International Conference on Creativity in Intelligent Technologies and Data Science, vol 535. Communications, Russia, pp 674–685
Singh MP, Arya KV, Sharma K (2009) Video Compression using Self Organizing Map and Pattern Storage using Hopfield Neural Network. In: Proceedings of the International Conference on Industrial and Information Systems, IEEE, Sri Lanka, pp 272–278 Dec 2009
Singh AV, Murthy KS (2013) Neuro-Curvelet Model for Efficient Image Compression using Vector Quantization. In: Proceedings of the International Conference on VLSI Communication Advanced Devices Signals and Systems and Networking, Lecture Notes in Electrical Engineering, vol 258. Springer, India, pp 179–185
Sivanandam SN, Deepa SN (2007) Introduction to genetic algorithms. Springer Science & Business Media Chapter 2, pp 15–37
Sun H, Lam K-Y, Chung S-L, Dong W, Gu M, Sun J (2005) Efficient vector quantization using genetic algorithm. Neural Comput Applic 14(3):203–211
Article Google Scholar
Suri A, Goraya A (2014) Hybrid approach for video compression using ant Colony optimization and modified fast Haar wavelet transform. Int J Comput Appl 97(17):26–30
Google Scholar
Tomar R, Jain K (2015) Lossless Image Compression using Differential Pulse Code Modulation and its Application. In: Proceedings of the IEEE International Conference on Computational Intelligence and Communication Networks, IEEE, India, pp 397–400 Dec 2015
Montgomery C, Rosedale T. Video test media (n.d.) https://media.xiph.org/video/derf/. (Accessed 03 Oct. 2016).
Wang W, Yang S, Tung C (2005) Codebook Design for Vector Quantization using genetic algorithm. Int J Electron Bus 3(2):83–89
Google Scholar
Wei J (2015) Application of Hybrid Back Propagation Neural Network in Image Compression. In: Proceedings of the IEEE International Conference on Intelligent Computation Technology and Automation, Nanchang, IEEE, China, pp 209–212 Jun 2015
Zhang L, Wang SQ, Fang ZJ, Shu ZH, Zhang WR (2008) MCTF-Based Curvelet Video Compression Algorithm. In: International Conference on Natural Computation, vol 5. IEEE, China, pp 559–562 Oct 2008

Download references

Author information

Authors and Affiliations

Department of Information Technology, Institute of Graduate Studies and Research, Alexandria University, 163 Horreya Avenue, El-Shatby 21526, P.O. Box 832, Alexandria, Egypt
Saad M. Darwish
Department of Computer Science, College of Science, Al-Nahrain University, Baghdad, Iraq
Ahmed A. J. Almajtomi

Authors

Saad M. Darwish
View author publications
You can also search for this author in PubMed Google Scholar
Ahmed A. J. Almajtomi
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Saad M. Darwish.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Darwish, S.M., Almajtomi, A.A.J. Metaheuristic-based vector quantization approach: a new paradigm for neural network-based video compression. Multimed Tools Appl 80, 7367–7396 (2021). https://doi.org/10.1007/s11042-020-10003-7

Download citation

Received: 04 April 2020
Revised: 26 August 2020
Accepted: 29 September 2020
Published: 28 October 2020
Issue Date: February 2021
DOI: https://doi.org/10.1007/s11042-020-10003-7

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Metaheuristic-based vector quantization approach: a new paradigm for neural network-based video compression

Abstract

Access this article

Similar content being viewed by others

A survey of the recent architectures of deep convolutional neural networks

Video summarization using deep learning techniques: a detailed analysis and investigation

A review of convolutional neural network architectures and their optimizations

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Metaheuristic-based vector quantization approach: a new paradigm for neural network-based video compression

Abstract

Access this article

Similar content being viewed by others

A survey of the recent architectures of deep convolutional neural networks

Video summarization using deep learning techniques: a detailed analysis and investigation

A review of convolutional neural network architectures and their optimizations

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation