Abstract
Video compression has great significance in the communication of motion pictures. Video compression techniques try to remove the different types of redundancy within or between video sequences. In the temporal domain, the video compression techniques remove the redundancies between the highly correlated consequence frames of the video. In the spatial domain, the video compression techniques remove the redundancies between the highly correlated consequence pixels (samples) in the same frame. Evolving neural-networks based video coding research efforts are focused on improving existing video codecs by performing better predictions that are incorporated within the same codec framework or holistic methods of end-to-end video compression schemes. Current neural network-based video compression adapts static codebook to achieve compression that leads to learning inability from new samples. This paper proposes a modified video compression model that adapts the genetic algorithm to build an optimal codebook for adaptive vector quantization that is used as an activation function inside the neural network’s hidden layer. Background subtraction algorithm is employed to extract motion objects within frames to generate the context-based initial codebook. Furthermore, Differential Pulse Code Modulation (DPCM) is utilized for lossless compression of significant wavelet coefficients; whereas low energy coefficients are lossy compressed using Learning Vector Quantization (LVQ) neural networks. Finally, Run Length Encoding (RLE) is engaged to encode the quantized coefficients to achieve a higher compression ratio. Experiments have proven the system’s ability to achieve higher compression ratio with acceptable efficiency measured by PSNR.
Similar content being viewed by others
References
Atheeshwar M, Mahesh K (2014) Efficient and robust video compression using Huffman coding. Int J Adv Res Eng Technol 2(8):5–8
Bernatin T, Sundari G (2014) Video Compression based on Hybrid Transform and Quantization with Huffman Coding for Video Codec. In: Proceedings of the IEEE International Conference on Control, Instrumentation, Communication and Computational Technologies, IEEE, India, pp452–456 Jul 2014
Boufares O, Aloui N, Cherif A (2016) Adaptive threshold for background subtraction in moving object detection using stationary wavelet transforms 2D. Int J Adv Comput Sci Appl 7(8):29–36
Chavan PU, Chavan PP, Dandawate YH (2009) Codebook Optimization in Vector Quantization using Genetic Algorithm. In: Proceedings of the 2nd IEEE International Conference on Computer and Electrical Engineering, vol 1. IEEE, Dubai, pp 280–283 Dec 2009
Chen T, Liu H, Shen Q, Yue T, Cao X, Ma Z (2017) Deepcoder: A Deep Neural Network Based Video Compression. In: Proceedings of the IEEE Visual Communications and Image Processing (VCIP), IEEE pp1–4 Dec 2017
Choi Y, Jun D, Cheong W, Kim B (2019) Design of Efficient Perspective Affine Motion Estimation/Compensation for Versatile Video Coding (VVC) Standard. Electronics 8(9):1–15
Duch W, Kacprzyk J, Zadrożny S (2005) Artificial Neural Networks: Formal Models and Their Applications. In: Proceedings of the 15th International Conference on Science & Business Media. Springer, Poland, pp 11–15 Aug 2005
Elmolla AM, Salama GI, Elbayoumy AD (2015) A novel video compression scheme based on fast Curvelet transform. Int J Comput Sci Telecommun 6(3):7–10
AM Elsayad (2016). “Classification of breast Cancer database using learning vector quantization neural networks”, Technical Report, Saudi Association of Health Informatics, Saudi Arabia, Jul 2016
Epitropakis M, Plagianakos V, Vrahatis M (2008) Balancing the Exploration and Exploitation Capabilities of the Differential Evolution Algorithm. In: Proceedings of the IEEE International Congress on Evolutionary Computation, IEEE, Hong Kong, pp 2686–2693 Jun 2008
Esakkirajan S, Veerakumar T, Navaneethan P (2009) Adaptive Vector Quantization based Video Compression Scheme. In: Proceedings of the IEEE International Conference on Signal Processing and Communication Technologies, IEEE, India, pp 40–43 Mar 2009
Feng H, Tang M, Qi J (2011) A Back-propagation neural network based on a hybrid genetic algorithm and particle swarm optimization for image compression. Int Congress Image Sig Process 3:1315–1131
George NP, Anitha J (2015) Motion Estimation in Video Compression based on Artificial Bee Colony. In: Proceedings of the 2nd IEEE International Conference on Electronics and Communication Systems, IEEE, India, pp 730–733Feb 2015
Goswami K, Lee D, Kim J, Jeong S, Kim H, Kim B (2017) Two-Step Rate Distortion Optimization Algorithm for High Efficiency Video Coding. J Multimed Inf Syst 4(4):311–316
Guo J, Chao H (2017) Building an End-to-End Spatial-Temporal Convolutional Network for Video Super-Resolution. In: Proceedings of the International Conference on Artificial Intelligence Thirty-First, China, pp 4053–4060 Feb 2017
Ida MP (2006) Fundamental data compression, 1st edn. Elsevier chapter 3, pp.49–65, Nov 2006
Inoue K, Hasegawa T, Mori N, Matsumoto K (2015) Analyzing Exploration Exploitation Trade-off by Means of P-I Similarity Index and Dictyostelium based Genetic Algorithm. In: Proceedings of the IEEE International Congress on Evolutionary Computation (CEC), IEEE, Japan pp 2548–2555 May 2015
Kanithi A (2011) Study of spatial and transform domain filters for efficient noise reduction, Master Thesis, National Institute of Technology, Rourkela, India
Kim B (2008) Fast Selective Intra-Mode Search Algorithm Based on Adaptive Thresholding Scheme For H. 264/AVC Encoding. IEEE Trans Circ Syst Video Technol 18(1):127–133
Kim B (2008) Novel Inter-Mode Decision Algorithm Based on Macro block (MB) Tracking for The P-Slice In H. 264/AVC Video Coding. IEEE Trans Circ Syst Video Technol 18(2):273–281
Kim B (2017) Fast coding unit (CU) determination algorithm for high-efficiency video coding (HEVC) in smart surveillance application. J Supercomput 73(3):1063–1084
Kim H, Lee J, Kim C, Kim B (2012) Zoom Motion Estimation Using Block-Based Fast Local Area Scaling. IEEE Trans Circ Syst Video Technol 22(9):1280–1291
Knop M, Cierniak R, Shah N (2014) Video Compression Algorithm based on Neural Network Structures. In: Proceedings of the International Conference on Artificial Intelligence and Soft Computing, Springer, Cham, pp 715–724 India
Knop M, Kapuściński T, Mleczko WK, Angryk R (2016) Neural Video Compression based on RBM Scene Change Detection Algorithm. In: Proceedings of the International Conference on Artificial Intelligence and Soft Computing, pp 660–669 Cambodia
Kumar G, Sharma S, Malik H (2016) Learning Vector Quantization Neural Network based External Fault Diagnosis Model for Three Phase Induction Motor using Current Signature Analysis. In: Proceedings of 6th the International Conference on Advances in Computing & Communications, India, Procedia Computer Science. 93(1):1010–1016
Lee J, Ebrahimi T (2012) Perceptual video compression: a survey. IEEE Trans Sel Top Sig Process 6(6):684–697
Lin H, Lianga S (2014) Discrete wavelet transform based noise removal and feature extraction for ECG signals. Int J Innov Res Bio-Med Eng 35(6):351–361
Lu T, Chang Y (July 2010) A survey of VQ codebook generation. Int J Inf Hiding Multimed Signal Process 1(3):190–203
Lu G, Ouyang W, Xu D, Zhang X, Cai C, Gao Z (2019) Dvc: An End-To-End Deep Video Compression Framework. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 11006–11015 Jun 2019
Ma X, Pan Z, Hu S, Wang L (2014) Enhanced side match vector quantization based on constructing complementary state codebook. IET Image Process 9(4):290–299
Ma S, Zhang X, Jia C, Zhao Z, Wang S, Wanga S (June 2020) Image And Video Compression With Neural Networks: A Review. IEEE Trans Circ Syst Video Technol 30(Issue 6):1683–1698
Mirzaei B, Nezamabadi H, Abbasi-Moghada D (2014) An Effective Codebook Initialization Technique for LBG Algorithm using Subtractive Clustering. In: Proceedings of the IEEE International Conference on Intelligent Systems, IEEE, Iran pp 1–5 Feb 2014
Mittal S, Vetter JS (2015) A survey of architectural approaches for data compression in cache and Main memory systems. IEEE Trans Parallel Distrib Syst 27(5):1524–1536
Nithin S, Suresh LP (2016) Video Coding on Fast Curvelet Transform and Burrows Wheeler Transform (BCH). In: Proceedings of the IEEE International Conference on Circuit, Power and Computing Technologies, IEEE, India pp 1–5 Mar 2016
Patel BK, Agrawal S (October 2013) Image compression techniques using artificial neural network. Int J Adv Res Comput Eng Technol 2(10):1–5
Ponlatha S, Sabeenian R (December 2013) Comparison of video compression standards. Int J Comput Electron Eng 5(6):549–549
Rubina I (2015) Novel Method for Fast 3D DCT for Video Compression. In: Proceedings of the International Conference on Creativity in Intelligent Technologies and Data Science, vol 535. Communications, Russia, pp 674–685
Singh MP, Arya KV, Sharma K (2009) Video Compression using Self Organizing Map and Pattern Storage using Hopfield Neural Network. In: Proceedings of the International Conference on Industrial and Information Systems, IEEE, Sri Lanka, pp 272–278 Dec 2009
Singh AV, Murthy KS (2013) Neuro-Curvelet Model for Efficient Image Compression using Vector Quantization. In: Proceedings of the International Conference on VLSI Communication Advanced Devices Signals and Systems and Networking, Lecture Notes in Electrical Engineering, vol 258. Springer, India, pp 179–185
Sivanandam SN, Deepa SN (2007) Introduction to genetic algorithms. Springer Science & Business Media Chapter 2, pp 15–37
Sun H, Lam K-Y, Chung S-L, Dong W, Gu M, Sun J (2005) Efficient vector quantization using genetic algorithm. Neural Comput Applic 14(3):203–211
Suri A, Goraya A (2014) Hybrid approach for video compression using ant Colony optimization and modified fast Haar wavelet transform. Int J Comput Appl 97(17):26–30
Tomar R, Jain K (2015) Lossless Image Compression using Differential Pulse Code Modulation and its Application. In: Proceedings of the IEEE International Conference on Computational Intelligence and Communication Networks, IEEE, India, pp 397–400 Dec 2015
Montgomery C, Rosedale T. Video test media (n.d.) https://media.xiph.org/video/derf/. (Accessed 03 Oct. 2016).
Wang W, Yang S, Tung C (2005) Codebook Design for Vector Quantization using genetic algorithm. Int J Electron Bus 3(2):83–89
Wei J (2015) Application of Hybrid Back Propagation Neural Network in Image Compression. In: Proceedings of the IEEE International Conference on Intelligent Computation Technology and Automation, Nanchang, IEEE, China, pp 209–212 Jun 2015
Zhang L, Wang SQ, Fang ZJ, Shu ZH, Zhang WR (2008) MCTF-Based Curvelet Video Compression Algorithm. In: International Conference on Natural Computation, vol 5. IEEE, China, pp 559–562 Oct 2008
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Darwish, S.M., Almajtomi, A.A.J. Metaheuristic-based vector quantization approach: a new paradigm for neural network-based video compression. Multimed Tools Appl 80, 7367–7396 (2021). https://doi.org/10.1007/s11042-020-10003-7
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-020-10003-7