Perception-based adaptive quantization for transform-domain Wyner-Ziv video coding

Zhang, Lei; Peng, Qiang; Wu, Xiao

doi:10.1007/s11042-016-3947-4

Published: 08 November 2016

Volume 76, pages 16699–16725, (2017)
Multimedia Tools and Applications

Lei Zhang¹,
Qiang Peng¹ &
Xiao Wu¹

Abstract

Distributed video coding (DVC) is desirable for encoding systems with tight power or computational constraints, and its most popular practical realization is transform-domain Wyner-Ziv video coding (TD-WZVC). Quantization is a key factor if TD-WZVC is to achieve coding performance similar to that of H.264/AVC. In practice, the quantization matrix is trained offline and remains fixed during coding, so optimal rate-distortion (RD) performance cannot be achieved because the quality of the side information (SI) frame varies. In this paper, a novel model of perceptual distortion probability is developed to estimate the perceptual distortion of the SI frame and to derive the target perceptual distortion. With these two perceptual distortion probabilities, three components (i.e., quality of the SI frame, perceptual features, and RD optimization) are integrated to determine the optimal quantization matrix adaptively, which improves the coding performance. Extensive experiments demonstrate that the proposed scheme can adaptively determine a proper quantization matrix online and achieve similar visual quality at a lower bit rate than other adaptive quantization schemes in TD-WZVC.
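
As a rough illustration of what selecting a quantization matrix by RD optimization can look like, the following Python sketch picks, among candidate sets of quantization step sizes, the one that minimizes a Lagrangian cost J = D + lambda * R for a block of transform coefficients. It is not the perceptual distortion probability model of the paper, whose details are not given in the abstract. The names quantize, rd_cost and select_matrix, the candidate step sizes, the log2-based rate proxy and the sample coefficients are all hypothetical and chosen only for illustration.

```python
import math

def quantize(coeffs, steps):
    """Uniform scalar quantization with one step size per coefficient band.
    Note: Python's round() rounds exact halves to the nearest even integer."""
    return [round(c / s) for c, s in zip(coeffs, steps)]

def dequantize(indices, steps):
    """Reconstruct coefficient values from quantization indices."""
    return [q * s for q, s in zip(indices, steps)]

def rd_cost(coeffs, steps, lam):
    """Lagrangian cost J = D + lam * R (illustrative assumption).
    D is the squared reconstruction error; R is a crude bit-count proxy,
    not an actual Slepian-Wolf or entropy-coder rate."""
    indices = quantize(coeffs, steps)
    recon = dequantize(indices, steps)
    distortion = sum((c - r) ** 2 for c, r in zip(coeffs, recon))
    rate = sum(math.log2(1 + abs(q)) for q in indices)
    return distortion + lam * rate

def select_matrix(coeffs, candidates, lam):
    """Return the name of the candidate with the smallest RD cost."""
    return min(candidates, key=lambda name: rd_cost(coeffs, candidates[name], lam))

# Hypothetical transform coefficients for one block (four bands).
coeffs = [52.0, -13.0, 7.0, 2.0]

# Hypothetical candidate quantization matrices, flattened to step sizes.
candidates = {
    "fine": [2.0, 2.0, 2.0, 2.0],
    "coarse": [16.0, 16.0, 16.0, 16.0],
}

low_lambda_choice = select_matrix(coeffs, candidates, lam=1.0)
high_lambda_choice = select_matrix(coeffs, candidates, lam=20.0)
print(low_lambda_choice, high_lambda_choice)
```

A small lambda favors low distortion and therefore the finer steps, while a large lambda penalizes rate and favors the coarser steps. An adaptive scheme in the spirit of the abstract would additionally let the SI quality and perceptual features influence this choice, which the sketch does not attempt.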


Acknowledgments

The work described in this paper was supported by the NSFC (Grant Nos. 60972111, 61036008, 61071184, 61373121), the Research Funds for the Doctoral Program of Higher Education of China (Nos. 20100184120009, 20120184110001), the Program for Sichuan Provincial Science Fund for Distinguished Young Scholars (Nos. 2012JQ0029, 13QNJJ0149), and the Fundamental Research Funds for the Central Universities (Project Nos. SWJTU09CX032, SWJTU10CX08, SWJTU11ZT08).

Author information

Authors and Affiliations

School of Information Science and Technology, Southwest Jiaotong University, Chengdu, China
Lei Zhang, Qiang Peng & Xiao Wu

Corresponding author

Correspondence to Lei Zhang.

About this article

Cite this article

Zhang, L., Peng, Q. & Wu, X. Perception-based adaptive quantization for transform-domain Wyner-Ziv video coding. Multimed Tools Appl 76, 16699–16725 (2017). https://doi.org/10.1007/s11042-016-3947-4

Received: 22 July 2014
Revised: 02 September 2016
Accepted: 07 September 2016
Published: 08 November 2016
Issue Date: August 2017
DOI: https://doi.org/10.1007/s11042-016-3947-4
