Abstract
Multiple description coding (MDC) has been an effective scheme for reliable transmission of videos over error prone networks but requires higher data rates. Recently, perceptual video coding schemes are able to encode videos at a lower data rate, but with the same perceived decoding quality. Such schemes exploit human visual system (HVS) properties to encode videos. Although HVS characteristics have been used in MDC, but only for residual information and are not fully standard compatible. To this end, we propose a high efficiency video coding (HEVC) standard compatible perceptual multiple description video coding (PMDVC) framework. Our proposed framework temporally sub-samples the input video into even and odd frame sub-sequences, which are individually encoded (using HEVC) in a perceptual manner. We encode each description using a context based visual saliency model to obtain visual saliency mask, which is optimally thresholded into salient and non-salient regions. Coding tree unit (CTU) level perceptual relevance mask of each frame is generated by dividing binary saliency mask into five different groups. The quantization parameter at CTU level of each perceptual relevant group is adjusted in such a manner that data rate is minimized while maintaining the perceived quality. Our proposed HEVC based PMDVC scheme is evaluated under lossless and lossy channel conditions. Our results show better performance in data rate reduction, perceptual peak signal-to-noise-ratio, multi- scale structural similarity index, and difference mean opinion score, when compared with HEVC based temporally sub-sampled multiple description video coding scheme.
Similar content being viewed by others
References
Alreshoodi, M., Adeyemi-Ejeye, F., Aljobouri, L., Fleury, M., & Alzahrani, B. (2017). Packet loss visibility for higher resolution video on portable devices. In 2017 IEEE international conference on consumer electronics (ICCE), IEEE (pp. 237–238).
Mendoza, S., & Smith, J. (2018). Mobile video optimization, US Patent 10148512 (4 Dec 2018).
Sullivan, G. J., Ohm, J., Han, W.-J., & Wiegand, T. (2012). Overview of the high efficiency video coding (HEVC) standard. IEEE Transactions on Circuits and Systems for Video Technology, 22(12), 1649–1668.
Ohm, J. R., Sullivan, G. J., Schwarz, H., Tan, T. K., & Wiegand, T. (2012). Comparison of the coding efficiency of video coding standards including high efficiency video coding (HEVC). IEEE Transactions on Circuits and Systems for Video Technology, 22(12), 1669–1684.
Wang, C., Sklar, D., & Johnson, D. (2001). Forward error-correction coding. Crosslink, 3(1), 26–29.
Huo, Y., El-Hajjar, M., & Hanzo, L. (2013). Inter-layer FEC aided unequal error protection for multilayer video transmission in mobile TV. IEEE Transactions on Circuits and Systems for Video Technology, 23(9), 1622–1634.
Kim, C. K., Lee, H. R., Jung, T. J., Kim, B. G., & Seo, K. D. (2016). An efficient delay-constrained ARQ scheme for MMT packet-based real-time video streaming over IP networks. Journal of Real-Time Image Processing, 12(2), 257–271.
Goyal, V. K. (2001). Multiple description coding: compression meets the network. IEEE Signal processing magazine, 18(5), 74–93.
Kazemi, M., Iqbal, R., & Shirmohammadi, S. (2018). Joint intra and multiple description coding for packet loss resilient video transmission. IEEE Transactions on Multimedia, 20(4), 781–795.
Li, H., Meng, L., Zhang, J., Tan, Y., Ren, Y., & Zhang, H. (2019). Multiple description coding based on convolutional auto-encoder. IEEE Access, 7, 26013–26021.
Zhao, L., Bai, H., Wang, A., & Zhao, Y. (2019). Multiple description convolutional neural networks for image compression. IEEE Transactions on Circuits and Systems for Video Technology, 29(8), 2494–2508.
Chaoub, A., Ennaoui, F. Z., & Ibn-Elhaj, E. (2019). Reliable rate-adaptive video transmission over cognitive cellular networks using multiple description scalable coding. Telecommunication Systems, 71(3), 321–338.
Wang, Y., Reibman, A. R., & Lin, S. (2005). Multiple description coding for video delivery. Proceedings of the IEEE, 93(1), 57–70.
Venkataramani, R., Kramer, G., & Goyal, V. K. (2003). Multiple description coding with many channels. IEEE Transactions on Information Theory, 49(9), 2106–2114.
Kazemi, M., Shirmohammadi, S., & Sadeghi, K. H. (2014). A review of multiple description coding techniques for error-resilient video delivery. Multimedia Systems, 20(3), 283–309.
Gallant, M., Shirani, S., & Kossentini, F. (2001). Standard-compliant multiple description video coding, in: Proceedings 2001 international conference on image processing (Cat. No. 01CH37205), Vol. 1, IEEE, pp. 946–949.
Chung, D. M., & Wang, Y. (1998). Multiple description image coding based on lapped orthogonal transforms. In Proceedings 1998 international conference on image processing. ICIP98 (Cat. No. 98CB36269), IEEE vol. 1, (pp. 664–668).
Wang, Y., Orchard, M. T., Vaishampayan, V., & Reibman, A. R. (2001). Multiple description coding using pairwise correlating transforms. IEEE Transactions on Image Processing, 10(3), 351–366.
Wang, Y., Reibman, A. R., & Lin, S. (2004). Multiple description coding for video delivery. Proceedings of the IEEE, 93(1), 57–70.
Vaishampayan, V. A. (1993). Design of multiple description scalar quantizers. IEEE Transactions on Information Theory, 39(3), 821–834.
Verdicchio, F., Munteanu, A., Gavrilescu, A. I., Cornelis, J., & Schelkens, P. (2006). Embedded multiple description coding of video. IEEE Transactions on Image Processing, 15(10), 3114–3130.
Crave, O., Pesquet-Popescu, B., & Guillemot, C. (2010). Robust video coding based on multiple description scalar quantization with side information. IEEE Transactions on Circuits and Systems for Video Technology, 20(6), 769–779.
Majid, M., & Abhayaratne, C. (2012). Redundancy controllable scalable unbalanced multiple description bitstream generation for peer-to-peer video streaming. Signal Processing: Image Communication, 27(5), 496–512.
Majid, M., Abhayaratne, C. (2009). Multiple description scalar quantization with successive refinement. In 17th European signal processing conference, IEEE (pp. 2268–2272).
Karim, H. A., Hewage, C. T., Worrall, S., & Kondoz, A. M. (2008). Scalable multiple description video coding for stereoscopic 3D. IEEE Transactions on Consumer Electronics, 54(2), 745–752.
Adedoyin, S., Fernando, W. A. C., Karim, H. A., Hewage, C. T., & Kondoz, A. M. (2008). Scalable multiple description coding with side information using motion interpolation. IEEE Transactions on Consumer Electronics, 54(4), 2045–2052.
Majid, M., Owais, M., & Anwar, S. M. (2018). Visual saliency based redundancy allocation in HEVC compatible multiple description video coding. Multimedia Tools and Applications, 77(16), 20955–20977.
Lee, J. S., & Ebrahimi, T. (2012). Perceptual video compression: a survey. IEEE Journal of Selected Topics in Signal Processing, 6(6), 684–697.
Zeeshan, M., & Majid, M. (2019). High efficiency video coding compliant perceptual video coding using entropy based visual saliency model. Entropy, 21(10), 964.
Chen, Y., Wu, K., & Zhang, Q. (2014). From QoS to QoE: a tutorial on video quality assessment. IEEE Communications Surveys and Tutorials, 17(2), 1126–1165.
Kufa, J., Polak, L., Kratochvil, T. (2016). HEVC, H. 265 versus VP9 for full HD and UHD video: Is there any difference in QoE? In International symposium ELMAR, IEEE (pp 51–55).
Zach, O., & Slanina, M. (2014). A Matlab-based tool for video quality evaluation without reference. Radioengineering, 23(1), 405–411.
Sumbal, R., Majid, & Anwar, S. M. (2013) Region of interest (ROI) based scalable multiple description image coding using MDSQ-SR. In 2013 IEEE 9th international conference on emerging technologies (ICET), IEEE (pp. 1–5).
Lin, C., Zhao, Y., Xiao, J., & Tillo, T. (2017). Region-based multiple description coding for multiview video plus depth video. IEEE Transactions on Multimedia, 20(5), 1209–1223.
Bai, H., Lin, W., Zhang, M., Wang, A., & Zhao, Y. (2014). Multiple description video coding based on human visual system characteristics. IEEE Transactions on Circuits and Systems for Video Technology, 24(8), 1390–1394.
Frintrop, S., Rome, E., & Christensen, H. I. (2010). Computational visual attention systems and their cognitive foundations: a survey. ACM Transactions on Applied Perception, 7(1), 6.
Zeeshan, M., Majid, M., Nizami, I. F., Anwar, S. M., Din, I. U., & Khan, M. K. (2018). A newly developed ground truth dataset for visual saliency in videos. IEEE Access, 6, 20855–20867.
Yuan, D., Zhao, T., Xu, Y., Xue, H., & Lin, L. (2019). Visual JND: a perceptual measurement in video coding. IEEE Access, 7, 29014–29022.
H. Jiang, J. Wang, Z. Yuan, T. Liu, N. Zheng, S. Li, (2011). Automatic salient object segmentation based on context and shape prior. In BMVC vol. 6 (pp. 110.1–110.12). Dundee, Scotland, August–September.
Zhang, L., Yang, C., Lu, H., Ruan, X., & Yang, M. (2017). Ranking saliency. IEEE Transactions on Pattern Analysis and Machine Intelligence, 39(9), 1892–1904.
Common HM Test conditions and software reference configurations (2012). Tech. rep., document JCT-VC of ISO/IEC MPEG and ITU-T VCEG, JCTVC-K1100. Shanghai, China.
HM Reference Software 16.11, Accessed: 25 Jan 2018. [Online]. Available: https://hevc.hhi.fraunhofer.de/svn/svn_HEVCSoftware/.
JCT-VC AHG report: Test sequence material (AHG5), Tech. rep., Joint collaborative team on video coding (JCT-VC) (Macao, China, Oct 2017).
P. Series (2008). Subjective video quality assessment methods for multimedia applications, Recommendation ITU-T P.910 (pp. 1–34).
Series, B. (2012). Methodology for the subjective assessment of the quality of television pictures. Recommendation ITU-R BT (pp. 500–513).
Z. Wang, E. P. Simoncelli, A. C. Bovik (2003). Multiscale structural similarity for image quality assessment. In Proceedings of IEEE thirty-seventh asilomar conference on signals, systems and computers, vol. 2 (pp. 1398–1402).
Common conditions for wire-line. (September 2001). low delay IP/UDP/RTP packet loss resilient testing. ITU-T video coding experts group VCEG (Santa Barbara, CA, USA: Tech. rep).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
On behalf of all authors, the corresponding author states that there is no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Zeeshan, M., Majid, M. HEVC compatible perceptual multiple description video coding for reliable video transmission over packet networks. Telecommun Syst 76, 63–83 (2021). https://doi.org/10.1007/s11235-020-00702-9
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11235-020-00702-9