Abstract
Existing depth video coding algorithms generally rely on in-loop depth filters whose performance is unstable and easily degraded by outliers. In this paper, we design a joint weighted sparse representation-based median filter as the in-loop filter in a depth video codec. The filter constructs a depth candidate set containing the relevant neighboring depth pixels, selected by sparse coding weighted by depth and intensity similarity; a median operation is then performed on this set to select one neighboring depth pixel as the filtering result. Experimental results indicate that the depth bitrate is reduced by about 9% compared with the anchor method, confirming that the proposed method is more effective in reducing the depth bitrate required for a given synthesis quality level.
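The two-stage idea in the abstract (build a candidate set of relevant neighbors, then take a median that returns one of them) can be illustrated with a minimal Python sketch. This is not the paper's method: the abstract does not give the sparse coding formulation, so the sketch swaps it for a simpler top-k selection by a Gaussian depth-and-intensity similarity weight. The function name `candidate_median` and the parameters `radius`, `k`, `sigma_d`, and `sigma_i` are hypothetical and chosen only for illustration.

```python
import math


def candidate_median(depth, intensity, r, c, radius=1, k=5,
                     sigma_d=10.0, sigma_i=10.0):
    """Filter one depth pixel at (r, c).

    ASSUMPTION: top-k selection by similarity weight stands in for the
    weighted sparse coding step described in the abstract.
    """
    h, w = len(depth), len(depth[0])
    scored = []
    for i in range(max(0, r - radius), min(h, r + radius + 1)):
        for j in range(max(0, c - radius), min(w, c + radius + 1)):
            dd = depth[i][j] - depth[r][c]
            di = intensity[i][j] - intensity[r][c]
            # Joint weight: high when both depth and intensity are similar.
            weight = math.exp(-(dd * dd) / (2 * sigma_d ** 2)
                              - (di * di) / (2 * sigma_i ** 2))
            scored.append((weight, depth[i][j]))
    # Candidate set: the k most similar neighbors (the centre pixel included).
    scored.sort(key=lambda t: t[0], reverse=True)
    candidates = sorted(d for _, d in scored[:k])
    # Lower median, so the output is always an existing neighboring depth value.
    return candidates[(len(candidates) - 1) // 2]


# An isolated outlier in a flat region is replaced by a neighboring depth.
flat_depth = [[50, 50, 50], [50, 200, 50], [50, 50, 50]]
flat_intensity = [[100, 100, 100], [100, 100, 100], [100, 100, 100]]
outlier_result = candidate_median(flat_depth, flat_intensity, 1, 1)

# A pixel next to a depth edge keeps the depth of its own side of the edge.
edge_depth = [[10, 100, 100], [10, 100, 100], [10, 100, 100]]
edge_intensity = [[20, 200, 200], [20, 200, 200], [20, 200, 200]]
edge_result = candidate_median(edge_depth, edge_intensity, 1, 1)
```

Because the output is always drawn from the candidate set, the sketch never creates intermediate depth values across an object boundary, which is the property that motivates choosing a median over an averaging filter for depth maps.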
Author information
Additional information
Foundation item: Supported by the National Natural Science Foundation of China (61462048)
Biography: LÜ Haitao, male, Associate professor, research direction: multimedia communications.
About this article
Cite this article
Lü, H., Yin, C., Cui, Z. et al. A depth video coding in-loop median filter based on joint weighted sparse representation. Wuhan Univ. J. Nat. Sci. 21, 351–357 (2016). https://doi.org/10.1007/s11859-016-1181-6
DOI: https://doi.org/10.1007/s11859-016-1181-6