Skip to main content
Log in

A depth video coding in-loop median filter based on joint weighted sparse representation

  • Physics and Electronic Information Technology
  • Published:
Wuhan University Journal of Natural Sciences

Abstract

The existing depth video coding algorithms are generally based on in-loop depth filters, whose performance are unstable and easily affected by the outliers. In this paper, we design a joint weighted sparse representation-based median filter as the in-loop filter in depth video codec. It constructs depth candidate set which contains relevant neighboring depth pixel based on depth and intensity similarity weighted sparse coding, then the median operation is performed on this set to select a neighboring depth pixel as the result of the filtering. The experimental results indicate that the depth bitrate is reduced by about 9% compared with anchor method. It is confirmed that the proposed method is more effective in reducing the required depth bitrates for a given synthesis quality level.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Similar content being viewed by others

References

  1. Smolic A, Mueller K, Merkle P. Multi-view Video plus Depth (MVD) Format for Advanced 3D Video System[C]//23rd ISO/IEC JTC1/SC29/WG11 and ITU-T SG16, San Jose: ISO/IEC and ITU-T Press, 2007, 6: 21–27.

    Google Scholar 

  2. Woo S K, Antonio O. Depth map distortion analysis for view rendering and depth coding[C]//Proc International Conf Image Processing. Cairo: IEEE Press, 2009: 721–724.

    Google Scholar 

  3. ITU-T.MVC extension for inclusion of depth maps draft text 4(JCT3V-A1001) [S]. Geneva: ITU-T Press, 2012.

  4. Hannuksela M, Rusanovskyy D, Su W, et al. Multiview-videoplus-depth coding based on the advanced video coding standard [J]. IEEE Trans Image Processing, 2013, 22(9): 3449–3458.

    Article  Google Scholar 

  5. Kwan J O, Vetro A, Ho S Y. Depth coding using a boundary reconstruction filter for 3D video systems [J]. IEEE Trans Circuits and Systems for Video Technology, 2011, 21(3): 350–359.

    Article  Google Scholar 

  6. Liu S J, Lai P L. New depth coding techniques with utilization of corresponding video [J]. IEEE Trans Broadcasting, 2011, 57(2): 551–561.

    Article  Google Scholar 

  7. Lim I, Lee J. Adaptive nonlocal range filter in depth map coding [C] // International Conf Image Processing. Florida: IEEE Press, 2012: 1285–1288.

    Google Scholar 

  8. Huang K, Aviyente S. Sparse Representation for Signal Classification [C] // Neural Information Processing Systems. Vancouver: MIT Press, 2006: 609–616.

    Google Scholar 

  9. Huang Y, Huang K Q. Salient Coding for Image Classification [C] // International Conf Computer Vision and Pattern Recognition. Colorado: IEEE Press, 2011: 1753–1760.

    Google Scholar 

  10. Wang J, Yang J, Yu K, et al. Locality-constrained Linear Coding for Image Classification [C] // International Conf. Computer Vision and Pattern Recognition. San Francisco: IEEE Press, 2010: 3360–3367.

    Google Scholar 

  11. Liu J, Ji S, Ye J. SLEP: Sparse learning with efficient projections [D]. Phoenix: Arizona State University, 2010.

    Google Scholar 

  12. Ohm J R. Overview of 3D video coding standardization[C]// Three Dimensional Systems and Applications. Osaka: IEEE Press, 2013: 1–4.

    Google Scholar 

  13. Chen Y, Pandit P, Yea S, et al. Draft Reference Software for MVC[S]. ISO/IEC JTC1/SC29/WG11 and ITU-T SG16 Q.6, Doc. JVT-AE207, London: ISO/IEC and ITU-T Press, 2009.

  14. Zitnick C L, Kang S B, Uyttendaele M, et al. High-quality video view interpolation using a layered representation [C] // ACM SIGGRAPH. Los Angeles: ACM Press, 2004: 600–608.

    Google Scholar 

  15. Mori Y, Fukushima N, Fujii T, et al. View generation with 3D warping using depth information for FTV[C]//3DTV Conference: The True Vision-Capture, Transmission and Display of 3D Video. Istanbul: IEEE Press, 2008: 229–232.

    Chapter  Google Scholar 

  16. ITU-T. Calculation of average PSNR differences between RD-curves(VCEG-M33)[S]. Geneva: ITU-T Press, 2001.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Haitao Lü.

Additional information

Foundation item: Supported by the National Natural Science Foundation of China (61462048)

Biography: LÜ Haitao, male, Associate professor, research direction: multimedia communications.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Lü, H., Yin, C., Cui, Z. et al. A depth video coding in-loop median filter based on joint weighted sparse representation. Wuhan Univ. J. Nat. Sci. 21, 351–357 (2016). https://doi.org/10.1007/s11859-016-1181-6

Download citation

  • Received:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11859-016-1181-6

Keywords

CLC number

Navigation