Abstract
Depth image-based rendering techniques for multiview applications have recently been introduced for efficient view generation at arbitrary camera positions. Rate control in the encoder must therefore consider both texture and depth data. However, because depth and texture data have different structures and play different roles in the rendered views, allocating the available bit budget between them requires careful analysis. Information loss due to texture coding affects the value of pixels in synthesized views, while errors in depth information lead to a shift in objects or to unexpected patterns at their boundaries. In this paper, we address the problem of efficient bit allocation between texture and depth data of multiview sequences. We adopt a rate-distortion framework based on a simplified model of depth and texture images that preserves their main features. Unlike most recent solutions, our method avoids rendering at encoding time for distortion estimation, so the encoding complexity stays low. In addition, our model is independent of the underlying inpainting method used at the decoder for filling holes in the synthetic views. Extensive experiments validate our theoretical results and confirm the efficiency of our rate allocation strategy.
Notes
The extension of our analysis to scenes with \(C^{\alpha }\) regular surfaces is straightforward.
In this paper, we consider the \(\ell _{2}\) distortion. However, extensions to other error norms are straightforward.
Acknowledgments
This work has been partially supported by the Iran Ministry of Science, Research and Technology and by the Swiss National Science Foundation under grant 200021_126894.
Cite this article
Rajaei, B., Maugey, T., Pourreza, HR. et al. Rate-distortion analysis of multiview coding in a DIBR framework. Ann. Telecommun. 68, 627–640 (2013). https://doi.org/10.1007/s12243-013-0375-6
DOI: https://doi.org/10.1007/s12243-013-0375-6