Abstract
In this paper, a novel joint coding scheme is proposed for 3D media content including stereo images and multiview-plus-depth (MVD) video for the purpose of depth information hiding. The depth information is an image or image channel which reveals the distance of scene objects’ surfaces from a viewpoint. With the concern of copyright protection, access control and coding efficiency for 3D content, we propose to hide the depth information into the texture image/video by a reversible watermarking algorithm called Quantized DCT Expansion (QDCTE). Considering the crucial importance of depth information for depth-image-based rendering (DIBR), full resolution depth image/video is compressed and embedded into the texture image/video, and it can be extracted without extra quality degradation other than compression itself. The reversibility of the proposed algorithm guarantees that texture image/video quality will not suffer from the watermarking process even if high payload (i.e. depth information) is embedded into the cover image/video. In order to control the size increase of watermarked image/video, the embedding function is carefully selected and the entropy coding process is also customized according to watermarking strength. Huffman and content-adaptive variable-length coding (CAVLC), which are respectively used for JPEG image and H.264 video entropy encoding, are analyzed and customized. After depth information embedding, we propose a new method to update the entropy codeword table with high efficiency and low computational complexity according to watermark embedding strength. By using our proposed coding scheme, the depth information can be hidden into the compressed texture image/video with little bitstream size overhead while the quality degradation of original cover image/video from watermarking can be completely removed at the receiver side.
Similar content being viewed by others
References
Alattar AM (2004) Reversible watermark using the difference expansion of a generalized integer transform. IEEE Trans Image Process 13(8):1147–1156
Bal C, Nguyen T (2014) Multiview video plus depth coding with depth-based prediction mode. IEEE Trans Circuits Syst Video Technol 24(6):995–1005
Coltuc D, Caciula I (2009) On stereo embedding by reversible watermarking: further results. In: International symposium on signals, circuits and systems, Iasi, pp 121–124
Coltuc D, Caciula I, Coanda H (2010) Color stereo embedding by reversible watermarking. In: International symposium on electrical and electronics engineering, Galati, pp 256–259
Cox IJ, Miller ML, Bloom JA (2001) Digital watermarking. Morgan Kaufmann, San Francisco
Daribo I, Tillier C, Pesquet-Popescu B (2009) Motion vector sharing and bitrate allocation for 3D video-plus-depth coding. EURASIP J Adv Sig Process 2009(3):1–13
Ekmekcioglu E, Mrak M, Worral S, Kondoz A (2009) Utilizatioin of edge adaptive upsampling in compression of depth map videos for enhanced free-viewpoint rendering. In: Proceedings of IEEE international conference on image processing, pp 733–736
Ellinas JN (2009) Reversible watermarking on stereo image sequences. Int J Signal Process 5(3):210–215
Fehn C, Kauff P, Op de Beeck M, Ernst F, Jsselsteijin WI, Pollefeys M, Van Gool L, Ofek E, Sexton I (2002) An evolutionary and optimised approach on 3D-TV. In: Proceedings of International Broadcast Conference, pp 357–365
Grewatsch S, Muller E (2004) Sharing of motion vectors in 3D video coding. In: Proceedings of IEEE International Conference on Image Processing (ICIP 2004), vol 5, pp 3271–3274
ISO/IEC Moving Picture Experts Group (2003) ISO/IEC 14496-10:2003
Khan A, Mahmood MT, Ali A, Usman I, Choi T-S (2009) Hiding depth map of an object in its 2D image: reversible watermarking for 3D cameras. In: Proceedings of IEEE international conference on consumer electronics, Vegas, pp 1–2
Lam EY, Goodman JW (2000) A mathematical analysis of the DCT coefficient distributions for images. IEEE Trans Image Process 9:1661–1666
Lin YH, Wu JL (2011) A depth information based fast mode decision algorithm for color plus depth-map 3D videos. IEEE Trans Circuits Syst Video Technol 57(2):542–550
Liu Q, Yang Y, Ji R, Gao Y, Yu L (2012) Cross-view down up sampling method for multiview depth video coding. IEEE Signal Process Lett 19(5):295–298
Maitre M, Do MN (2008) Joint encoding of the depth image based representation using shape-adaptive wavelets. In: IEEE international conference on image processing (ICIP 2008), San Diego, pp 1768–1771
McMillan L (1997) An image-based approach to three-dimensional computer graphics. PhD thesis, University of North Carolina, Chapel Hill
Middlebury benchmark. http://vision.middlebury.edu/stereo/
Mobile 3D-TV. http://sp.cs.tut.fi/mobile3dtv/video-plus-depth/
Morvan Y, de With PHN, Farin D (2006) Platelet-based coding of depth maps for the transmission of multi-view images. In: Proceedings of SPIE, stereoscopic displays and applications, San Jose, pp 93–100
Morvan Y, Farin D, de With PHN (2007) Depth-image compression based on an R-D optimized quadtree decomposition for the transmission of multi-view images. In: IEEE International Conference on Image Processing (ICIP 2007), San Antonio, pp 105–108
Oh BT, Park DS (2011) Depth map coding based on synthesized view distortion function. IEEE Jounal of Selected Topics in Signal Processing 5(7):1344–1352
Oh KJ, Vetro A, Ho YS (2011). Depth coding using a boudary reconstruction filter for 3D video systems. IEEE Trans Circuits Syst Video Technol 21(3):350–359
Oh KJ, Yea S, Vetro A, Ho YS (2009) Depth reconstruction filter and down/up sampling for depth coding in 3-D video. IEEE Signal Process Lett 16(9):747–750
Scharstein D, Szeliski R (2002) A taxonomy and evaluation of dense two-frame stereo correspondence algorithms. Int J Comput Vis 47:7–42
Shahid Z, Chaumont M, Puech W (2011) Considering the reconstruction loop for data hiding of intra and inter frames of H.264/AVC. Springer: SIViP 5(2):75–93
Shahid Z, Puech W (2011) Synchronization of texture and depth map by data hiding for 3D H.264 video. In: IEEE international conference on image processing, pp 2773–2776
Shannon CE (2001) A mathematical theory of communication. ACM SIGMOBILE Mobile Computing and Communications Review 5.1
Shao F, Jiang G, Yu M, Chen K, Ho Y (2012) Asymmetric coding of multi-view video plus depth based 3-D video for view rendering. IEEE Trans Multimedia 14:157–167
Smolic A, Kauff P (2005) Interactive 3D video representation and coding technologies. Proceedings of IEEE, special issue on advances in video coding and delivery 93(1):98–110
Smolic A, Muller K, Stefanoski N, Ostermann J, Gotchev A, Akar GB, Triantafyllidis G, Koz A (2007) Coding algorithms for 3D-TV–a survey. IEEE Trans Circuits Syst Video Technol 17(11):1606–1621
Tian J (2003) Reversible data embedding using a difference expansion. IEEE Trans Circuits Syst Video 13:890–896
Vetro A, Wiegand T, Sullivan GJ (2011) Overview of the stereo and multiview video coding extensions of the H.264/MPEG-4 AVC standard. Proceedings of IEEE 99(4):626–642
Yang Y, Dai Q, Jiang G, Ho Y-S (2010) Comparative interactivity analysis in multiview video coding schemes. ETRI J 32(4):566–576
Zhang Y, Kwong S, Xu L, Hu S, Jiang G, Kuo CCJ (2013) Regional bit allocation and rate distortion optimization for multiview depth video coding with view synthesis distortion model. IEEE Trans Image Process 22(9):3497–3512
Author information
Authors and Affiliations
Corresponding author
Appendices
Appendix A: Derivation of Laplacian PDF in discrete domain
Given the continuous Laplacian PDF as shown in (2), the discrete Laplacian PDF is derived as follows.
where k is the discrete coefficient value, p is the Laplacian PDF in continuous domain, \(m=\exp (-\frac {1}{\sigma })\).
Appendix B: Entropy calculation for original and watermarked QDCT coefficients
Before watermarking, original QDCT coefficients obey the discrete Laplacian distribution as shown in (3). The entropy of these coefficients can be calculated as follows.
where k is the discrete coefficient value before watermarking, \(m=\exp (-\frac {1}{\sigma })\).
After watermarking, the original PDF are stretched. And the even values and odd values still obey discrete Laplacian distribution respectively. If we denote
where k is the discrete coefficient value before watermarking, s g n(x) = x/|x| is the sign function, \(m=\exp (-\frac {1}{\sigma })\).
The entropy of the watermarked QDCT coefficients is calculated as follows.
Rights and permissions
About this article
Cite this article
Wang, W., Zhao, J. Hiding depth information in compressed 2D image/video using reversible watermarking. Multimed Tools Appl 75, 4285–4303 (2016). https://doi.org/10.1007/s11042-015-2475-y
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-015-2475-y