SSIM-Based Joint Bit-Allocation Using Frame Model Parameters for 3D Video Coding

Conference paper
Part of the Communications in Computer and Information Science book series (CCIS, volume 841)


Optimum bit-allocation between texture video and depth map in 3D video results in better virtual view quality. To incorporate this, rate distortion optimization (RDO) property is used. The RDO in 3D video implies minimization of synthesis distortion at available rate. Several bit-allocation methods proposed in literature have not considered perceptual quality improvement. In this paper, we propose bit-allocation criteria that results in better visual quality of synthesized view. To achieve this, visual quality metrics are to be incorporated and structural similarity (SSIM) index is one of the metric that measures perceived quality. As SSIM gives similarity measure, we used dSSIM as distortion metric in mode decision and motion estimation instead of traditional metrics like mean square error (MSE) or sum of squared error (SSE). Synthesis distortion is modeled using dSSIM and joint bit-allocation is formulated as optimization problem that is solved using Lagrange multiplier method. Model parameters are determined at frame level for more accurate calculation of quantization parameters. BD-Rate evaluation shows a reduction in bit rate with improved SSIM.


  1. 1.
    Fehn, C.: A 3D-TV approach using depth-image-based rendering (DIBR). In: Proceedings of VIIP, vol. 3 (2003)Google Scholar
  2. 2.
    Fehn, C.: Depth-image-based rendering (DIBR), compression, and transmission for a new approach on 3D-TV. In: Electronic Imaging 2004, pp. 93–104. International Society for Optics and Photonics (2004)Google Scholar
  3. 3.
    Yuan, H., Chang, Y., Li, M., Yang, F.: Model based bit allocation between texture images and depth maps. In: International Conference On Computer and Communication Technologies in Agriculture Engineering (CCTAE), vol. 3, pp. 380–383. IEEE (2010)Google Scholar
  4. 4.
    Yuan, H.H., Chang, Y., Huo, J., Yang, F., Lu, Z.: Model-based joint bit allocation between texture videos and depth maps for 3-D video coding. IEEE Trans. Circ. Syst. Video Technol. 21(4), 485–497 (2011)CrossRefGoogle Scholar
  5. 5.
    Zhu, G., Jiang, G., Yu, M., Li, F., Shao, F., Peng, Z.: Joint video/depth bit allocation for 3D video coding based on distortion of synthesized view. In: IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB), pp. 1–6. IEEE (2012)Google Scholar
  6. 6.
    Shao, F., Jiang, G., Lin, W., Yu, M., Dai, Q.: Joint bit allocation and rate control for coding multi-view video plus depth based 3D video. IEEE Trans. Multimedia 15(8), 1843–1854 (2013)CrossRefGoogle Scholar
  7. 7.
    Yang, C., An, P., Shen, L.: Adaptive bit allocation for 3D video coding. Circ. Syst. Sig. Process. 36, 1–23 (2016)Google Scholar
  8. 8.
    Mai, Z.-Y., Yang, C.-L., Po, L.-M., Xie, S.-L.: A new rate-distortion optimization using structural information in H.264 I-frame encoder. In: Blanc-Talon, J., Philips, W., Popescu, D., Scheunders, P. (eds.) ACIVS 2005. LNCS, vol. 3708, pp. 435–441. Springer, Heidelberg (2005). Scholar
  9. 9.
    Huang, Y.-H., Ou, T.-S., Chen, H.H.: Perceptual-based coding mode decision. In: Proceedings of IEEE International Symposium on Circuits and Systems (ISCAS), pp. 393–396 (2010)Google Scholar
  10. 10.
    Chen, Z., Lin, W., Ngan, K.N.: Perceptual video coding: challenges and approaches. In: Proceedings of IEEE International Conference on Multimedia and Expo (ICME), pp. 784–789 (2010)Google Scholar
  11. 11.
    Huang, Y.-H., Ou, T.-S., Su, P.-Y., Chen, H.H.: Perceptual rate-distortion optimization using structural similarity index as quality metric. IEEE Trans. Circ. Syst. Video Technol. 20(11), 1614–1624 (2010)CrossRefGoogle Scholar
  12. 12.
    Cui, Z., Gan, Z., Zhu, X.: Structural similarity optimal MB layer rate control for H. 264. In: Proceedings of IEEE International Conference on Wireless Communications and Signal Processing (WCSP), pp. 1–5 (2011)Google Scholar
  13. 13.
    Yeo, C., Tan, H.L., Tan, Y.H.: On rate distortion optimization using SSIM. IEEE Trans. Circ. Syst. Video Technol. 23(7), 1170–1181 (2013)CrossRefGoogle Scholar
  14. 14.
    Zhao, T., Wang, J., Wang, Z., Chen, C.W.: SSIM-based coarse-grain scalable video coding. IEEE Trans. Broadcast. 61(2), 210–221 (2015)CrossRefGoogle Scholar
  15. 15.
    Wang, Z., Bovik, A.C., Sheikh, H.R., Simoncelli, E.P.: Image quality assessment: from error visibility to structural similarity. IEEE Trans. Image Process. 13(4), 600–612 (2004)CrossRefGoogle Scholar
  16. 16.
    Harshalatha, Y., Biswas, P.K.: Rate distortion optimization using SSIM for 3D video coding. In: 23rd International Conference on Pattern Recognition (ICPR), pp. 1261–1266. IEEE (2016)Google Scholar
  17. 17.
    Chen, H.H., Huang, Y.-H., Su, P.-Y., Ou, T.-S.: Improving video coding quality by perceptual rate-distortion optimization. In: IEEE International Conference on Multimedia and Expo (ICME), pp. 1287–1292 (2010)Google Scholar
  18. 18.
    3DV-ATM Reference Software 3DV-ATMv5.lr2. Accessed 06 Jan 2017
  19. 19.
    View Synthesis Reference Software VSRS3.5. Accessed 06 Jan 2017
  20. 20.
    Fujii Laborotory, Nagoya University. -data/. Accessed 06 Jan 2017
  21. 21.
    Zitnick, C.L., Kang, S.B., Uyttendaele, M., Winder, S., Szeliski, R.: High-quality video view interpolation using a layered representation. In: ACM Transactions on Graphics (TOG), vol. 23, no. 3, pp. 600–608. ACM (2004)CrossRefGoogle Scholar
  22. 22.
    Bjontegaard, G.: Calculation of Average PSNR Differences Between RD - curves. ITU-TQ.6/SG16 VCEG 13th Meeting.
  23. 23.
    Harshalatha, Y., Biswas, P.K.: SSIM-based joint-bit allocation for 3D video coding. Multimedia Tools Appl. (2017, in Press).

Copyright information

© Springer Nature Singapore Pte Ltd. 2018

Authors and Affiliations

  1. 1.Indian Institute of Technology KharagpurKharagpurIndia

Personalised recommendations