Skip to main content
Log in

Efficient fast mode decision using mode complexity for multi-view video coding

  • Published:
Journal of Central South University Aims and scope Submit manuscript

Abstract

The variable block-size motion estimation (ME) and disparity estimation (DE) are adopted in multi-view video coding (MVC) to achieve high coding efficiency. However, much higher computational complexity is also introduced in coding system, which hinders practical application of MVC. An efficient fast mode decision method using mode complexity is proposed to reduce the computational complexity. In the proposed method, mode complexity is firstly computed by using the spatial, temporal and inter-view correlation between the current macroblock (MB) and its neighboring MBs. Based on the observation that direct mode is highly possible to be the optimal mode, mode complexity is always checked in advance whether it is below a predefined threshold for providing an efficient early termination opportunity. If this early termination condition is not met, three mode types for the MBs are classified according to the value of mode complexity, i.e., simple mode, medium mode and complex mode, to speed up the encoding process by reducing the number of the variable block modes required to be checked. Furthermore, for simple and medium mode region, the rate distortion (RD) cost of mode 16×16 in the temporal prediction direction is compared with that of the disparity prediction direction, to determine in advance whether the optimal prediction direction is in the temporal prediction direction or not, for skipping unnecessary disparity estimation. Experimental results show that the proposed method is able to significantly reduce the computational load by 78.79% and the total bit rate by 0.07% on average, while only incurring a negligible loss of PSNR (about 0.04 dB on average), compared with the full mode decision (FMD) in the reference software of MVC.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. TANIMOTO M, TEHRANI M P, FUJII T, YENDO T. Free-viewpoint TV [J]. IEEE Signal Processing Magazine, 2011, 28(1): 67–76.

    Article  Google Scholar 

  2. MULLER K, MERKLE P, WIEGEND T. 3-D video representation using depth maps [J]. Proceedings of the IEEE, 2011, 99(4): 643–656.

    Article  Google Scholar 

  3. ISO/IEC 14496-10: 2008/FDAM 1:2008(E). Information technology-Coding of audio-visual objects-part 10: Advanced video coding, Amendment 1: multiview video coding [S].

  4. VETRO A, WIEGAND T, SULLIVAN G J. Overview of the stereo and multiview video coding extensions of the H.264/MPEG-4 AVC standard [J]. Proceedings of the IEEE, 2011, 99(4): 626–642.

    Article  Google Scholar 

  5. DING Li-fu, TSUNG Pei-kuei, CHIEN Shao-yi, CHEN Wei-yin, CHEN Liang-gee. Content-aware prediction algorithm with inter-view mode decision for multiview video coding [J]. IEEE Transactions on Multimedia, 2008, 10(8): 1553–1563.

    Article  Google Scholar 

  6. HUO Jun-yan, CHANG Yi-lin, LI Ming, MA Yan-zhuo. Scalable prediction structure for multiview video coding [C]// IEEE International Symposium on Circuits and Systems. Piscataway: IEEE, 2009: 2593–2596.

    Google Scholar 

  7. CHAN C C, LIN J P, TANG C W. On-line statistical analysis based fast mode decision for multi-view video coding [C]// Picture Coding Symposium. Piscataway: IEEE, 2010: 478–481.

    Chapter  Google Scholar 

  8. SHEN Li-quan, YAN Tao, LIU Zhi, ZHANG Zhao-yang, AN Ping, YANG Lei. Fast mode decision for multiview video coding [C]// IEEE International Conference on Image Processing. Piscataway: IEEE, 2009: 2953–2956.

    Google Scholar 

  9. ZENG Huan-qiang, MA Kai-kuang, CAI Can-hui. Mode-correlation-based early termination mode decision for multi-view video coding [C]// IEEE International Conference on Image Processing. Piscataway: IEEE, 2010: 3405–3408.

    Google Scholar 

  10. SEO J, SOHN K. Early disparity estimation skipping for multi-view video coding [J]. EURASIP Journal on Wireless Communications and Networking, 2012, 2012(1): 1–12.

    Article  Google Scholar 

  11. MERKLE P, SMOLIC A, MULLER K, WIEGAND T. Efficient prediction structure for multiview video coding [J]. IEEE Transactions on Circuits and Systems for Video Technology, 2007, 17(11): 1461–1473.

    Article  Google Scholar 

  12. SULLIVAN G, WIEGAND T. Rate-distortion optimization for video compression [J]. IEEE Signal Processing Magazine, 1998, 15(6): 74–90.

    Article  Google Scholar 

  13. WIEGAND T, SCHWARZ H, JOCH A, KOSSENTINI F, SULLIVAN G J. Rate-constrained coder control and comparison of video coding standards [J]. IEEE Transactions on Circuits and Systems for Video Technology, 2003, 13(7): 688–703.

    Article  Google Scholar 

  14. CHOI I, LEE J, JEON B. Fast coding mode selection with rate-distortion optimization for MPEG-4 part-10 AVC/H.264 [J]. IEEE Transactions on Circuits and Systems for Video Technology, 2006, 16(12): 1557–1561.

    Article  Google Scholar 

  15. ZENG Huan-qiang, CAI Can-hui, MA Kai-kuang. Fast mode decision for H.264/AVC based on macroblock motion activity [J]. IEEE Transactions on Circuits and Systems for Video Technology, 2009, 19(4): 491–499.

    Article  Google Scholar 

  16. KOO H S, JEON Y J, JEON B M. MVC motion skip mode, document JVT-W081 [R]. San Jose: JVT, 2007.

    Google Scholar 

  17. WANG Feng-sui, ZENG Huan-qiang, SHEN Qing-hong, DU Si-dan. Efficient early direct mode decision for multi-view video coding [J]. Signal Processing-Image Communication, 2013, 28(7): 736–744.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Qing-hong Shen  (沈庆宏) or Si-dan Du  (都思丹).

Additional information

Foundation item: Project(08Y29-7) supported by the Transportation Science and Research Program of Jiangsu Province, China; Project(201103051) supported by the Major Infrastructure Program of the Health Monitoring System Hardware Platform Based on Sensor Network Node, China; Project(61100111) supported by the National Natural Science Foundation of China; Project(BE2011169) supported by the Scientific and Technical Supporting Program of Jiangsu Province, China

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Wang, Fs., Shen, Qh. & Du, Sd. Efficient fast mode decision using mode complexity for multi-view video coding. J. Cent. South Univ. 21, 4244–4253 (2014). https://doi.org/10.1007/s11771-014-2421-6

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11771-014-2421-6

Key words

Navigation