Abstract
The H.264 standard achieves much higher coding efficiency than the MPEG-2 standard, due to its improved inter and intra prediction modes which come with a cost of higher computation complexity. Transcoding MPEG-2 video to H.264 is important to enable gradual migration to H.264. However, given the significant differences between the MPEG-2 and the H.264 coding algorithms, transcoding is much more complex and new approaches to transcoding are necessary. In this paper, we introduce and evaluate a low complexity macroblock partition mode decision algorithm, to be used as part of a high-efficient inter-frame prediction in MPEG-2 to H.264 transcoder. The proposed tools are used to compute an optimal MB coding mode decision with significantly reduced computational complexity. Specifically, we achieve the computational savings by using the following MB information coming from MPEG-2: the MB coding modes, the coded block pattern (CBPC) in MPEG-2, and the mean and variance of the 16 4 × 4 sub blocks of the MPEG-2 residual MBs. We use data mining algorithms to develop a decision tree for H.264 coding mode decisions. The decision trees are built using RD optimized mode decisions and result in highly efficient mode decisions, with significantly reduced computational complexity. The proposed transcoder is 35% faster than the RD optimized H.264 reference transcoder without a significant PSNR degradation (0.05 dB on average). The proposed transcoder performs over 0.4 dB better on average than the SAE cost based H.264 transcoding.
Similar content being viewed by others
References
Bialkowski J, Barkowsky M, Kaup A (2006) Overview of low-complexity video transcoding from H.263 to H.264. In: Proceeding of the International Conference on Multimedia & Expo ICME 2006, July
Bialkowski J, Barkowsky M, Leschka F, Kaup A (2006) Low-complexity transcoding of inter coded video frames from H.264 to H.263. In: Proceeding of the International Conference on Image Processing ICIP 2006, October
Bjontegaard G (2001) Calculation of average PSNR differences between RD-Curves. Presented at the 13th VCEG-M33 Meeting, Austin, TX, April
Chen C, Wu P-H, Chen H (2004) MPEG-2 to H.264 transcoding. Picture Coding Symposium, 15–17 Dec
Chen G, Zhang Y, Lin S, Dai F (2004) Efficient block size selection for MPEG-2 to H.264 transcoding. In: Proceedings of the 12th annual ACM Multimedia 2004, pp 300–303
Fernandez-Escribano G, Cuenca P, Orozco-Barbosa L, Garrido A (2005) A fast intra-frame prediction algorithm for MPEG-2/H.264 video transcoders. In: Proceeding of the International Conference on Image Processing ICIP 2005. Genoa, Italy, September, pp 684–687
Fernandez-Escribano G, Cuenca P, Orozco-Barbosa L, Garrido A (2005) Computacional complexity reduction of intra-frame prediction in MPEG-2/H.264 video transcoders. In: Proceeding of ICME 2005, Amsterdam, The Netherlands, July
Fernandez-Escribano G, Kalva H, Cuenca P, Orozco-Barbosa L (2006) A very low complexity MPEG-2 to H.264 transcoding using machine learning. In: Proceeding of the ACM Multimedia 2006, Santa Barbara (California), USA, October, pp 931–940
Gonzalez CA, Yeo H, Kuo CJ (1999) Requirements for motion-estimation search range in MPEG-2 coded video. IBM J Res Develop 43(4), July
Implementation Studies Group (2002) Main results of the AVC complexity analysis. MPEG Document N4964, ISO/IEC JTC11/SC29/WG11, July
ISO/IEC 13818-2 (1994) Generic coding of moving picture and associated audio. MPEG-2 International Standard
ITU-T Recommendation H.264 (2003) Advanced video coding for generic audiovisual services, May
Joint Video Team (JVT) (2006) of ISO/IEC MPEG and ITU-T VCEG, Reference Software to Committee Draft. JVT-F100 JM10.2. Available on-line at http://iphome.hhi.de/suehring/tml/
JVT Test Model Ad Hoc Group (2003) Evaluation sheet for motion estimation. Draft version 4, February
Kim S-E, Han J-K, Kim J-G (2005) Efficient motion estimation algorithm for MPEG-4 to H.264 transcoder. In: Proceeding of the International Conference on Image Processing ICIP 2005, Genoa, September
Lu X, Tourapis AM, Yin P, Boyce J (2005) Fast mode decision and motion estimation for H.264 with a focus on MPEG-2/H.264 transcoding. In: Proceedings of the IEEE International Symposium on Circuits and Systems ISCAS
Petljanski B, Kalva H (2006) DCT domain intra MB mode decision for MPEG-2 to H.264 transcoding. In: Proceedings of the International Conference on Consumer Electronics ICCE 2006. January, pp 419–420
Quinlan JR (1993) C4.5: programs for machine learning. Morgan Kaufmann
Su Y, Xin J, Vetro A, Sun H (2005) Efficient MPEG-2 to H.264/AVC intra transcoding in transform-domain. Proceedings of the IEEE International Symposium on Circuits and Systems ISCAS 2005, May, pp 1234–1237
Witten IH, Frank E (2005) Data mining: practical machine learning tools and techniques, 2nd edn. Morgan Kaufmann, San Francisco
Xin J, Vetro A, Sekiguchi S, Sugimoto K (2006) MPEG-2 to H.264/AVC transcoding for efficient storage of broadcast video bitstreams. In: International Conference on Consumer Electronics 2006, January, pp 417–418
Youn J, Sun M-T, Lin C-W (1999) Motion vector refinement for high-performance transcoding. IEEE Trans Multimedia 1(1):30–40, Mar
Zhou Z. Sun S, Lei S, Sun M-T (2005) Motion information and coding mode reuse for MPEG-2 to H.264 transcoding. In: Proceedings of the IEEE International Symposium on Circuits and Systems ISCAS 2005, 23–26 May, Vol. 2, pp 1230–1233
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Fernández-Escribano, G., Kalva, H., Cuenca, P. et al. A first approach to speeding-up the inter mode selection in MPEG-2/H.264 transcoders using machine learning. Multimed Tools Appl 35, 225–240 (2007). https://doi.org/10.1007/s11042-007-0124-9
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-007-0124-9