Fast QTMT decision tree for Versatile Video Coding based on deep neural network

Abdallah, Bouthaina; Belghith, Fatma; Ben Ayed, Mohamed Ali; Masmoudi, Nouri

doi:10.1007/s11042-022-13479-7

Fast QTMT decision tree for Versatile Video Coding based on deep neural network

1221: Deep Learning for Image/Video Compression and Visual Quality Assessment
Published: 09 August 2022

Volume 81, pages 42731–42747, (2022)
Cite this article

Multimedia Tools and Applications Aims and scope Submit manuscript

Bouthaina Abdallah ORCID: orcid.org/0000-0002-1400-9431¹,
Fatma Belghith¹,
Mohamed Ali Ben Ayed² &
…
Nouri Masmoudi¹

297 Accesses
6 Citations
1 Altmetric
Explore all metrics

Abstract

Versatile Video Coding (VVC), the emerging video coding standard, outperforms the coding efficiency of the previous standard named High Efficiency Video Coding (HEVC) at the cost of an encoding complexity increase. In fact, VVC proposes a new partitioning block structure called quadtree with nested multi-type tree (QTMT) that introduces a more flexible partition shape compared to the previous splitting algorithms namely quadtree plus binary tree (QTBT) and quadtree (QT) structures adopted in HEVC. However, QTMT increases the encoding time due to the rate-distortion cost (RDcost) process. In order to overcome this issue, this paper proposes a fast intra partitioning algorithm based on a Deep Learning (DL) approach using a Convolution Neural Network (CNN). First, a fast QTMT partition algorithm based on a CNN-binary tree horizontal (CNN-BTH) network is developed to predict the BTH mode decision at 32×32 Coding Units (CUs). The BTV decision tree algorithm is also predicted at this level by a CNN-binary tree vertical (CNN-BTV). Then, two algorithms are combined to suggest a new fast intra QTMT decision tree algorithm. Compared to the VVC reference software VTM-3.0, the proposed overall intra QTMT partition approach reaches a significant complexity reduction down to 37% compared to the original software VTM-3.0, and an average of 31% in terms of encoding time saving with a slight loss in coding performance.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Low-complexity QTMT partition based on deep neural network for Versatile Video Coding

Article 19 January 2021

CNN-based ternary tree partition approach for VVC intra-QTMT coding

Article 16 February 2024

Fast intra-coding unit partition decision in H.266/FVC based on deep learning

Article 09 July 2020

References

Abdallah B, Belghith F, Ayed MAB, Masmoudi N (2021) Low-complexity qtmt partition based on deep neural network for versatile video coding. SIViP:1–8
Amestoy T, Mercat A, Hamidouche W, Menard D, Bergeron C (2019) Tunable vvc frame partitioning based on lightweight machine learning. IEEE Trans Image Process 29:1313–1328
Article MathSciNet MATH Google Scholar
Bjøntegaard G (2001) Calculation of average psnr differences between rd-curves (vceg-m33). In: VCEG meeting (ITU-T SG 16 Q. 6), pp 2–4
Bossen F, Boyce J, Li X, Seregin V, Sühring K (2018) Jvet common test conditions and software reference configurations for sdr video. Joint Video Experts Team (JVET) of ITU-T SG, vol 16
Cao J, Tang N, Wang J, Liang F (2020) Texture-based fast cu size decision and intra mode decision algorithm for vvc. In: International conference on multimedia modeling. Springer, pp 739–751
Chang CY, Srinivasan K, Wang WC, Ganapathy GP, Vincent DR, Deepa N (2020) Quality assessment of tire shearography images via ensemble hybrid faster region-based convnets. Electronics 9(1):45
Article Google Scholar
Fan Y, Chen J, Sun H, Katto J, Jing M (2020) A fast qtmt partition decision strategy for vvc intra prediction. IEEE Access 8:107900–107911. https://doi.org/10.1109/ACCESS.2020.3000565
Article Google Scholar
Fu T, Zhang H, Mu F, Chen H (2019) Fast cu partitioning algorithm for h. 266/vvc intra-frame coding. In: 2019 IEEE International conference on multimedia and expo (ICME). IEEE, pp 55–60
Fu T, Zhang H, Mu F, Chen H (2019) Fast cu partitioning algorithm for h.266/vvc intra-frame coding. In: 2019 IEEE International conference on multimedia and expo (ICME), pp 55–60. https://doi.org/10.1109/ICME.2019.00018
Jin Z, An P, Yang C, Shen L (2018) Fast qtbt partition algorithm for intra frame coding through convolutional neural network. IEEE Access 6:54660–54673
Article Google Scholar
Kibeya H, Belghith F, Ayed MAB, Masmoudi N (2016) Fast coding unit selection and motion estimation algorithm based on early detection of zero block quantified transform coefficients for high-efficiency video coding standard. IET Image Process 10(5):371–380
Article Google Scholar
Kibeya H, Belghith F, Ben Ayed MA, Masmoudi N (2016) Fast intra-prediction algorithms for high efficiency video coding standard. J Electr Imaging vol 25(1)
Kim S, Jun D, Kim BG, Beack S, Lee M, Lee T (2021) Two-dimensional audio compression method using video coding schemes. Electronics 10 (9):1094
Article Google Scholar
Li T, Xu M, Tang R (2020) Deepqtmt: a deep learning approach for fast qtmt-based cu partition of intra-mode vvc. arXiv:2006.13125
Liu X, Li Y, Liu D, Wang P, Yang LT (2017) An adaptive cu size decision algorithm for hevc intra prediction based on complexity classification using machine learning. IEEE Trans Circuits Syst Video Technol 29(1):144–155
Article Google Scholar
Liu Z, Yu X, Gao Y, Chen S, Ji X, Wang D (2016) Cu partition mode decision for hevc hardwired intra encoder using convolution neural network. IEEE Trans Image Process 25(11):5088–5103
Article MathSciNet MATH Google Scholar
Park SH, Kang JW (2019) Context-based ternary tree decision method in versatile video coding for fast intra coding. IEEE Access 7:172597–172605
Article Google Scholar
Sidaty N, Hamidouche W, Deforges O, Philippe P (2017) Emerging video coding performance: 4k quality monitoring. In: 2017 Ninth international conference on quality of multimedia experience (qoMEX). IEEE, pp 1–3
Tang G, Jing M, Zeng X, Fan Y (2019) Adaptive cu split decision with pooling-variable cnn for vvc intra encoding. In: 2019 IEEE visual communications and image processing (VCIP), pp 1–4, DOI 10.1109/VCIP47243.2019.8965679, (to appear in print)
VVC Test Model (VTM) vesion (2018) VVC Test Model (VTM) vesion 3.0: [online] available. https://vcgit.hhi.fraunhofer.de/jvet/VVCSoftwareVTM/tree/VTM-3.0 (December 2018)
Wang Z, Wang S, Zhang J, Wang S, Ma S (2017) Effective quadtree plus binary tree block partition decision for future video coding. In: 2017 Data compression conference (DCC), pp 23–32. https://doi.org/10.1109/DCC.2017.70
Wang Z, Wang S, Zhang X, Wang S, Ma S (2018) Fast qtbt partitioning decision for interframe coding with convolution neural network. In: 2018 25th IEEE international conference on image processing (ICIP). IEEE, pp 2550–2554
Yang H, Shen L, Dong X, Ding Q, An P, Jiang G (2019) Low complexity ctu partition structure decision and fast intra mode decision for versatile video coding. IEEE Trans Circuits Syst Video Technol
Yang H, Shen L, Dong X, Ding Q, An P, Jiang G (2020) Low-complexity ctu partition structure decision and fast intra mode decision for versatile video coding. IEEE Trans Circuits Syst Video Technol 30:1668–1682
Article Google Scholar
Zhong G, Wang J, Hu J, Liang F (2021) A gan-based video intra coding. Electronics 10(2):132
Article Google Scholar

Download references

Author information

Authors and Affiliations

Electronics and Information Technology Laboratory, National Engineering School of Sfax, University of Sfax, Sfax, 3035, Tunisie
Bouthaina Abdallah, Fatma Belghith & Nouri Masmoudi
New Technologies and Telecom Systems Laboratory (NTS’COM), Sfax National School of Electronics and Communication (ENET’COM), University of Sfax, Sfax, Tunisia
Mohamed Ali Ben Ayed

Authors

Bouthaina Abdallah
View author publications
You can also search for this author in PubMed Google Scholar
Fatma Belghith
View author publications
You can also search for this author in PubMed Google Scholar
Mohamed Ali Ben Ayed
View author publications
You can also search for this author in PubMed Google Scholar
Nouri Masmoudi
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Bouthaina Abdallah.

Ethics declarations

Competing interests

The authors have no relevant financial or non-financial interests to disclose..

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Abdallah, B., Belghith, F., Ben Ayed, M.A. et al. Fast QTMT decision tree for Versatile Video Coding based on deep neural network. Multimed Tools Appl 81, 42731–42747 (2022). https://doi.org/10.1007/s11042-022-13479-7

Download citation

Received: 02 June 2021
Revised: 21 February 2022
Accepted: 13 July 2022
Published: 09 August 2022
Issue Date: December 2022
DOI: https://doi.org/10.1007/s11042-022-13479-7

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fast QTMT decision tree for Versatile Video Coding based on deep neural network

Abstract

Access this article

Similar content being viewed by others

Low-complexity QTMT partition based on deep neural network for Versatile Video Coding

CNN-based ternary tree partition approach for VVC intra-QTMT coding

Fast intra-coding unit partition decision in H.266/FVC based on deep learning

References

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Fast QTMT decision tree for Versatile Video Coding based on deep neural network

Abstract

Access this article

Similar content being viewed by others

Low-complexity QTMT partition based on deep neural network for Versatile Video Coding

CNN-based ternary tree partition approach for VVC intra-QTMT coding

Fast intra-coding unit partition decision in H.266/FVC based on deep learning

References

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation