Abstract
In July 2020, the Joint Video Experts Team has published the versatile video coding (VVC) standard. The VVC encoder enhances the coding efficiency compared with his predecessor high-efficiency video coding encoder, thanks to the improved coding modules and the new proposed techniques such as the new block partitioning structure called quadtree with nested multi-type tree (QTMT). However, QTMT induces a significant increase in encoding time mainly at the rate distortion optimization level (RDO) which causes an enormous computational complexity. Instead of RDO-QTMT partition process, a deep-QTMT partition approach based on a fast convolution neural network-ternary tree (CNN-TT) is proposed to predict the best intra-QTMT decision tree in order to reduce the encoding time. A database is initially established containing CU-based TT partition depths with several video contents. Then, a CNN-TT model is developed under three-levels provided by the TT structure to early determine the QTMT partition at 32\(\times \)32. Different threshold values are fixed for each level according to the CNN-TT predicted probabilities to reach a balance between the encoding complexity and the coding efficiency. The experimental results prove that our deep-QTMT partition approach saves a significant encoder time on average between 23% and 58% with an acceptable RD performance.
Similar content being viewed by others
Data availability
Please contact the author for data requests.
References
Javaid, S., Rizvi, S., Ubaid, M.T., et al.: Vvc/h. 266 intra mode qtmt based cu partition using cnn. IEEE Access 10, 37246–37256 (2022)
Tech, G., Pfaff, J., Schwarz, H., et al.: “Fast partitioning for vvc intra-picture encoding with a cnn minimizing the rate-distortion-time cost,” in Data Compression Conference (DCC), 3–12, IEEE (2021)
Li, T., Xu, M., Tang, R., et al.: Deepqtmt: a deep learning approach for fast qtmt-based cu partition of intra-mode vvc. IEEE Trans. Image Process. 30, 5377–5390 (2021)
Xu, J., Wu, G., Zhu, C., et al.: Cnn-based fast cu partitioning algorithm for vvc intra coding, in 2022 IEEE ICIP, 2706–2710, IEEE (2022)
Abdallah, B., Belghith, F., Ayed, M. A. B., et al.: Low-complexity qtmt partition based on deep neural network for versatile video coding,” Signal, Image and Video Processing , 1–8 (2021)
Abdallah, B., Belghith, F., Ben Ayed, M.A., et al.: Fast qtmt decision tree for versatile video coding based on deep neural network, Multimedia Tools Appl. , 1–17 (2022)
Park, S.-H., Kang, J.-W.: Fast multi-type tree partitioning for versatile video coding using a lightweight neural network. IEEE Trans. Multimedia 23, 4388–4399 (2020)
Amestoy, T., Mercat, A., Hamidouche, W., et al.: Tunable vvc frame partitioning based on lightweight machine learning. IEEE Trans. Image Process. 29, 1313–1328 (2019)
Fu, T., Zhang, H., Mu, F., et al.: Fast cu partitioning algorithm for h.266/vvc intra-frame coding, in 2019 IEEE International Conference on Multimedia and Expo (ICME), 55–60 (2019)
Saldanha, M., Sanchez, G., Marcon, C., et al.: Configurable fast block partitioning for vvc intra coding using light gradient boosting machine. IEEE Trans. Circuits Syst. Video Technol. 32(6), 3947–3960 (2021)
Shu, Z., Li, J., Peng, Z., et al.: Intra complexity control algorithm for vvc. Electronics 11(16), 2572 (2022)
Wu, G., Huang, Y., Zhu, C., et al.: Svm based fast cu partitioning algorithm for vvc intra coding, in 2021 IEEE (ISCAS), 1–5 (2021)
Zhang, Q., Wang, Y., Huang, L., et al.: Fast cu partition and intra mode decision method for h. 266/vvc. IEEE Access 8, 117539–117550 (2020)
Zhao, J., Wu, A., Zhang, Q.: Svm-based fast cu partition decision algorithm for vvc intra coding. Electronics 11(14), 2147 (2022)
Abdallah, B., Belghith, F., Ayed, M. A. B., et al.: Qtmt partitioning structure in vvc: overview and analysis,” IEEE international Ccnference on Sciences and Techniques of Automatic Control and Computer Engineering (STA) , 331–335 (2022)
Bossen, F., Boyce, J., Li, X., et al.: “Jvet common test conditions and software reference configurations for sdr video,” (JVET) of ITU-T SG16 (2018)
Bjøntegaard, G.: Calculation of average psnr differences between rd-curves (vceg-m33), in VCEG Meeting (ITU-T SG16 Q. 6), 2–4 (2001)
Park, S.-H., Kang, J.-W.: Context-based ternary tree decision method in versatile video coding for fast intra coding. IEEE Access 7, 172597–172605 (2019)
Zhang, Y., Kwong, S., Wang, X., et al.: Machine learning-based coding unit depth decisions for flexible complexity allocation in hevc. IEEE Trans. Image Process. 24(7), 2225–2238 (2015)
Liu, Z., Yu, X., Gao, Y., et al.: Cu partition mode decision for hevc hardwired intra encoder using convolution neural network. IEEE Trans. Image Process. 25(11), 5088–5103 (2016)
Funding
No funding was received for conducting this study.
Author information
Authors and Affiliations
Contributions
All authors have equally contributed.
Corresponding author
Ethics declarations
Conflict of interest
The authors have no relevant financial or non-financial interests to disclose.
Ethical approval
Not applicable.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Belghith, F., Abdallah, B., Ben Jdidia, S. et al. CNN-based ternary tree partition approach for VVC intra-QTMT coding. SIViP 18, 3587–3594 (2024). https://doi.org/10.1007/s11760-024-03023-5
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11760-024-03023-5