Abstract
In this study, a video deblocking approach using multipath deep neural networks is proposed. The proposed approach contains temporal fusion subnet, variable-filter-size (VFS) subnet, and enhancement subnet. Video deblocking is performed via early fusion so that temporal correlations between adjacent video frames are employed. Based on the experimental results obtained in this study, in terms of two objective performance metrics and subjective evaluation, the performance of the proposed approach is better than those of four comparison approaches.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
High efficiency video coding, Recommendation ITU-T H.265, November 2019
Hashimoto, K., Gohshi, S.: Novel deblocking method for cropped video. In: Proceedings of 2019 International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS), pp. 1–2 (2019)
Bougacha, O., Ben Ayed, M. A., Masmoudi, N.: Prefiltering effect on HEVC intra prediction module. In: Proceedings of 2019 16th international Multi-Conference on Systems, Signals & Devices (SSD), pp. 7–11 (2019)
Andersson, K., Misra, K., Ikeda, M., Rusanovskyy, D., Iwamura, S.: Deblocking filtering in VVC. In: Proceedings of 2021 Picture Coding Symposium (PCS), pp. 1–5 (2021)
Zhao, H., He, M., Teng, G., Shang, X., Wang, G., Feng, Y.: A CNN-based post-processing algorithm for video coding efficiency improvement. IEEE Access 8, 920–929 (2020)
Zhang, Y., Shen, T., Ji, X., Zhang, Y., Xiong, R., Dai, Q.: Residual highway convolutional neural networks for in-loop filtering in HEVC. IEEE Trans. Image Process. 27(8), 3827–3841 (2018)
Qi, Z., Jung, C., Xie, B.: Subband adaptive image deblocking using wavelet based convolutional neural networks. IEEE Access 9, 62593–62601 (2021)
Zhang, Y., Chandler, D. M., Mou, X.: Multi-domain residual encoder–decoder networks for generalized compression artifact reduction. J. Vis. Commun. Image Representation 83, 103425–103437 (2022)
Shi, Z., Mettes, P., Maji, S., Snoek, C.G.M.: On measuring and controlling the spectral bias of the deep image prior. Int. J. of Comput. Vis. 130, 885–908 (2022)
Lu, G., Zhang, X., Ouyang, W., Xu, D., Chen, L., Gao, Z.: Deep non-local Kalman network for video compression artifact reduction. IEEE Trans. Image Process. 29, 1725–1737 (2020)
Wang, X., Chan, K.C., Yu, K., Dong, C., Change Loy, C.: EDVR: video restoration with enhanced deformable convolutional networks. In: Proceedings of 2019 IEEE/CVF Computer Vision and Pattern Recognition Workshops (CVPRW), pp. 1954–1963 (2019)
Meng, X., Deng, X., Zhu, S., Zeng, B.: Enhancing quality for VVC compressed videos by jointly exploiting spatial details and temporal structure. In: Proceedings of 2019 IEEE International Conference on Image Processing (ICIP), pp. 1193–1197 (2019)
Huang, Z., Sun, J., Guo, X., Shang, M.: One-for-all: an efficient variable convolution neural network for in-loop filter of VVC. IEEE Trans. Circ. Syst. Video Technol. 32(4), 2342–2355 (2022)
Norkin, A.: Generalized deblocking filter for AVM. In: Proceedings of 2022 Picture Coding Symposium (PCS), pp. 355–359 (2022)
Li, S., Huang, L., Xiong, X., Xu, D., Zhu, X., Fan, Y.: An area-efficient deblocking filter architecture for multi-standard video codec. In: Proceedings of 2022 IEEE 4th International Conference on Circuits and Systems (ICCS), pp. 149–154 (2022)
Tran, D., Bourdev, L., Fergus, R., Torresani, L., Paluri, M.: Learning spatiotemporal features with 3D convolutional networks. arXiv:1412.0767 (2014)
Karpathy, A., Toderici, G., Shetty, S., Leung, T., Sukthankar, R., Fei-Fei, L.: Large-scale video classification with convolutional neural networks. In: Proceedings of 2014 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1725–1732 (2014)
Caballero, J., Ledig, C., Aitken, A., Acosta, A., Totz, J., Wang, Z., Shi, W.: Real-time video super-resolution with spatio-temporal networks and motion compensation. In: Proceedings of 2017 IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR), pp. 4778–4787 (2017)
Dai, Y., Liu, D., Wu, F.: A convolutional neural network approach for post-processing in HEVC intra coding. arXiv:1608.06690 (2016)
Nair, V., Hinton, G. E.: Rectified linear units improve restricted boltzmann machines. In: Proceedings of the 27th International Conference on Machine Learning, pp. 807–814 (2010)
He, K., Zhang, X., Ren, S., Sun, J.: Delving deep into rectifiers: surpassing human-level performance on imagenet classification. arXiv:1502.01852 (2015)
Kingma, D. P., Ba, J. L.: Adam: an approach for stochastic optimization. In: Proceedings of 2015 International Conference on Learning Representations, pp. 1–15 (2015)
Nah, S., Baik, S., Hong, S., Moon, G., Son, S., Timofte, R., Lee, K. M.: NTIRE 2019 challenge on video deblurring and super-resolution: dataset and study. In: Proceedings of 2019 IEEE/CVF Computer Vision and Pattern Recognition Workshops (CVPRW), pp. 1996–2005 (2019)
Martens, J.B., Meesters, L.: Image dissimilarity. IEEE Trans. Sig. Process. 70(3), 155–176 (1998)
Acknowledgements
This work was supported in part by National Science and Technology Council, Taiwan, Republic of China under grants MOST 111-2221-E-194-021 and NSTC 112-2221-E-194-030.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Chou, PP., Leou, JJ. (2024). Video Deblocking Using Multipath Deep Neural Networks. In: Lee, CY., Lin, CL., Chang, HT. (eds) Technologies and Applications of Artificial Intelligence. TAAI 2023. Communications in Computer and Information Science, vol 2075. Springer, Singapore. https://doi.org/10.1007/978-981-97-1714-9_3
Download citation
DOI: https://doi.org/10.1007/978-981-97-1714-9_3
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-97-1713-2
Online ISBN: 978-981-97-1714-9
eBook Packages: Computer ScienceComputer Science (R0)