Video Deblocking Using Multipath Deep Neural Networks

Chou, Ping-Peng; Leou, Jin-Jang

doi:10.1007/978-981-97-1714-9_3

Ping-Peng Chou⁸ &
Jin-Jang Leou⁸

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 2075))

Included in the following conference series:

International Conference on Technologies and Applications of Artificial Intelligence

72 Accesses

Abstract

In this study, a video deblocking approach using multipath deep neural networks is proposed. The proposed approach contains temporal fusion subnet, variable-filter-size (VFS) subnet, and enhancement subnet. Video deblocking is performed via early fusion so that temporal correlations between adjacent video frames are employed. Based on the experimental results obtained in this study, in terms of two objective performance metrics and subjective evaluation, the performance of the proposed approach is better than those of four comparison approaches.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 59.99; Price excludes VAT (USA)

Softcover Book: USD 79.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

High efficiency video coding, Recommendation ITU-T H.265, November 2019
Google Scholar
Hashimoto, K., Gohshi, S.: Novel deblocking method for cropped video. In: Proceedings of 2019 International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS), pp. 1–2 (2019)
Google Scholar
Bougacha, O., Ben Ayed, M. A., Masmoudi, N.: Prefiltering effect on HEVC intra prediction module. In: Proceedings of 2019 16th international Multi-Conference on Systems, Signals & Devices (SSD), pp. 7–11 (2019)
Google Scholar
Andersson, K., Misra, K., Ikeda, M., Rusanovskyy, D., Iwamura, S.: Deblocking filtering in VVC. In: Proceedings of 2021 Picture Coding Symposium (PCS), pp. 1–5 (2021)
Google Scholar
Zhao, H., He, M., Teng, G., Shang, X., Wang, G., Feng, Y.: A CNN-based post-processing algorithm for video coding efficiency improvement. IEEE Access 8, 920–929 (2020)
Article Google Scholar
Zhang, Y., Shen, T., Ji, X., Zhang, Y., Xiong, R., Dai, Q.: Residual highway convolutional neural networks for in-loop filtering in HEVC. IEEE Trans. Image Process. 27(8), 3827–3841 (2018)
Article MathSciNet Google Scholar
Qi, Z., Jung, C., Xie, B.: Subband adaptive image deblocking using wavelet based convolutional neural networks. IEEE Access 9, 62593–62601 (2021)
Article Google Scholar
Zhang, Y., Chandler, D. M., Mou, X.: Multi-domain residual encoder–decoder networks for generalized compression artifact reduction. J. Vis. Commun. Image Representation 83, 103425–103437 (2022)
Google Scholar
Shi, Z., Mettes, P., Maji, S., Snoek, C.G.M.: On measuring and controlling the spectral bias of the deep image prior. Int. J. of Comput. Vis. 130, 885–908 (2022)
Article Google Scholar
Lu, G., Zhang, X., Ouyang, W., Xu, D., Chen, L., Gao, Z.: Deep non-local Kalman network for video compression artifact reduction. IEEE Trans. Image Process. 29, 1725–1737 (2020)
Article MathSciNet Google Scholar
Wang, X., Chan, K.C., Yu, K., Dong, C., Change Loy, C.: EDVR: video restoration with enhanced deformable convolutional networks. In: Proceedings of 2019 IEEE/CVF Computer Vision and Pattern Recognition Workshops (CVPRW), pp. 1954–1963 (2019)
Google Scholar
Meng, X., Deng, X., Zhu, S., Zeng, B.: Enhancing quality for VVC compressed videos by jointly exploiting spatial details and temporal structure. In: Proceedings of 2019 IEEE International Conference on Image Processing (ICIP), pp. 1193–1197 (2019)
Google Scholar
Huang, Z., Sun, J., Guo, X., Shang, M.: One-for-all: an efficient variable convolution neural network for in-loop filter of VVC. IEEE Trans. Circ. Syst. Video Technol. 32(4), 2342–2355 (2022)
Google Scholar
Norkin, A.: Generalized deblocking filter for AVM. In: Proceedings of 2022 Picture Coding Symposium (PCS), pp. 355–359 (2022)
Google Scholar
Li, S., Huang, L., Xiong, X., Xu, D., Zhu, X., Fan, Y.: An area-efficient deblocking filter architecture for multi-standard video codec. In: Proceedings of 2022 IEEE 4th International Conference on Circuits and Systems (ICCS), pp. 149–154 (2022)
Google Scholar
Tran, D., Bourdev, L., Fergus, R., Torresani, L., Paluri, M.: Learning spatiotemporal features with 3D convolutional networks. arXiv:1412.0767 (2014)
Karpathy, A., Toderici, G., Shetty, S., Leung, T., Sukthankar, R., Fei-Fei, L.: Large-scale video classification with convolutional neural networks. In: Proceedings of 2014 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1725–1732 (2014)
Google Scholar
Caballero, J., Ledig, C., Aitken, A., Acosta, A., Totz, J., Wang, Z., Shi, W.: Real-time video super-resolution with spatio-temporal networks and motion compensation. In: Proceedings of 2017 IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR), pp. 4778–4787 (2017)
Google Scholar
Dai, Y., Liu, D., Wu, F.: A convolutional neural network approach for post-processing in HEVC intra coding. arXiv:1608.06690 (2016)
Nair, V., Hinton, G. E.: Rectified linear units improve restricted boltzmann machines. In: Proceedings of the 27th International Conference on Machine Learning, pp. 807–814 (2010)
Google Scholar
He, K., Zhang, X., Ren, S., Sun, J.: Delving deep into rectifiers: surpassing human-level performance on imagenet classification. arXiv:1502.01852 (2015)
Kingma, D. P., Ba, J. L.: Adam: an approach for stochastic optimization. In: Proceedings of 2015 International Conference on Learning Representations, pp. 1–15 (2015)
Google Scholar
Nah, S., Baik, S., Hong, S., Moon, G., Son, S., Timofte, R., Lee, K. M.: NTIRE 2019 challenge on video deblurring and super-resolution: dataset and study. In: Proceedings of 2019 IEEE/CVF Computer Vision and Pattern Recognition Workshops (CVPRW), pp. 1996–2005 (2019)
Google Scholar
Martens, J.B., Meesters, L.: Image dissimilarity. IEEE Trans. Sig. Process. 70(3), 155–176 (1998)
Google Scholar

Download references

Acknowledgements

This work was supported in part by National Science and Technology Council, Taiwan, Republic of China under grants MOST 111-2221-E-194-021 and NSTC 112-2221-E-194-030.

Author information

Authors and Affiliations

Department of Computer Science and Information Engineering, National Chung Cheng University, Chiayi, Taiwan, 621, Republic of China
Ping-Peng Chou & Jin-Jang Leou

Authors

Ping-Peng Chou
View author publications
You can also search for this author in PubMed Google Scholar
Jin-Jang Leou
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Ping-Peng Chou .

Editor information

Editors and Affiliations

National Yunlin University of Science and Technology, Douliu, Taiwan
Chao-Yang Lee
National Yunlin University of Science and Technology, Douliou, Taiwan
Chun-Li Lin
National Yunlin University of Science and Technology, Douliou, Taiwan
Hsuan-Ting Chang

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Chou, PP., Leou, JJ. (2024). Video Deblocking Using Multipath Deep Neural Networks. In: Lee, CY., Lin, CL., Chang, HT. (eds) Technologies and Applications of Artificial Intelligence. TAAI 2023. Communications in Computer and Information Science, vol 2075. Springer, Singapore. https://doi.org/10.1007/978-981-97-1714-9_3

Download citation

DOI: https://doi.org/10.1007/978-981-97-1714-9_3
Published: 28 March 2024
Publisher Name: Springer, Singapore
Print ISBN: 978-981-97-1713-2
Online ISBN: 978-981-97-1714-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics