Face Super-Resolution Model Based on Diffusion Model

Feng, Tianyi; Xie, Yongping

doi:10.1007/978-981-99-7502-0_6


Part of the book series: Lecture Notes in Electrical Engineering (LNEE, volume 1033)

Included in the following conference series:

International Conference in Communications, Signal Processing, and Systems


Abstract

Restoring high-resolution images from blurry, low-resolution inputs is a long-standing problem. Traditional methods that directly interpolate a low-resolution image to obtain a high-resolution one are simple but ineffective. Inspired by SR3, we propose a face super-resolution model based on the diffusion model, which achieves super-resolution through a stochastic iterative denoising process. The model uses a residual block that integrates multi-scale spatial attention and coordinate attention, and we further enhance the restoration of image details through a global attention model. These improvements address the discrepancy between automated evaluation metrics and human perception of high-frequency details in super-resolution models. On the standard 8× super-resolution task on CelebA-HQ, our model performs well and achieves competitive SSIM and PSNR scores.
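As a rough illustration of the ideas named in the abstract, the following sketch shows a naive interpolation baseline, a DDPM-style iterative denoising loop conditioned on the upsampled low-resolution image, and the PSNR metric. It is not the authors' implementation. All function names are hypothetical, nearest-neighbour upsampling stands in for whatever interpolation the paper uses, the linear noise schedule is an assumption, and `make_oracle_eps` is a toy stand-in for the trained denoising network (it pretends the clean image equals the conditioning image, so the loop can be run without any training).

```python
import numpy as np

def upsample_nearest(lr, factor=8):
    """Naive interpolation baseline: repeat each pixel `factor` times per axis."""
    return np.repeat(np.repeat(lr, factor, axis=0), factor, axis=1)

def psnr(ref, est, max_val=1.0):
    """Peak signal-to-noise ratio in dB for images scaled to [0, max_val]."""
    mse = np.mean((ref - est) ** 2)
    return float("inf") if mse == 0 else 10.0 * np.log10(max_val ** 2 / mse)

def make_oracle_eps(alpha_bars):
    """Toy noise predictor (assumption): treats the conditioning image as x_0.

    A real model would be a trained network eps_theta(x_t, cond, t).
    """
    def eps_model(x, cond, t):
        return (x - np.sqrt(alpha_bars[t]) * cond) / np.sqrt(1.0 - alpha_bars[t])
    return eps_model

def sample_sr(lr, eps_model, betas, rng, factor=8):
    """DDPM-style ancestral sampling conditioned on the upsampled LR image."""
    cond = upsample_nearest(lr, factor)
    alphas = 1.0 - betas
    alpha_bars = np.cumprod(alphas)
    x = rng.standard_normal(cond.shape)  # start from pure Gaussian noise
    for t in range(len(betas) - 1, -1, -1):
        eps = eps_model(x, cond, t)
        # Mean of the reverse step: remove the predicted noise, then rescale.
        x = (x - betas[t] / np.sqrt(1.0 - alpha_bars[t]) * eps) / np.sqrt(alphas[t])
        if t > 0:
            # Every step except the last re-injects Gaussian noise.
            x = x + np.sqrt(betas[t]) * rng.standard_normal(cond.shape)
    return x

rng = np.random.default_rng(0)
lr = rng.random((4, 4))                      # stand-in 4x4 low-resolution image
betas = np.linspace(1e-4, 0.02, 50)          # assumed linear noise schedule
eps_model = make_oracle_eps(np.cumprod(1.0 - betas))
sr = sample_sr(lr, eps_model, betas, rng)    # 32x32 output for factor=8
print(sr.shape)
```

With the toy oracle, the final denoising step reproduces the conditioning image, which only confirms that the loop is wired correctly. Any gain in detail over plain interpolation would have to come from a trained noise predictor.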


References

Kingma DP, Welling M (2013) Auto-encoding variational Bayes. arXiv preprint arXiv:1312.6114
Kingma DP, Dhariwal P (2018) Glow: generative flow with invertible \(1\times 1\) convolutions. In: Advances in neural information processing systems, vol 31
Karras T, Aila T, Laine S, Lehtinen J (2017) Progressive growing of GANs for improved quality, stability, and variation. arXiv preprint arXiv:1710.10196
Gulrajani I, Ahmed F, Arjovsky M, Dumoulin V, Courville AC (2017) Improved training of Wasserstein GANs. In: Advances in neural information processing systems, vol 30
Ravuri S, Vinyals O (2019) Classification accuracy score for conditional generative models. In: Advances in neural information processing systems, vol 32
Lim B, Son S, Kim H, Nah S, Lee KM (2017) Enhanced deep residual networks for single image super-resolution. In: Proceedings of the IEEE conference on computer vision and pattern recognition workshops, pp 136–144
Wang X, Yu K, Wu S, Gu J, Liu Y, Dong C, Qiao Y, Loy CC (2018) ESRGAN: enhanced super-resolution generative adversarial networks. In: Proceedings of the European conference on computer vision (ECCV) workshops, p 0
Liang J, Cao J, Sun G, Zhang K, Van Gool L, Timofte R (2021) SwinIR: image restoration using Swin transformer. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 1833–1844
Chen Y, Tai Y, Liu X, Shen C, Yang J (2018) FSRNet: end-to-end learning face super-resolution with facial priors. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2492–2501
Menon S, Damian A, Hu S, Ravi N, Rudin C (2020) PULSE: self-supervised photo upsampling via latent space exploration of generative models. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 2437–2445
Rombach R, Blattmann A, Lorenz D, Esser P, Ommer B (2022) High-resolution image synthesis with latent diffusion models. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 10684–10695
Ho J, Saharia C, Chan W, Fleet DJ, Norouzi M, Salimans T (2022) Cascaded diffusion models for high fidelity image generation. J Mach Learn Res 23(47):1–33
Saharia C, Ho J, Chan W, Salimans T, Fleet DJ, Norouzi M (2022) Image super-resolution via iterative refinement. IEEE Trans Pattern Anal Mach Intell
Su J-N, Gan M, Chen G-Y, Yin J-L, Chen CP (2022) Global learnable attention for single image super-resolution. IEEE Trans Pattern Anal Mach Intell
Gao S, Liu X, Zeng B, Xu S, Li Y, Luo X, Liu J, Zhen X, Zhang B (2023) Implicit diffusion models for continuous super-resolution. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 10021–10030

Author information

Authors and Affiliations

Dalian University of Technology, Dalian, 116081, China
Tianyi Feng & Yongping Xie

Authors

Tianyi Feng
Yongping Xie

Corresponding author

Correspondence to Yongping Xie.

Editor information

Editors and Affiliations

College of Electronic and Communication Engineering, Tianjin Normal University, Tianjin, China
Wei Wang
Innovative Parking Building, Room B410, Dalian University of Technology, Dalian, China
Xin Liu
Sci & Tech, DianHang Bldg, Rm 321, Dalian Maritime Univ, Sch of Info, Dalian, China
Zhenyu Na
College of Electronic and Communication Engineering, Tianjin Normal University, Tianjin, China
Baoju Zhang

About this paper

Cite this paper

Feng, T., Xie, Y. (2024). Face Super-Resolution Model Based on Diffusion Model. In: Wang, W., Liu, X., Na, Z., Zhang, B. (eds) Communications, Signal Processing, and Systems. CSPS 2023. Lecture Notes in Electrical Engineering, vol 1033. Springer, Singapore. https://doi.org/10.1007/978-981-99-7502-0_6

DOI: https://doi.org/10.1007/978-981-99-7502-0_6
Published: 18 April 2024
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-7555-6
Online ISBN: 978-981-99-7502-0
eBook Packages: Engineering, Engineering (R0)
