Abstract
Existing deep learning-based image super-resolution (SR) algorithms have made great achievements on synthetic LR–HR pairs in a fully supervised manner. However, the real image degradation process is complicated and unclear, leading to a domain gap between the synthetic LR images and real LR images. To address this issue, the most widely used alternatives are GAN-based methods trained with unpaired data. But most GAN-based methods tend to follow complex multi-stage pipelines and still suffer from the vast cost of hyperparameter tuning on the loss function. In this paper, in contrast to complicated two-stage or multi-stage GAN, a straightforward and simple one-stage GAN-based framework is proposed. Different from the current one-stage GAN, our approach exclusively fits the CycleGAN pattern without any other restrictions, fully leveraging images from unpaired datasets. Specifically, our framework consists of a DegradeNet and an SRNet, which implicitly models the degradation and SR process between the source LR domain and target HR domain. Such a simple one-stage GAN allows us to flexibly learn the data distribution without any assumptions, greatly reducing the model complexity and training difficulty. To further address the inherent practice of empirically setting hyperparameters in previous methods, we propose an adaptive weighting network to automatically learn the contribution weights of different losses in a meta-learning manner. This flexible strategy could shorten the gap between various losses and meanwhile help the model to adaptively optimize in an efficient way. Extensive experiments demonstrate the superiority of our proposed method in terms of indicators and perceptual quality.
Similar content being viewed by others
Data availability
The datasets generated during and/or analyzed during the current study are available from the corresponding author on reasonable request.
References
Dong C, Loy CC, He K, Tang X (2016) Image super-resolution using deep convolutional networks. IEEE Trans Pattern Anal Mach Intell 38(2):295–307
Kim J, Lee JK, Lee KM (2016) Accurate image super-resolution using very deep convolutional networks. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 1646–1654
Lim B, Son S, Kim H, Nah S, Lee KM (2017) Enhanced deep residual networks for single image super-resolution. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 1132–1140
Lai WS, Huang JB, Ahuja N, Yang MH (2017) Deep laplacian pyramid networks for fast and accurate super-resolution. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 624–632
Zhang Y, Li K, Li K, Wang L, Zhong B, Fu Y (2018) Image super-resolution using very deep residual channel attention networks. In: Proceedings of the European Conference on Computer Vision, pp 294–310
Chen H, Wang Y, Guo T, Xu C, Deng Y, Liu Z, Ma S, Xu C, Xu C, Gao W (2021) Pre-trained image processing transformer. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 12299–12310
Que Y, Lee HJ (2022) Single image super-resolution via deep progressive multi-scale fusion networks. Neural Comput Appl 34:10707–10717
Wang B, Li S, Chen Q, Zuo C (2023) Learning-based single-shot long-range synthetic aperture Fourier ptychographic imaging with a camera array. Opt Lett 48(2):263–266
Ledig C, Theis L, Huszar F, Caballero J, Cunningham A, Acosta A, Aitken AP, Tejani A, Totz J, Wang Z (2017) Photo-realistic single image super-resolution using a generative adversarial network. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 105–114
Sajjadi MSM, Scholkopf B, Hirsch M (2017) EnhanceNet: single image super-resolution through automated texture synthesis. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp 4501–4510
Park S-J, Son H, Cho S, Hong K-S, Lee S (2018) SRFeat: single image super-resolution with feature discrimination. In: Proceedings of the European Conference on Computer Vision (ECCV), pp 439–455
Wang X, Yu K, Wu S, Gu J, Liu Y, Dong C, Qiao Y, Change Loy C (2018) ESRGAN: enhanced super-resolution generative adversarial networks. In: Proceedings of the European Conference on Computer Vision Workshops, pp 63–79
Jolicoeur-Martineau A (2018) The relativistic discriminator: a key element missing from standard GAN. In: Proceedings of the International Conference on Learning Representations, pp 1–25
Liu A, Liu Y, Gu J, Qiao Y, Dong C (2022) Blind image super-resolution: a survey and beyond. IEEE Trans Pattern Anal Mach Intell 45:5461–5480
Yue Z, Zhao Q, Xie J, Zhang L, Meng D, Wong K-YK (2022) Blind image super-resolution with elaborate degradation modeling on noise and kernel. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, pp 2128–2138
Wei Y, Gu S, Li Y, Timofte R, Jin L, Song H (2021) Unsupervised real-world image super resolution via domain-distance aware training. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 13385–13394
Gazdieva M, Rout L, Korotin A, Kravchenko A, Filippov A, Burnaev E (2022) An optimal transport perspective on unpaired image super-resolution. arXiv preprint arXiv:2202.01116
Wang X, Xie L, Dong C, Shan Y (2021) Real-ESRGAN: training real-world blind super-resolution with pure synthetic data. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp 1905–1914
Zhang K, Zuo W, Zhang L (2018) Learning a single convolutional super-resolution network for multiple degradations. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 3262–3271
Xu Y-S, Tseng S-YR, Tseng Y, Kuo H-K, Tsai Y-M (2020) Unified dynamic convolutional network for super-resolution with variational degradations. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 12496–12505
Luo Z, Huang Y, Li S, Wang L, Tan T (2022) Learning the degradation distribution for blind image super-resolution. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, pp 6063–6072
Gu J, Lu H, Zuo W, Dong C (2019) Blind super-resolution with iterative kernel correction. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 1604–1613
Huang Y, Li S, Wang L, Tan T (2020) Unfolding the alternating optimization for blind super resolution. Adv Neural Inf Process Syst 33:5632–5643
Guo Y, Chen J, Wang J, Chen Q, Cao J, Deng Z, Xu Y, Tan M (2020) Closed-loop matters: dual regression networks for single image super-resolution. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 5407–5416
Tao G, Ji X, Wang W, Chen S, Lin C, Cao Y, Lu T, Luo D, Tai Y (2021) Spectrum-to-kernel translation for accurate blind image super-resolution. Adv Neural Inf Process Syst 34:22643–22654
Yuan Y, Liu S, Zhang J, Zhang Y, Dong C, Lin L (2018) Unsupervised image super-resolution using cycle-in-cycle generative adversarial networks. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, pp 701–710
Bharti V, Biswas B, Shukla KK (2021) EMOCGAN: a novel evolutionary multiobjective cyclic generative adversarial network and its application to unpaired image translation. Neural Comput Appl, 1–15
Sun W, Gong D, Shi Q, van den Hengel A, Zhang Y (2021) Learning to zoom-in via learning to zoom-out: real-world super-resolution by generating and adapting degradation. IEEE Trans Image Process 30:2947–2962
Lugmayr A, Danelljan M, Timofte R (2019) Unsupervised learning for real-world super-resolution. In: Proceedings of the IEEE/CVF International Conference on Computer Vision Workshop, pp 3408–3416
Maeda S (2020) Unpaired image super-resolution using pseudo-supervision. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 291–300
Chen S, Han Z, Dai E, Jia X, Liu Z, Xing L, Zou X, Xu C, Liu J, Tian Q (2020) Unsupervised image super-resolution with an indirect supervised path. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, pp 468–469
Wang W, Zhang H, Yuan Z, Wang C (2021) Unsupervised real-world super-resolution: A domain adaptation perspective. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp 4318–4327
Kim G, Park J, Lee K, Lee J, Min J, Lee B, Han DK, Ko H (2020) Unsupervised real-world super resolution with cycle generative adversarial network and domain discriminator. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, pp 456–457
Romero A, Van Gool L, Timofte R (2022) Unpaired real-world super-resolution with pseudo controllable restoration. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 798–807
Shen C, Yin Y, Wang X, Li X, Song J, Song M (2021) Training generative adversarial networks in one stage. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 3350–3360
Mei Y, Fan Y, Zhou Y, Huang L, Huang TS, Shi H (2020) Image super-resolution with cross-scale non-local attention and exhaustive self-exemplars mining. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 5690–5699
Yang Y, Zhuang Y, Pan Y (2021) Multiple knowledge representation for big data artificial intelligence: framework, applications, and case studies. Front Inf Technol Electron Eng 22(12):1551–1558
Zhang Y, Tian Y, Kong Y, Zhong B, Fu Y (2018) Residual dense network for image super-resolution. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 2472–2481
Zhou S, Zhang J, Zuo W, Loy CC (2020) Cross-scale internal graph neural network for image super-resolution. Adv Neural Inf Process Syst 33:3499–3509
Shocher A, Cohen N, Irani M (2018) Zero-shot super-resolution using deep internal learning. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 3118–3126
Park S, Yoo J, Cho D, Kim J, Kim TH (2020) Fast adaptation to super-resolution networks via meta-learning. In: Proceedings of the European Conference on Computer Vision, pp 754–769
Soh JW, Cho S, Cho NI (2020) Meta-transfer learning for zero-shot super-resolution. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 3516–3525
Bell-Kligler S, Shocher A, Irani M (2019) Blind super-resolution kernel estimation using an Internal-GAN. Adv Neural Inf Process Syst 32:284–293
Cheng X, Fu Z, Yang J (2020) Zero-shot image super-resolution with depth guided internal degradation learning. In: Proceedings of the European Conference on Computer Vision, pp 265–280
Emad M, Peemen M, Corporaal H (2021) DualSR: zero-shot dual learning for real-world super-resolution. In: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, pp 1630–1639
Ulyanov D, Vedaldi A, Lempitsky V (2018) Deep image prior. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 9446–9454
Zhu J-Y, Park T, Isola P, Efros AA (2017) Unpaired image-to-image translation using cycle-consistent adversarial networks. In: Proceedings of the IEEE International Conference on Computer Vision, pp 2223–2232
Bulat A, Yang J, Tzimiropoulos G (2018) To learn image super-resolution, use a gan to learn how to do image degradation first. In: Proceedings of the European Conference on Computer Vision, pp 185–200
Zhao T, Ren W, Zhang C, Ren D, Hu Q (2018) Unsupervised degradation learning for single image super-resolution. arXiv preprint arXiv:1812.04240
Prajapati K, Chudasama V, Patel H, Upla K, Ramachandra R, Raja K, Busch C (2020) Unsupervised single image super-resolution network (USISResNet) for real-world data using generative adversarial network. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, pp 464–465
Li L, Jamieson K, DeSalvo G, Rostamizadeh A, Talwalkar A (2017) Hyperband: a novel bandit-based approach to hyperparameter optimization. J Mach Learn Res 18(1):6765–6816
Malkiel I, Wolf L (2020) Mtadam: automatic balancing of multiple training loss terms. In: Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, pp 10713–10729
Shu J, Xie Q, Yi L, Zhao Q, Meng D (2019) Meta-weight-net: learning an explicit mapping for sample weighting. Adv Neural Inf Process Syst 32:1917–1928
Isola P, Zhu J, Zhou T, Efros AA (2017) Image-to-image translation with conditional adversarial networks. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, pp 5967–5976
Lugmayr A, Danelljan M, Timofte R, Fritsche M, Gu S, Purohit K, Kandula P, Suin M, Rajagoapalan A, Joon NH (2019) Aim 2019 challenge on real-world image super-resolution: methods and results. In: Proceedings of the IEEE/CVF International Conference on Computer Vision Workshop, pp 3575–3583
Lugmayr A, Danelljan M, Timofte R (2020) Ntire 2020 challenge on real-world image super-resolution: methods and results. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, pp 494–495
Kingma DP, Ba J (2015) Adam: a method for stochastic optimization. In: Proceedings of the International Conference on Learning Representations, pp 1–13
Wang Z, Bovik AC, Sheikh HR, Simoncelli EP (2004) Image quality assessment: from error visibility to structural similarity. IEEE Trans Image Process 13(4):600–612
Zhang R, Isola P, Efros AA, Shechtman E, Wang O (2018) The unreasonable effectiveness of deep features as a perceptual metric. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 586–595
Timofte R, Agustsson E, Van Gool L, Yang M, Zhang L, Lim B, Son S, Kim H, Nah S, Lee KM (2017) Ntire 2017 challenge on single image super-resolution: methods and results. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, pp 1110–1121
Liu Z, Luo P, Wang X, Tang X (2015) Deep learning face attributes in the wild. In: Proceedings of the IEEE International Conference on Computer Vision, pp 3730–3738
Cao Q, Shen L, Xie W, Parkhi OM, Zisserman A (2018) VGGFace2: a dataset for recognising faces across pose and age. In: Proceedings of the IEEE International Conference on Automatic Face & Gesture Recognition, pp 67–74
Yang S, Luo P, Loy C-C, Tang X (2016) Wider face: a face detection benchmark. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 5525–5533
Acknowledgements
This work was supported by National Key Research and development Program of China (2021YFA1000102), and in part by the grants from the National Natural Science Foundation of China (Nos. 61673396, 61976245), Natural Science Foundation of Shandong Province, China (No. ZR2022MF260).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors have no competing interests to declare that are relevant to the content of this article.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Shao, M., Liu, H., Yang, J. et al. Adaptive one-stage generative adversarial network for unpaired image super-resolution. Neural Comput & Applic 35, 20909–20922 (2023). https://doi.org/10.1007/s00521-023-08888-0
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00521-023-08888-0