Abstract
In the real world, lacking paired training data makes image super-resolution (SR) be a tricky unsupervised task. Existing methods are mainly train models on synthetic datasets and achieve the tradeoff between detail restoration and noise artifact suppression based on a priori knowledge, which indicate it cannot be optimal in both aspects. To solve this problem, we propose OTSR, a single image super-resolution method based on optimal transport theory. OTSR aims to find the optimal solution to the ill-posed SR problem, so that the model can restore high-frequency detail accurately and also suppress noise and artifacts well. Our method consists of three stages: real-world images degradation estimation, LR images generation and model optimization based on quadratic Wasserstein distance. Through the first two stages, the problem of no paired image is solved. In the third stage, under the guidance of optimal transport theory, the optimal mapping from LR to HR image space is learned. Extensive experiments show that our method outperforms the state-of-the-art methods in terms of both detail repair and noise artifact suppression. The source code is available at https://github.com/cognaclee/OTSR.
Similar content being viewed by others
References
Adolphs, L., Daneshmand, H., Lucchi, A., Hofmann, T.: Local saddle point optimization: a curvature exploitation approach. In: Kamalika Chaudhuri and Masashi Sugiyama, editors, Proceedings of the Twenty-Second International Conference on Artificial Intelligence and Statistics, volume 89 of Proceedings of Machine Learning Research, pp. 486–495. PMLR, (2019)
Agustsson, E., Timofte, R.: Ntire 2017 challenge on single image super-resolution: Dataset and study. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), pp. 1122–1131, (2017)
Arjovsky, M., Chintala, S., Bottou, L.: Wasserstein generative adversarial networks. In: Precup, Doina, Teh, Yee Whye, editors, Proceedings of the 34th International Conference on Machine Learning, volume 70 of Proceedings of Machine Learning Research, pp. 214–223. PMLR, (2017)
Bell-K., Sefi, S., Assaf, I.M.: Blind super-resolution kernel estimation using an internal-gan. CoRR arXiv:1909.06581 (2019)
Chen, J., Chen, J., Chao, H., Yang, M.: Image blind denoising with generative adversarial network based noise modeling. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3155–3164, (2018)
Deshpande, I., Zhang, Z., Schwing, A.: Generative modeling using the sliced wasserstein distance. In: 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 3483–3491, (2018)
Dong, C., Loy, C.C., He, K., Tang, X.: Image super-resolution using deep convolutional networks. IEEE Trans. Pattern Anal. Mach. Intell. 38(2), 295–307 (2016)
Frogner, C., Zhang, C., Mobahi, H., Araya-Polo, M., Poggio, T.A.: Learning with a wasserstein loss. CoRR arXiv:1506.05439 (2015)
Goodfellow, I., Pouget-Abadie, J., Mirza, M., Xu, B., Warde-Farley, D., Ozair, S., Courville, A., Bengio, Y.: Generative adversarial nets. In: Ghahramani, Z., Welling, M., Cortes, C., Lawrence, N., Weinberger, K.Q. editors. Advances in Neural Information Processing Systems, volume 27. Curran Associates, Inc. (2014)
Gu, J., Lu, H., Zuo, W., Dong, C.: Blind super resolution with iterative kernel correction. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1604–1613, (2019)
Gulrajani, I., Ahmed, F., Arjovsky, M., Dumoulin, V., Courville, A.C.: Improved training of wasserstein gans. CoRR arXiv:1704.00028 (2017)
Haris, M., Shakhnarovich, G., Ukita, N.: Deep back projection networks for super resolution. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1664–1673, (2018)
Ignatov, A., Kobyshev, N., Timofte, R., Vanhoey, K., Gool, L.V.: Dslr-quality photos on mobile devices with deep convolutional networks. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 3277–3285 (2017)
Isola, P., Zhu, J., Zhou, T., Efros, A.A.: Image-to-image translation with conditional adversarial networks. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 5967–5976, (2017)
Ji, X., Cao, Y., Tai, Y., Wang, C., Li, J., Huang, F.: Real-world super-resolution via kernel estimation and noise injection. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, (2020)
Johnson, J., Alahi, A., Fei-Fei, L.: Perceptual losses for real-time style transfer and super-resolution. In: European Conference on Computer Vision, pp. 694–711. Springer International Publishing (2016)
Kanopoulos, N., Vasanthavada, N., Baker, R.L.: Design of an image edge detection filter using the sobel operator. IEEE Journal of Solid-State Circuits 23(2) (1988)
Kantorovich, L.V., Rubinshten, G.S.: On a space of completely additive functions. Vestnik Leningrad Univ 13(7), 52–59 (1958)
Kingma, D.P., Ba, J.: Adam: A method for stochastic optimization. In: Bengio, Yoshua, LeCun, Yann, editors. 3rd International Conference on Learning Representations, ICLR 2015, San Diego, CA, USA, May 7-9, 2015, Conference Track Proceedings (2015)
Ledig, C., Theis, L., Huszar, F., Caballero, J., Cunningham, A., Acosta, A., Aitken, A., Tejani, A., Totz, J., Wang, Z., Shi, W.: Photo-realistic single image super-resolution using a generative adversarial network. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 105–114 (2017)
Lei, N., Kehua, S., Cui, L., Yau, S.-T., Xianfeng David, G.: A geometric view of optimal transportation and generative model. Comput. Aided Geomet. Design 68, 1–21 (2019)
Lei, N., An, D., Guo, Y., Kehua, S., Liu, S., Luo, Z., Yau, S.-T., Xianfeng, G.: A geometric understanding of deep learning. Engineering 6(3), 361–374 (2020)
Lim, B., Son, S., Kim, H., Nah, S., Lee, KM.: Enhanced deep residual networks for single image super-resolution. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, pp. 136-144, (2017)
Liu, H., GU, X., Samaras, D.: A two-step computation of the exact GAN Wasserstein distance. In: Jennifer Dy and Andreas Krause, editors, Proceedings of the 35th International Conference on Machine Learning, volume 80 of Proceedings of Machine Learning Research, pp. 3159–3168. PMLR, (2018)
Liu, H., Gu, X., Samaras, D.:Asserstein gan with quadratic transport cost. In: Proceedings of the International Conference on Computer Vision
Lugmayr, A., Danelljan, M, Timofte, R.: Unsupervised learning for real-world super-resolution. In: 2019 IEEE/CVF International Conference on Computer Vision Workshop (ICCVW), pp. 3408–3416, (2019)
Luo, Z., H., Yan, L., Shang, W., Liang, T.T.: Unfolding the alternating optimization for blind super resolution. CoRR arXiv:2010.02631 (2020)
Mescheder, L., Geiger, A., Nowozin, S.: Which training methods for GANs do actually converge? In: Dy, Jennifer, Krause, Andreas, editors, Proceedings of the 35th International Conference on Machine Learning, volume 80 of Proceedings of Machine Learning Research, pp. 3481–3490. PMLR, (2018)
Mittal, A., Moorthy, A.K., Bovik, A.C.: No-reference image quality assessment in the spatial domain. IEEE Trans. Image Process. 21(12), 4695–4708 (2012)
Mittal, A., Soundararajan, R., Bovik, A.C.: Making a “completely blind’’ image quality analyzer. IEEE Signal Process. Lett. 20(3), 209–212 (2013)
Miyato, T., Kataoka, T., Koyama, M., Yoshida, Y.: Spectral normalization for generative adversarial networks. CoRR arXiv:1802.05957 (2018)
Peyre, G., Cuturi, M.: Computational optimal transport. Found. Trends in Mach. Learn. 11(5–6), 355–607 (2019)
Santambrogio, F.: Optimal transport for applied mathematicians. Progress in Nonlinear Differential Equations and their applications 87 (2015)
Shocher, M. Irani A., Cohen, N.: Zero-shot super-resolution using deep internal learning. In: The Conference on Computer Vision and Pattern Recognition, pp. 3118–3126, (2018)
Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. Computer Science (2014)
Timofte, R., Agustsson, Ei., Gool, L.V., Yang, Ming-Hsuan, Z., Lei: N.: Challenge on single image super-resolution: Methods and results. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, pp. 1110–1121, (2017)
Wang, Z., Chen, J., Hoi, S.C.H.: Deep learning for image super resolution: a survey. IEEE Transactions on Pattern Analysis and Machine Intelligence, pp. 3365–3387 (2020)
Wang, X., Yu, K., Wu, S., Gu, J., Liu, Y., Dong, C., Qiao, Y., Loy, C.C.: Esrgan: Enhanced super-resolution generative adversarial networks. In: The European Conference on Computer Vision Workshops (ECCVW), September (2018)
Wang, H.R.S.Z., Bovik, A.C., Simoncelli, E.P.: Image quality assessment: from error visibility to structural similarity. IEEE Trans. Image Process. 13(4), 600–612 (2004)
Wang, M., Zhenxue Chen, Q.M., Jonathan, W., Jian, M.: Improved face super-resolution generative adversarial networks. Mach. Vis. Appl. 31(4), 1–12 (2020)
Xie, S., Tu, Z.: Holistically nested edge detection. In: Proceedings of the IEEE International Conference on Computer Vision (ICCV), pp. 1395–1403, (2015)
Yann, Brenier: Polar factorization and monotone rearrangement of vector-valued functions. Commun. Pure Appl. Math. 64, 375–417 (1991)
Yuan, Y., Liu, S., Zhang, J., Zhang, Y., Dong, C., Lin, L.: Unsupervised image super-resolution using cycle-in-cycle generative adversarial networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, pp. 701–710, (2018)
Zhang, R., Isola, P., Efros, Alexei A., Shechtman, E., Wang, O.: The unreasonable effectiveness of deep features as a perceptual metric. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 586–595, (2018)
Zhang, Y., Li, K., Li, K., Wang, L., Zhong, B., Fu, Y.: Image super-resolution using very deep residual channel attention networks. In: ECCV, pp. 286–301, (2018)
Zhang, K., Zuo, W., Zhang, L.: Deep plug-and-play super-resolution for arbitrary blur kernels. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1671–1681 (2019)
Zhang, Y., Tian, Y., Kong, Y., Zhong, B., Fu, Y.: Residual dense network for image super-resolution. In: 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 2472–2481, (2018)
Zhang, W., Ma, K., Zhai, G., Yang, X.: Uncertainty-aware blind image quality assessment in the laboratory and wild. IEEE Trans. Image Process. 30, 3474–3486 (2021)
Zhou, R., Susstrunk, S.: Kernel modeling super-resolution on real low resolution images. In: 2019 IEEE International Conference on Computer Vision(ICCV), pp. 2433–2443, (2019)
Acknowledgements
This research was supported by the National Key R &D Program of China 2021YFA1003003, and the National Natural Science Foundation of China under Grant No. 61936002, 61772105, 61720106005. We are fortunate and thankful for all the advice and guidance we have received during this work.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Li, Z., Lei, N., Shi, J. et al. Real-World super-resolution under the guidance of optimal transport. Machine Vision and Applications 33, 48 (2022). https://doi.org/10.1007/s00138-022-01299-6
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s00138-022-01299-6