Dual Discriminator Weighted Mixture Generative Adversarial Network for image generation

Liu, Bao; Wang, Liang; Wang, Jingting; Zhang, Jinyu

doi:10.1007/s12652-021-03667-y

Dual Discriminator Weighted Mixture Generative Adversarial Network for image generation

Original Research
Published: 03 February 2022

Volume 14, pages 10013–10025, (2023)
Cite this article

Journal of Ambient Intelligence and Humanized Computing Aims and scope Submit manuscript

Bao Liu ORCID: orcid.org/0000-0002-9335-5411¹,
Liang Wang¹,
Jingting Wang² &
…
Jinyu Zhang³

372 Accesses
3 Citations
Explore all metrics

Abstract

Image generation is a hot topic in the field of machine learning and computer vision. As a representative of its algorithm, the Generative Adversarial Network (GAN) has the problem of mode collapse in practice. The proposed Dual Discriminator Weighted Mixture Generative Adversarial Network (D2WMGAN) approach can cope with this problem. On the one hand, the D2WMGAN uses the mixed distribution of multiple generators to approximate the real distribution, in order to prevent the extreme situation that multiple generators learn the same distribution and generate the same class of samples, with a classifier to play games with generators to make different generators learn different distributions. On the other hand, the objective function of D2WMGAN weights the Kullback–Leibler (KL) divergence and the reverse KL divergence, and uses their complementary characteristics to improve the quality and diversity of samples from the generators. Then, the theoretical conditional optimality of the D2WMGAN is proved theoretically, which shows that multiple generators can learn the real data distribution in the case of the optimal discriminator and classifier. Finally, extensive experiments are conducted on a large amount of synthetic data and real-world large-scale datasets (such as, CIFAR-10 and MNIST), and the commonly used GAN evaluation indicators (Wasserstein distance, JS divergence, Inception score, and Frechet Inception Distance) are introduced for comparative analysis. Experimental results show that the proposed D2WMGAN approach can better learn multiple mode data, generate rich realistic samples, and effectively solve the problem of mode collapse.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

VARGAN: variance enforcing network enhanced GAN

Article 12 April 2022

Improving generative adversarial networks with simple latent distributions

Article 16 April 2021

A Step Beyond Generative Multi-adversarial Networks

Data availability statement

The data that support the findings of this study are available from the corresponding author upon reasonable request.

Code availability

The code that support the findings of this study are available from the corresponding author upon reasonable request.

References

Arjovsky M, Bottou L (2017) Towards principled methods for training generative adversarial networks. In: Proceedings of International Conference on Learning Representations (ICLR)
Arjovsky M, Chintala S, Bottou L (2017) Wasserstein generative adversarial networks. In: Proceedings of International Conference on Machine Learning, PMLR, pp 214–223
Dowson DC, Landau BV (1982) The Fréchet distance between multivariate normal distributions. J Multivar Anal 12(3):450–455
Article MATH Google Scholar
Fiore U, Santis AD, Perla F, Zanetti P, Palmieri F (2019) Using generative adversarial networks for improving classification effectiveness in credit card fraud detection. Inf Sci 479:448–455
Article Google Scholar
Goodfellow I, Pouget-Abadie J, Mirza M, Xu B, Warde-Farley D, Ozair S, Courville A, Bengio Y (2014) Generative adversarial nets. In: Advances in Neural Information Processing Systems, Curran Associates, pp 2672–2680
Guo Z, Wan Y, Ye H (2019) A data imputation method for multivariate time series based on generative adversarial network. Neurocomputing 360:185–197
Article Google Scholar
Hoang Q, Nguyen TD, Le T, Phung D (2018) MGAN: Training generative adversarial nets with multiple generators. In: Proceedings of International Conference on Learning Representations (ICLR)
Hu R, Cui X (2021) Application of single frame image super-resolution algorithm based on generative adversarial network in tennis motion image resolution. J Ambient Intell Humaniz Comput. https://doi.org/10.1007/s12652-021-03100-4
Article Google Scholar
Hu H, Miao C, Hu WW (2018) Generative adversarial networks-and ResNets-based framework for image translation with super-resolution. J Electron Imaging 27(6):063018
Article MathSciNet Google Scholar
Jin Q, Luo X, Shi Y, Kita K (2019) Image generation method based on improved condition GAN. In: Proceedings of 2019 6th International Conference on Systems and Informatics (ICSAI), IEEE, pp 1290–1294
Krizhevsky A, Hinton G (2009) Learning multiple layers of features from tiny images. Tech Rep TR-2009 1(4):54–57
Google Scholar
Krizhevsky A, Sutskever I, Hinton GE (2012) Imagenet classification with deep convolutional neural networks. In: Advances in Neural Information Processing Systems, pp 1097–1105
LeCun Y, Bottou L, Bengio Y, Haffner P (1998) Gradient-based learning applied to document recognition. In: Proceedings of the IEEE, pp 2278–2324
Lee CY, Shon JG, Park JS (2021) An edge detection–based eGAN model for connectivity in ambient intelligence environments. J Ambient Intell Humaniz Comput. https://doi.org/10.1007/s12652-021-03261-2
Article Google Scholar
Lei N, An D, Guo Y, Su K, Liu S, Luo Z, Yan ST, Gu XF (2020) A geometric understanding of deep learning. Engineering 6(3):361–374
Article Google Scholar
Li N, Zheng Z, Zhang S, Yu Z, Zheng H, Zheng B (2018a) The synthesis of unpaired underwater images using a multistyle generative adversarial network. IEEE Access 6:54241–54257
Article Google Scholar
Li Z, Nie F, Chang X, Nie L, Zhang H, Yang Y (2018b) Rank-constrained spectral clustering with flexible embedding. IEEE Trans Neural Netw Learn Syst 29(12):6073–6082
Article MathSciNet Google Scholar
Li Z, Nie F, Chang X, Yang Y, Zhang C, Sebe N (2018c) Dynamic affinity graph construction for spectral clustering using multiple features. IEEE Trans Neural Netw Learn Syst 29(12):6323–6332
Article MathSciNet Google Scholar
Li Y, Xiao N, Ouyang W (2019) Improved generative adversarial networks with reconstruction loss. Neurocomputing 323:363–372
Article Google Scholar
Li Y, Zhao K, Ren F, Wang B, Zhao J (2020) Research on super-resolution image reconstruction based on low-resolution infrared sensor. IEEE Access 8:69186–69199
Article Google Scholar
Li P, Li Z, Pang X, Wang H, Lin W, Wu W (2021) Multi-scale residual denoising GAN model for producing super-resolution CTA images. J Ambient Intell Humaniz Comput. https://doi.org/10.1007/s12652-021-03009-y
Article Google Scholar
Liu G, Li X, Wei J (2021a) Large-area damage image restoration algorithm based on generative adversarial network. Neural Comput Appl 33(10):4651–4661
Article Google Scholar
Liu X, Gao Z, Chen BM (2021b) IPMGAN: integrating physical model and generative adversarial network for underwater image enhancement. Neurocomputing 453:538–551
Article Google Scholar
Liu B, Gao N, Huang MT, Liu H, Wang JT (2021c) On the effectiveness of dual discriminator weighted generative adversarial network. J Electron Imaging 30(3):1–17
Article Google Scholar
Liu L, Muelly M, Deng J, Pfister T, Li J (2019) Generative modeling for small-data object detection. In: Proceedings of the IEEE International Conference on Computer Vision, pp 6073–6081
Ma C, Zhu J, Li Y, Li J, Jiang Y, Li X (2020) Single image super resolution via wavelet transform fusion and SRFeat network. J Ambient Intell Humaniz Comput. https://doi.org/10.1007/s12652-020-02065-0
Article Google Scholar
Menéndez ML, Pardo JA, Pardo L, Pardo MC (1997) The Jensen-shannon divergence. J Franklin Inst 334(2):307–318
Article MathSciNet MATH Google Scholar
Metz L, Poole B, Pfau D, Sohl-Dickstein J (2016) Unrolled generative adversarial networks. In: Proceedings of International Conference on Learning Representations (ICLR), pp 1–25
Nguyen T, Le T, Vu H (2017) Dual discriminator generative adversarial nets. In: Proceedings of International Conference on Neural Information Processing Systems (NIPS), pp 2671–2681
Nowozin S, Cseke B, Tomioka R (2016) f-gan: Training generative neural samplers using variational divergence minimization. In: Proceedings of the 30th International Conference on Neural Information Processing Systems (NIPS), pp 271–279
Raiber F, Kurland O (2017) Kullback-leibler divergence revisited. In: Proceedings of the ACM SIGIR International Conference on Theory of Information Retrieval, pp 117–124
Rajasenbagam T, Jeyanthi S, Arun Pandian J (2021) Detection of pneumonia infection in lungs from chest X-ray images using deep convolutional neural network and content-based image retrieval techniques. J Ambient Intell Humaniz Comput. https://doi.org/10.1007/s12652-021-03075-2
Article Google Scholar
Ren P, Xiao Y, Chang X, Huang PY, Li Z, Chen X, Wang X (2021) A comprehensive survey of neural architecture search: challenges and solutions. ACM Comput Surv (CSUR) 54(4):1–34
Article Google Scholar
Rubner Y, Tomasi C, Guibas LJ (2000) The earth mover’s distance as a metric for image retrieval. Int J Comput Vision 40(2):99–121
Article MATH Google Scholar
Sajja TK, Kalluri HK (2021) Image classification using regularized convolutional neural network design with dimensionality reduction modules: RCNN–DRM. J Ambient Intell Humaniz Comput 12:9423–9434
Article Google Scholar
Salimans T, Goodfellow I, Zaremba W (2016) Improved techniques for training GANs. In: Advances in Neural Information Processing Systems, pp 2234–2242
Szegedy C, Vanhoucke V, Ioffe S, Shlens J, Wojna Z (2016) Rethinking the inception architecture for computer vision. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 2818–2826
Xiang P, Wang L, Cheng J, Zhang B, Wu JJ (2017) A deep network architecture for image inpainting. In: Proceedings of 2017 3rd IEEE International Conference on Computer and Communications (ICCC), IEEE, pp 1851–1856
Xu L, Zeng X, Li W, Huang Z (2020) Multi-granularity generative adversarial nets with reconstructive sampling for image inpainting. Neurocomputing 402:220–234
Article Google Scholar
Yan C, Chang X, Luo M, Zheng Q, Zhang X, Li Z, Nie F (2020) Self-weighted robust LDA for multiclass classification with edge classes. ACM Trans Intell Syst Technol (TIST) 12(1):1–19
Google Scholar
Yang T, Chang X, Su H, Crombez N, Yan Z (2020) Raindrop removal with light field image using image inpainting. IEEE Access 8:58416–58426
Article Google Scholar
Zhang CL, Luo JH, Wei XS, Wu J (2017) In defense of fully connected layers in visual representation transfer. In: Proceedings of Pacific Rim Conference on Multimedia. Springer, Cham, pp 807–817
Google Scholar

Download references

Acknowledgements

Research supported in part by grant for the Key Research and Development Program of Shaanxi (2021GY-131), Yulin Science and Technology Plan Project (CXY-2020-037), Xi’an Science and Technology Plan Project (2020KJRC0068), Scientific Research Program Funded by Shaanxi Provincial Education Department (18JK1005), Key R & D Projects of Shaanxi Province (2019GY-097), and Innovation Capability Support Program of Shaanxi (2020TD-021).

Author information

Authors and Affiliations

College of Electrical and Control Engineering, Xi’an University of Science and Technology, Xi’an, China
Bao Liu & Liang Wang
Department of Engineering and Technology, Xi’an Fanyi University, Xi’an, China
Jingting Wang
Shaanxi Zhongyi Times Technology Co., Ltd., Xi’an, China
Jinyu Zhang

Authors

Bao Liu
View author publications
You can also search for this author in PubMed Google Scholar
Liang Wang
View author publications
You can also search for this author in PubMed Google Scholar
Jingting Wang
View author publications
You can also search for this author in PubMed Google Scholar
Jinyu Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

BL: conceptualization, methodology, writing—review and editing, and funding acquisition. LW: software, validation, investigation, resources, data curation, and writing—original draft preparation. JW and JZ: formal analysis. JW: supervision and project administration. JZ: visualization.

Corresponding author

Correspondence to Bao Liu.

Ethics declarations

Conflict of interest

The authors declare that there is no conflict of interest regarding the publication of this paper.

Ethics approval and consent to participate

Not applicable.

Consent to publication

All authors have read this manuscript and would like to have it considered exclusively for publication.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Liu, B., Wang, L., Wang, J. et al. Dual Discriminator Weighted Mixture Generative Adversarial Network for image generation. J Ambient Intell Human Comput 14, 10013–10025 (2023). https://doi.org/10.1007/s12652-021-03667-y

Download citation

Received: 16 September 2021
Accepted: 15 December 2021
Published: 03 February 2022
Issue Date: August 2023
DOI: https://doi.org/10.1007/s12652-021-03667-y

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Dual Discriminator Weighted Mixture Generative Adversarial Network for image generation

Abstract

Access this article

Similar content being viewed by others

VARGAN: variance enforcing network enhanced GAN

Improving generative adversarial networks with simple latent distributions

A Step Beyond Generative Multi-adversarial Networks

Data availability statement

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Ethics approval and consent to participate

Consent to publication

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Dual Discriminator Weighted Mixture Generative Adversarial Network for image generation

Abstract

Access this article

Similar content being viewed by others

VARGAN: variance enforcing network enhanced GAN

Improving generative adversarial networks with simple latent distributions

A Step Beyond Generative Multi-adversarial Networks

Data availability statement

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Ethics approval and consent to participate

Consent to publication

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation