Skip to main content

Review on Generative Adversarial Neural Networks (GAN) in Text-to-Image Synthesis

  • Conference paper
  • First Online:
Proceedings of the 11th International Conference on Robotics, Vision, Signal Processing and Power Applications

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 829))

  • 1760 Accesses

Abstract

In recent years there is a spike in text-to-image synthesis research. Most of the researches use Generative Adversarial Network (GAN) because of its effectiveness in generating a realistic synthetic image. In this paper, we provide several recent papers that focus on GAN based text-to-image synthesis and discuss their architecture, advantages of the model, dataset, and evaluation metric.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 259.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 329.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Dong, Y., Zhang, Y., Ma, L., Wang, Z., Luo, J.: Unsupervised text-to-image synthesis. Pattern Recogn. 110, 107573 (2021)

    Google Scholar 

  2. Wu, X., Xu, K., Hall, P.: A survey of image synthesis and editing with generative adversarial networks. Tsinghua Sci. Technol. 22, 660–674 (2017)

    Google Scholar 

  3. Goodfellow, I.J., et al.:. Generative adversarial nets. In: Proceedings of the 27th International Conference on Neural Information Processing Systems, NIPS 2014, vol. 2, pp. 2672–2680, Cambridge, MA, USA, MIT Press (2014)

    Google Scholar 

  4. Radford, A., Metz, L., Chintala, S.: Unsupervised representation learning with deep convolutional generative adversarial networks (2016)

    Google Scholar 

  5. Mirza, M., Osindero, S.: Conditional generative adversarial nets. CoRR, abs/1411.1784 (2014)

    Google Scholar 

  6. Reed, S., Akata, Z., Yan, X., Logeswaran, L., Schiele, B., Lee, H.: Generative adversarial text to image synthesis. In: Proceedings of Machine Learning Research, vol. 48, pp. 1060–1069, New York, New York, USA, PMLR, 20–22 June 2016

    Google Scholar 

  7. Zhang, H., et al.: Stack-gan: text to photo-realistic image synthesis with stacked generative adversarial networks. In: 2017 IEEE International Conference on Computer Vision (ICCV), pp. 5908–5916 (2017)

    Google Scholar 

  8. Zhang, H., et al.: Stackgan++: realistic image synthesis with stacked generative adversarial networks. IEEE Trans. Pattern Anal. Mach. Intell. 10 (2017)

    Google Scholar 

  9. Xu, T., et al.: Attngan: Fine-grained text to image generation with attentional generative adversarial net- works. In: 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 1316–1324 (2018)

    Google Scholar 

  10. Qiao, T., Zhang, J., Xu, D., Tao, D.: Mirrorgan: learning text-to-image generation by redescription. In: 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1505–1514 (2019)

    Google Scholar 

  11. Li, W., et al.: Object-driven text-to-image synthesis via adversarial training. In: 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 12166–12174 (2019)

    Google Scholar 

  12. Nam, S., Kim, Y., Kim, S.J.: Text-adaptive generative adversarial networks: Manipulating images with natural language. In: Proceedings of the 32nd International Conference on Neural Information Processing Systems, NIPS 2018, pp. 42–51, Red Hook, NY, USA, Curran Associates Inc (2018)

    Google Scholar 

  13. Li, B., Qi, X., Lukasiewicz, T., Torr, P.H.S.: Manigan: Text-guided image manipulation. In: 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 7877–7886 (2020)

    Google Scholar 

Download references

Acknowledgement

This research is fully supported by Universiti Sains Malaysia Research University Individual (RUI) Research Grant 1001/PELECT/8014056.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Muhamad Faris Che Aminudin .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Che Aminudin, M.F., Suandi, S.A. (2022). Review on Generative Adversarial Neural Networks (GAN) in Text-to-Image Synthesis. In: Mahyuddin, N.M., Mat Noor, N.R., Mat Sakim, H.A. (eds) Proceedings of the 11th International Conference on Robotics, Vision, Signal Processing and Power Applications. Lecture Notes in Electrical Engineering, vol 829. Springer, Singapore. https://doi.org/10.1007/978-981-16-8129-5_134

Download citation

Publish with us

Policies and ethics