Review on Generative Adversarial Neural Networks (GAN) in Text-to-Image Synthesis

Che Aminudin, Muhamad Faris; Suandi, Shahrel Azmin

doi:10.1007/978-981-16-8129-5_134

Part of the book series: Lecture Notes in Electrical Engineering (LNEE, volume 829)

Abstract

In recent years there has been a surge in text-to-image synthesis research. Most of this work uses Generative Adversarial Networks (GANs) because of their effectiveness in generating realistic synthetic images. In this paper, we review several recent papers on GAN-based text-to-image synthesis and discuss their architectures, the advantages of each model, and the datasets and evaluation metrics they use.

Acknowledgement

This research is fully supported by Universiti Sains Malaysia Research University Individual (RUI) Research Grant 1001/PELECT/8014056.

Author information

Authors and Affiliations

Intelligent Biometric Group, School of Electrical and Electronic Engineering, Universiti Sains Malaysia Engineering Campus, Universiti Sains Malaysia, 14300, Nibong Tebal, Pulau Pinang, Malaysia
Muhamad Faris Che Aminudin & Shahrel Azmin Suandi

Corresponding author

Correspondence to Muhamad Faris Che Aminudin.

Editor information

Editors and Affiliations

School of Electrical and Electronic Engineering, Universiti Sains Malaysia, Penang, Malaysia
Nor Muzlifah Mahyuddin
School of Electrical and Electronic Engineering, Universiti Sains Malaysia, Penang, Malaysia
Nor Rizuan Mat Noor
School of Electrical and Electronic Engineering, Universiti Sains Malaysia, Penang, Malaysia
Harsa Amylia Mat Sakim

About this paper

Cite this paper

Che Aminudin, M.F., Suandi, S.A. (2022). Review on Generative Adversarial Neural Networks (GAN) in Text-to-Image Synthesis. In: Mahyuddin, N.M., Mat Noor, N.R., Mat Sakim, H.A. (eds) Proceedings of the 11th International Conference on Robotics, Vision, Signal Processing and Power Applications. Lecture Notes in Electrical Engineering, vol 829. Springer, Singapore. https://doi.org/10.1007/978-981-16-8129-5_134

DOI: https://doi.org/10.1007/978-981-16-8129-5_134
Published: 11 February 2022
Publisher Name: Springer, Singapore
Print ISBN: 978-981-16-8128-8
Online ISBN: 978-981-16-8129-5
eBook Packages: Intelligent Technologies and Robotics (R0)
