Abstract
In recent years there is a spike in text-to-image synthesis research. Most of the researches use Generative Adversarial Network (GAN) because of its effectiveness in generating a realistic synthetic image. In this paper, we provide several recent papers that focus on GAN based text-to-image synthesis and discuss their architecture, advantages of the model, dataset, and evaluation metric.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Dong, Y., Zhang, Y., Ma, L., Wang, Z., Luo, J.: Unsupervised text-to-image synthesis. Pattern Recogn. 110, 107573 (2021)
Wu, X., Xu, K., Hall, P.: A survey of image synthesis and editing with generative adversarial networks. Tsinghua Sci. Technol. 22, 660–674 (2017)
Goodfellow, I.J., et al.:. Generative adversarial nets. In: Proceedings of the 27th International Conference on Neural Information Processing Systems, NIPS 2014, vol. 2, pp. 2672–2680, Cambridge, MA, USA, MIT Press (2014)
Radford, A., Metz, L., Chintala, S.: Unsupervised representation learning with deep convolutional generative adversarial networks (2016)
Mirza, M., Osindero, S.: Conditional generative adversarial nets. CoRR, abs/1411.1784 (2014)
Reed, S., Akata, Z., Yan, X., Logeswaran, L., Schiele, B., Lee, H.: Generative adversarial text to image synthesis. In: Proceedings of Machine Learning Research, vol. 48, pp. 1060–1069, New York, New York, USA, PMLR, 20–22 June 2016
Zhang, H., et al.: Stack-gan: text to photo-realistic image synthesis with stacked generative adversarial networks. In: 2017 IEEE International Conference on Computer Vision (ICCV), pp. 5908–5916 (2017)
Zhang, H., et al.: Stackgan++: realistic image synthesis with stacked generative adversarial networks. IEEE Trans. Pattern Anal. Mach. Intell. 10 (2017)
Xu, T., et al.: Attngan: Fine-grained text to image generation with attentional generative adversarial net- works. In: 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 1316–1324 (2018)
Qiao, T., Zhang, J., Xu, D., Tao, D.: Mirrorgan: learning text-to-image generation by redescription. In: 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1505–1514 (2019)
Li, W., et al.: Object-driven text-to-image synthesis via adversarial training. In: 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 12166–12174 (2019)
Nam, S., Kim, Y., Kim, S.J.: Text-adaptive generative adversarial networks: Manipulating images with natural language. In: Proceedings of the 32nd International Conference on Neural Information Processing Systems, NIPS 2018, pp. 42–51, Red Hook, NY, USA, Curran Associates Inc (2018)
Li, B., Qi, X., Lukasiewicz, T., Torr, P.H.S.: Manigan: Text-guided image manipulation. In: 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 7877–7886 (2020)
Acknowledgement
This research is fully supported by Universiti Sains Malaysia Research University Individual (RUI) Research Grant 1001/PELECT/8014056.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Che Aminudin, M.F., Suandi, S.A. (2022). Review on Generative Adversarial Neural Networks (GAN) in Text-to-Image Synthesis. In: Mahyuddin, N.M., Mat Noor, N.R., Mat Sakim, H.A. (eds) Proceedings of the 11th International Conference on Robotics, Vision, Signal Processing and Power Applications. Lecture Notes in Electrical Engineering, vol 829. Springer, Singapore. https://doi.org/10.1007/978-981-16-8129-5_134
Download citation
DOI: https://doi.org/10.1007/978-981-16-8129-5_134
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-16-8128-8
Online ISBN: 978-981-16-8129-5
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)