Abstract
Synthesizing a realistic image from a matching visual description provided in textual form is a challenge that has recently attracted attention in the artificial intelligence research community. In text-to-image generation, given a text input, an image that matches the description must be produced. A relatively recent class of deep neural networks, generative adversarial networks (GANs), has produced compelling results in encoding textual features and generating high-resolution images. In this work, the main aim is to generate an automobile image from a given text input using generative adversarial networks and to manipulate the automobile's colour using a text-adaptive discriminator. The work involves creating a detailed text description for each car image in order to train the GAN model to produce images.
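The conditioning step the abstract alludes to (feeding a caption embedding into the generator alongside latent noise) can be sketched as follows. This is a minimal illustrative sketch, not the paper's implementation: the embedding dimensions, the toy `embed_text` encoder, and all names here are assumptions for illustration only.

```python
import numpy as np

rng = np.random.default_rng(0)

EMBED_DIM = 128   # assumed sentence-embedding size (illustrative)
NOISE_DIM = 100   # assumed latent-noise size (illustrative)

def embed_text(tokens, dim=EMBED_DIM):
    """Toy stand-in for a learned sentence encoder: averages random
    per-token vectors into one fixed-size caption embedding."""
    vecs = [rng.standard_normal(dim) for _ in tokens]
    return np.mean(vecs, axis=0)

def generator_input(text_embedding, noise_dim=NOISE_DIM):
    """Concatenate latent noise with the caption embedding -- the
    standard conditioning step in text-to-image GANs."""
    z = rng.standard_normal(noise_dim)
    return np.concatenate([z, text_embedding])

caption = "a red sedan with tinted windows".split()
cond = generator_input(embed_text(caption))
print(cond.shape)  # (228,) = noise (100) + caption embedding (128)
```

In a full model, `cond` would be passed through upsampling layers to produce the image, and a text-adaptive discriminator would score image-caption pairs rather than images alone.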
Copyright information
© 2021 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Sindhu, N., Mamatha, H.R. (2021). Generating Automobile Images Dynamically from Text Description. In: Suma, V., Bouhmala, N., Wang, H. (eds) Evolutionary Computing and Mobile Sustainable Networks. Lecture Notes on Data Engineering and Communications Technologies, vol 53. Springer, Singapore. https://doi.org/10.1007/978-981-15-5258-8_21
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-5257-1
Online ISBN: 978-981-15-5258-8
eBook Packages: Engineering (R0)