Skip to main content

Usage of Generative Adversarial Network to Improve Text to Image Synthesis

  • Conference paper
  • First Online:
Computational Intelligence in Machine Learning (ICCIML 2022)

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 1106))

  • 89 Accesses

Abstract

Nowadays generating high-quality image from text description is one of most challenging problem in computer vision. Generating images from text description has a wide range of applications, like computer-aided design. In this chapter, a new model called Knowledge Transfer Generative Adversarial Networks (KT GAN) is proposed for an exact synthesis of given image. To enhance the quality of generated image, an Alternate Attention Transfer Mechanism (AATM) and Semantic Distillation Mechanism (SDM) are introduced to help the generator better to traverse cross-domain area that exists between text and image. There are two main goals for using AATM: first, to make images more visually appealing by highlighting relevant words and second, to make images more visually appealing by highlighting essential sub-regions of an image. Use an image encoder that has been taught in image-image for training the text encoder for image-image. Improved text characteristics and photos of higher quality is possible with this method. The suggested KT GAN outperforms the traditional approach significantly and provides convincing results over a wide range of assessment measures after precise testing on two public datasets.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 299.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 379.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Pradhan AK, Swain S, Rout JK (2022) Role of machine learning and cloud-driven platform in IoT-based smart farming. In: Machine learning and internet of things for societal issues. Springer, Singapore, pp 43–54

    Google Scholar 

  2. Christian L et al (2017) Photo realistic single image super-resolution using a GAN. In: CVPR

    Google Scholar 

  3. Kumar S, Ansari MD, Gunjan VK, Solanki VK (2020) On classification of BMD images using machine learning (ANN) algorithm. In: ICDSMLA 2019. Springer, Singapore, pp 1590–1599

    Google Scholar 

  4. Mani MR, Srikanth T, Satyanarayana C (2022) An integrated approach for medical image classification using potential shape signature and neural network. In: Machine learning and internet of things for societal issues. Springer, Singapore, pp 109–115

    Google Scholar 

  5. Shaik AS, Karsh RK, Suresh M, Gunjan VK (2022) LWT-DCT based image hashing for tampering localization via blind geometric correction. In: ICDSMLA 2020. Springer, Singapore, pp 1651–1663

    Google Scholar 

  6. Reed S et al (2016) Generative adversarial text to image synthesis. In: ICML

    Google Scholar 

  7. Gunjan VK, Shaik F, Venkatesh C, Amarnath M (2017) Artifacts correction in MRI images. In: Computational methods in molecular imaging technologies. Springer, Singapore, pp 9–28

    Google Scholar 

  8. Zhang H et al (2017) StackGAN: text to photo-realistic image synthesis with stacked generative adversarial networks. In: IEEE/ICCV, Oct. 2017, pp 5907–5915

    Google Scholar 

  9. Prasad PS, Bethel GNB, Singh N, Gunjan VK, Basir S, Miah S (2022) Blockchain-based privacy access control mechanism and collaborative analysis for medical images. Sec Commun Netw

    Google Scholar 

  10. Bowen L et al (2019) Controllable text to image generation. In: NeurIPS, pp 2065–2075

    Google Scholar 

  11. Gunjan VK, Shaik F, Venkatesh C, Amarnath M (2017) Visual quality improvement of CT image reconstruction with quantitative measures. In: Computational methods in molecular imaging technologies. Springer, Singapore, pp 45–73

    Google Scholar 

  12. Bhardwaj T, Mittal R, Upadhyay H, Lagos L (2022) Applications of swarm intelligent and deep learning algorithms for image-based cancer recognition. In: Garg L, Basterrech S, Banerjee C, Sharma TK (eds) Artificial intelligence in healthcare. Advanced technologies and societal change. Springer, Singapore. https://doi.org/10.1007/978-981-16-6265-2_9

  13. Merugu S, Kumar A, Ghinea G (2023) Hardware, component, description. In: Track and trace management system for dementia and intellectual disabilities. Springer, Singapore, pp 31–48

    Google Scholar 

  14. Sinha P, Shaob M (2021) Detection of lung cancer in CT scans via deep learning and Cuckoo search optimization and IOT. Peer Rev Bimonthly Int J 11(5):11–19. Helix-The Scientific Explorer

    Google Scholar 

  15. Merugu S, Kumar A, Ghinea G (2023) Geriatric mobility assistive system. In: Track and trace management system for dementia and intellectual disabilities. Springer, Singapore, pp 85–93

    Google Scholar 

  16. Rane CV, Patil SR (2020) Data embeddable texture synthesis with fast data extraction. Peer Rev Bimonthly Int J 10(04):83–89. Helix-The Scientific Explorer

    Google Scholar 

  17. Kinge S (2019) A multi-class fisher linear discriminant approach for the improvement in the accuracy of complex texture discrimination. Peer Rev Bimonthly Int J 9(04):5108–5121. Helix-The Scientific Explorer

    Google Scholar 

  18. Bommagani G, Shaik F (2021) Histogram equalized thresholding method for analysis of diabetic myonecrosis related images. Peer Rev Bimonthly Int J 11(5):32–46. Helix-The Scientific Explorer

    Google Scholar 

Download references

Acknowledgements

Thanks to the Vasavi college of Engineering for sponsoring and supporting to develop this project successfully.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to D. Baswaraj .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Baswaraj, D., Srinivas, K. (2024). Usage of Generative Adversarial Network to Improve Text to Image Synthesis. In: Gunjan, V.K., Kumar, A., Zurada, J.M., Singh, S.N. (eds) Computational Intelligence in Machine Learning. ICCIML 2022. Lecture Notes in Electrical Engineering, vol 1106. Springer, Singapore. https://doi.org/10.1007/978-981-99-7954-7_17

Download citation

Publish with us

Policies and ethics