Skip to main content

Unsupervised Image Generation and Manipulation Using Deep Convolutional Adversarial Networks

  • Conference paper
  • First Online:
Machine Learning and Information Processing

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 1311))

  • 1033 Accesses

Abstract

In recent years, there has been an outburst in the field of Computer Vision due to the introduction of Convolutional Neural Networks. However, Convolutional Neural Networks have been sparsely used for unsupervised learning. The advancement of computational power and large datasets provide large opportunities to apply deep learning for image processing. This paper proposes a new architecture based on Deep Convolutional Generative Adversarial Network (DCGAN) for unsupervised image generation, its usage for image manipulation tasks such as denoising, super-resolution, and deconvolution. This proposed model demonstrates that the learned features can be used for image processing tasks—demonstrating their applications for general use as DCGAN learns from large datasets and adds high-level image details and features where traditional methods cannot be used. While the image results from the proposed network architecture and the original DCGAN architecture are similar in terms of performance, they are visually better when viewed by humans.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. J. Kim, J. Kwon Lee, K. Mu Lee. Deeply-recursive convolutional network for image super-resolution, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2016), pp. 1637–1645

    Google Scholar 

  2. J. Kim, J. Kwon Lee, K. Mu Lee, Accurate image superresolution using very deep convolutional networks, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2016), pp. 1646–1654

    Google Scholar 

  3. P. Vincent, H. Larochelle, I. Lajoie, Y. Bengio, P.A. Manzagol, Stacked denoising autoencoders: Learning useful representations in a deep network with a local denoising criterion. J. Mach. Learn. Res. 11, 3371–3408 (2010)

    Google Scholar 

  4. C. Dong, C.C. Loy, K. He, X. Tang, Image super-resolution using deep convolutional networks. IEEE Trans. Pattern Anal. Mach. Intell. 38(2), 295–307 (2016)

    Article  Google Scholar 

  5. E. Denton, S. Chintala, A. Szlam, R. Fergus, Deep generative image models using a laplacian pyramid of adversarial networks (2015). arXiv preprint arXiv:1506.05751

  6. K. Gregor, I. Danihelka, A. Graves, D. Wierstra, Draw: a recurrent neural network for image generation (2015). arXiv preprint arXiv:1502.04623

  7. Y.A.H. Bastien, C. Martin-Vide, R. Mitkov, B. Truthe, in Deep Learning of Representations: Looking Forward, vol. 7978 (Springer, Berlin, 2013), pp. 1–37

    Google Scholar 

  8. Y. Bengio, Learning Deep Architectures for AI (Now Publishers, Delft, 2009)

    Google Scholar 

  9. Y. Bengio, G. Mesnil, Y. Dauphin, S. Rifai, Better Mixing via Deep Representation, vol. 28(1) (2013), pp. 552–560

    Google Scholar 

  10. Y. Bengio, E. Thibodeau-Laufer, J. Yosinski, Deep generative stochastic networks trainable by backprop, in Proceedings of the 31st International Conference on International Conference on Machine Learning (ICML’14), vol. 32 (2014), pp. II-226–II-234

    Google Scholar 

  11. I.J. Goodfellow, J. Pouget-Abadie, M. Mirza, B. Xu, D. Warde-Farley, S. Ozair, A. Courville, Y. Bengio, Generative adversarial networks, in Advances in Neural Information Processing Systems, vol 27 (2014)

    Google Scholar 

  12. C. Ledig, L. Theis, F. Huszar, J. Caballero, A. Cunningham, A. Acosta, A. Aitken, A. Tejani, J. Totz, Z. Wang, et al., in Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network (2016). arXiv preprint arXiv:1609.04802

  13. Y. Bengio, Learning deep architectures of AI. Found. Trends Mach. Learn. 2(1), 1–127 (2009)

    Article  MathSciNet  Google Scholar 

  14. G.E. Hinton, S. Osindero, Y.W. Teh, A fast learning algorithm for deep belief nets. Neural Comput. 18, 1527–1554 (2006)

    Google Scholar 

  15. K. He, X. Zhang, S. Ren, J. Sun, Deep residual learning for image recognition, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2016), pp. 770–778

    Google Scholar 

  16. K. He, X. Zhang, S. Ren, J. Sun, Identity mappings in deep residual networks, in European Conference on Computer Vision (Springer, Berlin, 2016), pp. 630–645

    Google Scholar 

  17. O. Breuleux, Y. Bengio, P. Vincent, Quickly generating representative samples from an RBM-derived process. Neural Comput. 23(8), 2053–2073 (2011)

    Article  MathSciNet  Google Scholar 

  18. M. Hardt, B. Recht, Y. Singer, Train Faster, Generalize Better: Stability of Stochastic Gradient Descent (2015). arXiv preprint arXiv:1509.01240

  19. J. Bergstra, Y. Bengio, Random search for hyper-parameter optimization, in JMLR (2012)

    Google Scholar 

  20. https://yann.lecun.com/exdb/mnist/

  21. https://github.com/sunsided/mnist

  22. https://mmlab.ie.cuhk.edu.hk/projects/CelebA.html

  23. https://ufldl.stanford.edu/housenumbers/

  24. https://www.csail.mit.edu/research/places-database-scene-recognition

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Premanand Ghadekar .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Ghadekar, P., Joshi, S., Kokate, Y., Kude, H. (2021). Unsupervised Image Generation and Manipulation Using Deep Convolutional Adversarial Networks. In: Swain, D., Pattnaik, P.K., Athawale, T. (eds) Machine Learning and Information Processing. Advances in Intelligent Systems and Computing, vol 1311. Springer, Singapore. https://doi.org/10.1007/978-981-33-4859-2_4

Download citation

Publish with us

Policies and ethics