Unsupervised Image Generation and Manipulation Using Deep Convolutional Adversarial Networks

Ghadekar, Premanand; Joshi, Shaunak; Kokate, Yogini; Kude, Harshada

doi:10.1007/978-981-33-4859-2_4

Part of the book series: Advances in Intelligent Systems and Computing (AISC, volume 1311)

Abstract

In recent years, Computer Vision has advanced rapidly following the introduction of Convolutional Neural Networks (CNNs). However, CNNs have seldom been used for unsupervised learning. Advances in computational power and the availability of large datasets create significant opportunities to apply deep learning to image processing. This paper proposes a new architecture based on the Deep Convolutional Generative Adversarial Network (DCGAN) for unsupervised image generation, and applies it to image manipulation tasks such as denoising, super-resolution, and deconvolution. The proposed model demonstrates that the learned features can be used for image processing tasks and for general use, because DCGAN learns from large datasets and adds high-level image details and features where traditional methods cannot be used. Although the images produced by the proposed architecture and by the original DCGAN architecture are similar in terms of performance, those from the proposed architecture look better to human viewers.

Author information

Authors and Affiliations

Department of Information Technology, Vishwakarma Institute of Technology, Pune, India
Premanand Ghadekar, Shaunak Joshi, Yogini Kokate & Harshada Kude

Corresponding author

Correspondence to Premanand Ghadekar.

Editor information

Editors and Affiliations

Department of Computer Science, Rama Devi Women’s University, Bhubaneswar, Odisha, India
Debabala Swain
School of Computer Engineering, Kalinga Institute of Industrial Technology Deemed University, Bhubaneswar, Odisha, India
Prasant Kumar Pattnaik
Oak Ridge National Laboratory, Tennessee, TN, USA
Tushar Athawale

About this paper

Cite this paper

Ghadekar, P., Joshi, S., Kokate, Y., Kude, H. (2021). Unsupervised Image Generation and Manipulation Using Deep Convolutional Adversarial Networks. In: Swain, D., Pattnaik, P.K., Athawale, T. (eds) Machine Learning and Information Processing. Advances in Intelligent Systems and Computing, vol 1311. Springer, Singapore. https://doi.org/10.1007/978-981-33-4859-2_4

DOI: https://doi.org/10.1007/978-981-33-4859-2_4
Published: 03 April 2021
Publisher Name: Springer, Singapore
Print ISBN: 978-981-33-4858-5
Online ISBN: 978-981-33-4859-2
eBook Packages: Intelligent Technologies and Robotics (R0)
