Generating Videos Based on Convolutional Recurrent Generative Adversarial Networks

Li, Yachao; Komma, Toshihiro

doi:10.1007/978-3-319-95588-9_116

Part of the book series: Advances in Intelligent Systems and Computing (AISC, volume 809)

Included in the following conference series:

International Conference on Geometry and Graphics

Abstract

In this paper, we propose a new way to generate videos using convolutional recurrent generative adversarial networks (CRGAN). Video tasks involve spatio-temporal sequences and are therefore more difficult than image tasks. To handle such sequences, we process the video inputs with a convolutional recurrent neural network (CRNN), which combines convolutional neural networks (CNN), used to model the spatial relationships of videos, with Long Short-Term Memory (LSTM), a variant of recurrent neural networks used to model the temporal relationships of videos. Generative adversarial networks (GAN) are an unsupervised learning method that has brought great improvements in image generation. We combine CRNN with GAN and use unsupervised learning to generate videos. Finally, we present some videos generated by our method.
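The CRNN idea in the abstract (a CNN applied to each frame, followed by an LSTM across frames) can be illustrated with a small, dependency-free sketch. This is not the authors' architecture: the layer sizes, the random weights, the global average pooling, and all function names (`conv2d_valid`, `frame_features`, `lstm_step`, `crnn_encode`, `discriminator_score`) are assumptions made for illustration only. The final sigmoid score hints at how such an encoder might serve as a GAN discriminator.

```python
import math
import random


def conv2d_valid(frame, kernel):
    """'Valid' 2D cross-correlation of one grayscale frame with one kernel."""
    kh, kw = len(kernel), len(kernel[0])
    out_h, out_w = len(frame) - kh + 1, len(frame[0]) - kw + 1
    return [[sum(frame[i + a][j + b] * kernel[a][b]
                 for a in range(kh) for b in range(kw))
             for j in range(out_w)]
            for i in range(out_h)]


def frame_features(frame, kernels):
    """CNN stage (spatial): ReLU then global average pooling, one value per kernel."""
    feats = []
    for kernel in kernels:
        values = [max(0.0, v) for row in conv2d_valid(frame, kernel) for v in row]
        feats.append(sum(values) / len(values))
    return feats


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def lstm_step(x, h, c, weights):
    """One LSTM step; weights maps gate name -> (rows over [x; h], bias)."""
    xh = x + h  # concatenate input features and previous hidden state

    def gate(name, activation):
        rows, bias = weights[name]
        return [activation(sum(w * v for w, v in zip(row, xh)) + b)
                for row, b in zip(rows, bias)]

    i, f, o = gate("i", sigmoid), gate("f", sigmoid), gate("o", sigmoid)
    g = gate("g", math.tanh)
    c_new = [fj * cj + ij * gj for fj, cj, ij, gj in zip(f, c, i, g)]
    h_new = [oj * math.tanh(cj) for oj, cj in zip(o, c_new)]
    return h_new, c_new


def crnn_encode(video, kernels, weights, hidden):
    """LSTM stage (temporal): run the per-frame CNN features through the LSTM."""
    h, c = [0.0] * hidden, [0.0] * hidden
    for frame in video:
        h, c = lstm_step(frame_features(frame, kernels), h, c, weights)
    return h  # final hidden state summarizes the whole clip


def discriminator_score(summary, out_weights, out_bias=0.0):
    """Hypothetical real/fake probability computed from the clip summary."""
    return sigmoid(sum(w * v for w, v in zip(out_weights, summary)) + out_bias)


def random_params(n_kernels, ksize, hidden, seed=0):
    """Untrained random parameters, for shape illustration only."""
    rng = random.Random(seed)

    def u():
        return rng.uniform(-0.5, 0.5)

    kernels = [[[u() for _ in range(ksize)] for _ in range(ksize)]
               for _ in range(n_kernels)]
    weights = {name: ([[u() for _ in range(n_kernels + hidden)]
                       for _ in range(hidden)], [0.0] * hidden)
               for name in "ifog"}
    out_weights = [u() for _ in range(hidden)]
    return kernels, weights, out_weights


# Usage: a toy "video" of 5 random 8x8 grayscale frames.
kernels, weights, out_weights = random_params(n_kernels=3, ksize=3, hidden=4)
rng = random.Random(1)
video = [[[rng.random() for _ in range(8)] for _ in range(8)] for _ in range(5)]
summary = crnn_encode(video, kernels, weights, hidden=4)
score = discriminator_score(summary, out_weights)
```

In a GAN setting, a generator would be trained to produce clips for which such a score is high, while the discriminator is trained to separate generated clips from real ones. The sketch only shows the forward pass with untrained weights.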

Author information

Authors and Affiliations

Tokyo Metropolitan University, Tokyo, Japan
Yachao Li & Toshihiro Komma

Corresponding author

Correspondence to Yachao Li.

Editor information

Editors and Affiliations

Department of Architecture and Urban Studies—DASTU, School of Architecture Urban Planning and Construction Engineering—AUIC, Politecnico di Milano, Milan, Italy
Luigi Cocchiarella

About this paper

Cite this paper

Li, Y., Komma, T. (2019). Generating Videos Based on Convolutional Recurrent Generative Adversarial Networks. In: Cocchiarella, L. (eds) ICGG 2018 - Proceedings of the 18th International Conference on Geometry and Graphics. ICGG 2018. Advances in Intelligent Systems and Computing, vol 809. Springer, Cham. https://doi.org/10.1007/978-3-319-95588-9_116

DOI: https://doi.org/10.1007/978-3-319-95588-9_116
Published: 07 July 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-95587-2
Online ISBN: 978-3-319-95588-9
eBook Packages: Engineering, Engineering (R0)
