Skip to main content

Generating Videos Based onĀ Convolutional Recurrent Generative Adversarial Networks

  • Conference paper
  • First Online:
ICGG 2018 - Proceedings of the 18th International Conference on Geometry and Graphics (ICGG 2018)

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 809))

Included in the following conference series:

  • 314 Accesses

Abstract

In this paper, we propose a new way to generate videos via recurrent convolutional generative adversarial networks (CRGAN). The video tasks involving spatio-temporal series are more difficult than image tasks. In order to deal with spatio-temporal series tasks, we use a method that combines convolutional neural networks (CNN), which are used to deal with spatio relationships of videos, with Long Short Term Memory (LSTM), which is a variant of recurrent neural networks and used to deal with temporal relationships of videos, called convolutional recurrent neural networks (CRNN) to process the video inputs.Generative adversarial networks (GAN) is a method of unsupervised learning and has attained great improvements in image generation. In our paper, we combine CRNN with GAN and use unsupervised learning to generate videos. In the end, we will present some videos generated by our methods.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 509.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 649.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. In: Advances in Neural Information Processing Systems 25, pp. 1097ā€“1105. Curran Associates, Inc. (2012). http://papers.nips.cc/paper/4824-imagenet-classification-with-deep-convolutional-neural-networks.pdf

  2. Goodfellow, I., Pouget-Abadie, J., Mirza, M., Xu, B., Warde-Farley, D., Ozair, S., Courville, A., Bengio, Y.: Generative adversarial nets. In: Ghahramani, Z., Welling, M., Cortes, C., Lawrence, N.D., Weinberger K.Q. (eds.) Advances in Neural Information Processing Systems 27, pp. 2672ā€“2680. Curran Associates, Inc. (2014). http://papers.nips.cc/paper/5423-generative-adversarial-nets.pdf

  3. He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. CoRR abs/1512.03385 (2015). arXiv:1512.03385

  4. Ji, S., Xu, W., Yang, M., Yu, K.: 3D convolutional neural networks for human action recognition. IEEE Trans. Pattern Anal. Mach. Intell. 35(1), 221ā€“231 (2013)

    ArticleĀ  Google ScholarĀ 

  5. Donahue, J., Anne Hendricks, L., Guadarrama, S., Rohrbach, M., Venugopalan, S., Saenko, K., Darrell, T.: Long-term recurrent convolutional networks for visual recognition and description. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2625ā€“2634 (2015)

    Google ScholarĀ 

  6. Radford, A., Metz, L., Chintala, S.: Unsupervised representation learning with deep convolutional generative adversarial networks. arXiv preprint arXiv:1511.06434 (2015)

  7. Salimans, T., Goodfellow, I., Zaremba, W., Cheung, V., Radford, A., Chen, X.: Improved techniques for training GANs. In: Advances in Neural Information Processing Systems, pp. 2234ā€“2242 (2016)

    Google ScholarĀ 

  8. Mirza, M., Osindero, S.: Conditional generative adversarial nets. arXiv preprint arXiv:1411.1784 (2014)

  9. Arjovsky, M., Chintala, S., Bottou, L.: Wasserstein GAN. arXiv preprint arXiv:1701.07875 (2017)

  10. Srivastava, N., Mansimov, E., Salakhudinov, R.: Unsupervised learning of video representations using LSTMs. In: International Conference on Machine Learning, pp. 843ā€“852 (2015)

    Google ScholarĀ 

  11. Vondrick, C., Pirsiavash, H., Torralba, A.: Generating videos with scene dynamics. In: Advances In Neural Information Processing Systems, pp. 613ā€“621 (2016)

    Google ScholarĀ 

  12. LeCun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-based learning applied to document recognition. Proc. IEEE 86(11), 2278ā€“2324 (1998)

    ArticleĀ  Google ScholarĀ 

  13. Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014)

  14. Britz, D.: Recurrent neural networks tutorial part 1 introduction to RNNs. http://www.wildml.com/2015/09/recurrent-neural-networks-tutorial-part-1-introduction-to-rnns/ (2015)

  15. Sak, H., Senior, A., Beaufays, F.: Long short-term memory recurrent neural network architectures for large scale acoustic modeling. In: Fifteenth Annual Conference of the International Speech Communication Association (2014)

    Google ScholarĀ 

  16. Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8), 1735ā€“1780 (1997)

    ArticleĀ  Google ScholarĀ 

  17. http://www.stratio.com/blog/deep-learning-3-recurrent-neural-networks-lstm/

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yachao Li .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

Ā© 2019 Springer International Publishing AG, part of Springer Nature

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Li, Y., Komma, T. (2019). Generating Videos Based onĀ Convolutional Recurrent Generative Adversarial Networks. In: Cocchiarella, L. (eds) ICGG 2018 - Proceedings of the 18th International Conference on Geometry and Graphics. ICGG 2018. Advances in Intelligent Systems and Computing, vol 809. Springer, Cham. https://doi.org/10.1007/978-3-319-95588-9_116

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-95588-9_116

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-95587-2

  • Online ISBN: 978-3-319-95588-9

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics