Research and Practice of Video Recognition Based on Deep Learning

Ren, Jie; Shi, Heping; Cao, Jihua

doi:10.1007/978-981-16-9423-3_69

Part of the book series: Lecture Notes in Electrical Engineering (LNEE, volume 854)

Abstract

This paper uses the TensorFlow and Keras libraries to build a deep learning environment. A deep 3D convolutional network model and a Long Short-Term Memory (LSTM) network model are designed and built, and videos of known categories from the UCF-101 dataset are used as training samples to train the networks. Some videos in the dataset are used as test samples to verify the recognition performance of the network models and to classify the videos. Finally, TensorBoard is used to visualize and analyze the network training process. The experimental results show that the model performs well on video recognition.
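
The abstract does not say how the 3D convolutional network and the LSTM network are combined, nor does it give layer sizes. As a minimal sketch of one common arrangement, the Python code below stacks 3D convolution layers in front of an LSTM using Keras. The clip shape (16 frames of 112×112 RGB), the filter counts, the pooling sizes, and the helper names `pooled_shape`, `trace_shapes`, and `build_model` are all assumptions made for illustration, not values from the chapter. The only dataset fact used is that UCF-101 has 101 action categories.

```python
"""Illustrative sketch only: layer sizes and clip dimensions are assumptions,
not values reported in the chapter."""

NUM_CLASSES = 101                 # UCF-101 has 101 action categories
CLIP_SHAPE = (16, 112, 112, 3)    # assumed: frames, height, width, channels


def pooled_shape(shape, pool):
    """Shape after a 'same'-padded Conv3D followed by MaxPooling3D(pool).

    With 'same' padding and stride 1 the convolution keeps frames, height and
    width unchanged, so only the pooling step (floor division) shrinks them.
    """
    frames, height, width = shape
    pf, ph, pw = pool
    return (frames // pf, height // ph, width // pw)


def trace_shapes(clip_shape=CLIP_SHAPE, pools=((1, 2, 2), (1, 2, 2))):
    """Return the (frames, height, width) shape after each conv+pool stage."""
    shape = tuple(clip_shape[:3])
    stages = [shape]
    for pool in pools:
        shape = pooled_shape(shape, pool)
        stages.append(shape)
    return stages


def build_model(clip_shape=CLIP_SHAPE, num_classes=NUM_CLASSES):
    """Build a small Conv3D -> LSTM classifier (hypothetical architecture)."""
    # Imported here so the shape helpers above work without TensorFlow.
    import tensorflow as tf
    from tensorflow.keras import layers

    inputs = tf.keras.Input(shape=clip_shape)
    x = layers.Conv3D(32, (3, 3, 3), padding="same", activation="relu")(inputs)
    x = layers.MaxPooling3D(pool_size=(1, 2, 2))(x)   # pool space, keep frames
    x = layers.Conv3D(64, (3, 3, 3), padding="same", activation="relu")(x)
    x = layers.MaxPooling3D(pool_size=(1, 2, 2))(x)
    # Collapse each frame's feature map to a vector: (batch, frames, 64)
    x = layers.TimeDistributed(layers.GlobalAveragePooling2D())(x)
    x = layers.LSTM(128)(x)                           # summarize the sequence
    outputs = layers.Dense(num_classes, activation="softmax")(x)

    model = tf.keras.Model(inputs, outputs)
    model.compile(optimizer="adam",
                  loss="sparse_categorical_crossentropy",
                  metrics=["accuracy"])
    return model


if __name__ == "__main__":
    # Show how the spatial size shrinks while all 16 frames reach the LSTM.
    for stage in trace_shapes():
        print(stage)
```

Pooling only over height and width keeps every frame as a time step for the LSTM, which is one design choice among several. To reproduce the TensorBoard step mentioned in the abstract, a `tf.keras.callbacks.TensorBoard(log_dir="logs")` callback can be passed to `model.fit(...)` and the logs viewed with `tensorboard --logdir logs`; the directory name here is a placeholder.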

Author information

Authors and Affiliations

School of Electronic Engineering, Tianjin University of Technology and Education, Tianjin, 300222, China
Jie Ren & Jihua Cao
College of Automobiles and Transportation, Tianjin University of Technology and Education, Tianjin, 300222, China
Heping Shi

Corresponding author

Correspondence to Jihua Cao.

Editor information

Editors and Affiliations

Department of Electrical Engineering, University of Texas at Arlington, Arlington, TX, USA
Qilian Liang
Tianjin Normal University, Tianjin, China
Wei Wang
Tianjin Normal University, Tianjin, China
Jiasong Mu
Dalian University of Technology, Dalian, China
Xin Liu
School of Information Science and Technology, Dalian Maritime University, Dalian, China
Zhenyu Na

About this paper

Cite this paper

Ren, J., Shi, H., Cao, J. (2022). Research and Practice of Video Recognition Based on Deep Learning. In: Liang, Q., Wang, W., Mu, J., Liu, X., Na, Z. (eds) Artificial Intelligence in China. Lecture Notes in Electrical Engineering, vol 854. Springer, Singapore. https://doi.org/10.1007/978-981-16-9423-3_69

DOI: https://doi.org/10.1007/978-981-16-9423-3_69
Published: 22 March 2022
Publisher Name: Springer, Singapore
Print ISBN: 978-981-16-9422-6
Online ISBN: 978-981-16-9423-3
eBook Packages: Computer Science, Computer Science (R0)
