Event Recognition on Images by Fine-Tuning of Deep Neural Networks

Yudin, Dmitry; Zeno, Bassel

doi:10.1007/978-3-319-68321-8_49

Dmitry Yudin²⁰ &
Bassel Zeno²¹

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 679))

Included in the following conference series:

International Conference on Intelligent Information Technologies for Industry

799 Accesses
3 Citations

Abstract

The paper considers usage of fine-tuning of the deep neural network ensemble for recognition of 60 event types in the set of 60,000 images from WIDER database. The applied ensemble consists of two deep convolutional neural networks (CNN) using the GoogLeNet architecture, previously trained on other image bases: ImageNet and Places. Separately the accuracy of recognition of 10 events was analyzed: “Car Racing”, “Ceremony”, “Concert”, “Demonstration”, “Football”, “Meeting”, “Picnic”, “Swimming”, “Tennis” and “Traffic”. During the ensemble training output layer in the each of deep CNN is replaced to the layer with respectively 10 and 60 neurons and we tune only weights which connect output layer with previous one. The classification accuracy of 10 event classes from the WIDER image database averages 83.22%, for 60 event classes accuracy is 50.4%. In addition, the approach based on the automatic features formation using deep CNN provided a much better recognition quality of social events compared to the choice of features manually (LBP, LDP or HOG) and their further classification by support vector machine. The testing time of the developed ensemble provides the possibility of using the classifier in practical applications of event recognition with a processing speed up to 20 frames per second.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Zeno, B., Yudin, D., Alkhatib, B.: Event recognition on images using support vector machine and multi-level histograms of local patterns. ARPN J. Eng. Appl. Sci. 11(20), 12282–12287 (2016)
Google Scholar
Hinton, G., Osindero, S., The, Y.-W.: A fast learning algorithm for deep belief nets. Neural Comput. 18(7), 1527–1554 (2006)
Article MathSciNet MATH Google Scholar
Bengio, Y., Courville, A., Vincent, P.: Representation learning: a review and new perspectives. IEEE TPAMI 35(8), 1798–1828 (2013)
Article Google Scholar
LeCun, Y., Boser, B., Denker, J.S., Henderson, D., Howard, R.E., Hubbard, W., Jackel, L.D.: Backpropagation applied to handwritten zip code recognition. Neural Comput. 1(4), 541–551 (1989)
Article Google Scholar
Krizhevsky, A., Sutskever, I., Hinton, G. E.: ImageNet classification with deep convolutional neural networks. In: NIPS, pp. 1106–1114 (2012)
Google Scholar
Razavian, A., Azizpour, H., Sullivan, J., Carlsson, S.: CNN Features off-the-shelf: an Astounding Baseline for Recognition. In: CoRR, arXiv:1403.6382 (2014)
Web Image Dataset for Event Recognition (WIDER). http://personal.ie.cuhk.edu.hk/~xy012/event_recog/WIDER/ Accessed 12 Apr 2017
Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., Erhan, D., Vanhoucke, V., Rabinovich, A.: Going deeper with convolutions. arXiv:1409.4842 (2014)
Deng, J., Dong, W., Socher, R., Li, L.-J., Li, K., Fei-Fei, L.: Imagenet: A large-scale hierarchical image database. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 248–255 (2009)
Google Scholar
Large Scale Visual Recognition Challenge 2012 (ILSVRC2012). http://www.image-net.org/challenges/LSVRC/2012/index Accessed 12 Apr 2017
Zhou, B., Lapedriza, A., Xiao, J., Torralba, A., Aude, O.: Learning deep features for scene recognition using places database. In: Advances in Neural Information Processing Systems, pp. 487–495 (2014)
Google Scholar
Places Database. http://places.csail.mit.edu/ Accessed 12 Apr 2017
Zeiler, M.D., Fergus, R.: Visualizing and understanding convolutional networks. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) Computer Vision – ECCV 2014. ECCV 2014. Lecture Notes in Computer Science, vol. 8689, pp. 818–833. Springer, Cham (2014)
Google Scholar
Jia, Y.: Caffe: Deep learning framework by the BVLC. http://caffe.berkeleyvision.org/ Accessed 12 Apr 2017
Zhang, B., Gao, Y.: Local Derivative Pattern Versus Local Binary Pattern: Face Recognition With High Order Local Pattern Descriptor. IEEE Trans. Image Process. 19(2), 533–544 (2010)
Article MathSciNet Google Scholar
Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: Conference on Computer Vision and Pattern Recognition (CVPR), vol. 1, pp. 886–893. IEEE Computer Society, Washington (2005)
Google Scholar
Xiong, Y., Zhu, K., Lin D., Tang, X.: Recognize Complex Events from Static Images by Fusing Deep Channels. In: Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1600–1609 (2015)
Google Scholar

Download references

Acknowledgment

This article is written in the course of the grant of the President of the Russian Federation for state support of young Russian scientists № MK-3130.2017.9 (contract № 14.Z56.17.3130-MK) on the theme “Recognition of road conditions on images using deep learning”.

Author information

Authors and Affiliations

Belgorod State Technological University Named After V.G. Shukhov, Kostukova Str. 46, Belgorod, 308012, Russia
Dmitry Yudin
ITMO University, Kronverksky Pr. 49, St. Petersburg, 197101, Russia
Bassel Zeno

Authors

Dmitry Yudin
View author publications
You can also search for this author in PubMed Google Scholar
Bassel Zeno
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Dmitry Yudin .

Editor information

Editors and Affiliations

Scientific Network for Innovation and Research Excellence, Machine Intelligence Research Labs (MIR labs), Auburn, Washington, USA
Ajith Abraham
Rostov State Transport University , Rostov-on-Don, Russia
Sergey Kovalev
Bauman Moscow State Technical University , Moscow, Russia
Valery Tarassov
VSB-Technical University of Ostrava , Ostrava, Czech Republic
Vaclav Snasel
Department Electrical Power Engineering, Technical University of Varna, Varna, Bulgaria
Margreta Vasileva
Rostov State Transport University , Rostov-on-Don, Russia
Andrey Sukhanov

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Yudin, D., Zeno, B. (2018). Event Recognition on Images by Fine-Tuning of Deep Neural Networks. In: Abraham, A., Kovalev, S., Tarassov, V., Snasel, V., Vasileva, M., Sukhanov, A. (eds) Proceedings of the Second International Scientific Conference “Intelligent Information Technologies for Industry” (IITI’17). IITI 2017. Advances in Intelligent Systems and Computing, vol 679. Springer, Cham. https://doi.org/10.1007/978-3-319-68321-8_49

Download citation

DOI: https://doi.org/10.1007/978-3-319-68321-8_49
Published: 30 September 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-68320-1
Online ISBN: 978-3-319-68321-8
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics