Advertisement

Multimedia Tools and Applications

, Volume 78, Issue 3, pp 3633–3647 | Cite as

Abnormal event detection for video surveillance using deep one-class learning

  • Jiayu Sun
  • Jie ShaoEmail author
  • Chengkun He
Article

Abstract

Abnormal event detection and localization is a challenging research problem in intelligent video surveillance. It is designed to automatically identify abnormal events from monitoring videos. The main difficulty of this task lies in that there is only one class called “normal event” in training video sequences. In recent years, many advanced algorithms have been proposed on the basis of hand-crafted features. Only a few algorithms are based on high-level features, but almost all these methods use two-stage learning. In this paper, we propose a novel end-to-end model which integrates the one-class Support Vector Machine (SVM) into Convolutional Neural Network (CNN), named Deep One-Class (DOC) model. Specifically, the robust loss function derived from the one-class SVM is proposed to optimize the parameters of this model. Compared with the hierarchical models, our model not only simplifies the complexity of the process, but also obtains the global optimal solution of the whole process. In the experiments, we validate our DOC model with a publicly available dataset and compare it with some state-of-art methods. The comparison results demonstrate that our model has great performance and it is effective for abnormal events detection from surveillance videos.

Keywords

Abnormal event detection Deep learning One-class SVM Video surveillance 

Notes

Acknowledgements

This work is supported by the National Natural Science Foundation of China (grants No. 61672133 and No. 61632007), and the Fundamental Research Funds for the Central Universities (grants No. ZYGX2015J058 and No. ZYGX2014Z007).

References

  1. 1.
    Adam A, Rivlin E, Shimshoni I, Reinitz D (2008) Robust real-time unusual event detection using multiple fixed-location monitors. IEEE Trans Pattern Anal Mach Intell 30(3):555–560CrossRefGoogle Scholar
  2. 2.
    Bengio Y (2009) Learning deep architectures for AI. Foundations and Trends in Machine Learning 2(1):1–127MathSciNetCrossRefGoogle Scholar
  3. 3.
    Boiman O, Irani M (2007) Detecting irregularities in images and in video. Int J Comput Vis 74(1):17–31CrossRefGoogle Scholar
  4. 4.
    Chan T, Jia K, Gao S, Lu J, Zeng Z, Ma Y (2015) Pcanet: a simple deep learning baseline for image classification. IEEE Trans Image Processing 24(12):5017–5032MathSciNetCrossRefGoogle Scholar
  5. 5.
    Chen Y, Zhou XS, Huang TS (2001) One-class SVM for learning in image retrieval. In: Proceedings of the 2001 international conference on image processing, ICIP 2001, Thessaloniki, Greece, October 7–10, 2001, pp 34–37Google Scholar
  6. 6.
    Cong Y, Yuan J, Liu J (2011) Sparse reconstruction cost for abnormal event detection. In: The 24th IEEE conference on computer vision and pattern recognition, CVPR 2011, Colorado Springs, CO, USA, 20–25 June 2011, pp 3449–3456Google Scholar
  7. 7.
    Cui X, Liu Q, Gao M, Metaxas DN (2011) Abnormal detection using interaction energy potentials. In: The 24th IEEE conference on computer vision and pattern recognition, CVPR 2011, Colorado Springs, CO, USA, 20–25 June 2011, pp 3161–3167Google Scholar
  8. 8.
    Fanello SR, Gori I, Metta G, Odone F (2013) One-shot learning for real-time action recognition. In: Pattern recognition and image analysis - 6th Iberian conference, IbPRIA 2013, Funchal, Madeira, Portugal, June 5–7, 2013. Proceedings, pp 31–40Google Scholar
  9. 9.
    Feichtenhofer C, Pinz A, Wildes RP (2016) Spatiotemporal residual networks for video action recognition. In: Advances in neural information processing systems 29: Annual conference on neural information processing systems 2016, December 5–10, 2016, Barcelona, Spain, pp 3468–3476Google Scholar
  10. 10.
    Feng Y, Yuan Y, Lu X (2016) Deep representation for abnormal event detection in crowded scenes. In: Proceedings of the 2016 ACM conference on multimedia conference, MM 2016, Amsterdam, The Netherlands, October 15–19, 2016, pp 591–595Google Scholar
  11. 11.
    Hu R, Zhu X, Cheng D, He W, Yan Y, Song J, Zhang S (2017) Graph self-representation method for unsupervised feature selection. Neurocomputing 220:130–137CrossRefGoogle Scholar
  12. 12.
    Itti L, Baldi P (2005) A principled approach to detecting surprising events in video. In: 2005 IEEE computer society conference on computer vision and pattern recognition (CVPR 2005), 20–26 June 2005, San Diego, CA, USA, pp 631–637Google Scholar
  13. 13.
    Kratz L, Nishino K (2009) Anomaly detection in extremely crowded scenes using spatio-temporal motion pattern models. In: 2009 IEEE computer society conference on computer vision and pattern recognition (CVPR 2009), 20–25 June 2009, Miami, Florida, USA, pp 1446–1453Google Scholar
  14. 14.
    Li W, Mahadevan V, Vasconcelos N (2014) Anomaly detection and localization in crowded scenes. IEEE Trans Pattern Anal Mach Intell 36(1):18–32CrossRefGoogle Scholar
  15. 15.
    Lu C, Shi J, Jia J (2013) Abnormal event detection at 150 FPS in MATLAB. In: IEEE international conference on computer vision, ICCV 2013, Sydney, Australia, December 1–8, 2013, pp 2720–2727Google Scholar
  16. 16.
    Mahadevan V, Li W, Bhalodia V, Vasconcelos N (2010) Anomaly detection in crowded scenes. In: The twenty-third IEEE conference on computer vision and pattern recognition, CVPR 2010, San Francisco, CA, USA, 13–18 June 2010, pp 1975–1981Google Scholar
  17. 17.
    Reddy V, Sanderson C, Lovell BC (2011) Improved anomaly detection in crowded scenes via cell-based analysis of foreground speed, size and texture. In: IEEE conference on computer vision and pattern recognition, CVPR workshops 2011, Colorado Springs, CO, USA, 20-25 June, 2011, pp 55–61Google Scholar
  18. 18.
    Sabokrou M, Fayyaz M, Fathy M, Klette R (2017) Deep-cascade: cascading 3d deep neural networks for fast anomaly detection and localization in crowded scenes. IEEE Trans Image Processing 26(4):1992–2004MathSciNetCrossRefGoogle Scholar
  19. 19.
    Wang P, Cao Y, Shen C, Liu L, Shen HT (2017) Temporal pyramid pooling based convolutional neural networks for action recognition. IEEE Trans Circuits Syst Video Techn.  https://doi.org/10.1109/TCSVT.2016.2576761
  20. 20.
    Wang P, Liu L, Shen C, Huang Z, van den Hengel A, Shen HT (2016) What’s wrong with that object? Identifying images of unusual objects by modelling the detection score distribution. In: 2016 IEEE conference on computer vision and pattern recognition, CVPR 2016, Las Vegas, NV, USA, June 27–30, 2016, pp 1573–1581Google Scholar
  21. 21.
    Wang P, Liu L, Shen C, Huang Z, van den Hengel A, Shen HT (2017) Multi-attention network for one shot learning. In: 2017 IEEE conference on computer vision and pattern recognition, CVPR 2017, Honolulu, Hawaii, USA, July 22–25, 2017, pp 2721–2729Google Scholar
  22. 22.
    Xu D, Ricci E, Yan Y, Song J, Sebe N (2015) Learning deep representations of appearance and motion for anomalous event detection. In: Proceedings of the british machine vision conference 2015, BMVC 2015, Swansea, UK, September 7–10, 2015, pp 8.1–8.12Google Scholar
  23. 23.
    Yuan Y, Feng Y, Lu X (2017) Statistical hypothesis detector for abnormal event detection in crowded scenes. IEEE Trans Cybernetics.  https://doi.org/10.1109/TCYB.2016.2572609
  24. 24.
    Zhang B, Wang L, Wang Z, Qiao Y, Wang H (2016) Real-time action recognition with enhanced motion vector cnns. In: 2016 IEEE conference on computer vision and pattern recognition, CVPR 2016, Las Vegas, NV, USA, June 27–30, 2016, pp 2718–2726Google Scholar
  25. 25.
    Zhao B, Li F, Xing EP (2011) Online detection of unusual events in videos via dynamic sparse coding. In: The 24th IEEE conference on computer vision and pattern recognition, CVPR 2011, Colorado Springs, CO, USA, 20–25 June 2011, pp 3313–3320Google Scholar
  26. 26.
    Zhu X, Li X, Zhang S (2016) Block-row sparse multiview multilabel learning for image classification. IEEE Trans Cybernetics 46(2):450–461CrossRefGoogle Scholar
  27. 27.
    Zhu X, Li X, Zhang S, Ju C, Wu X (2017) Robust joint graph sparse coding for unsupervised spectral feature selection. IEEE Trans Neural Netw Learning Syst 28(6):1263–1275MathSciNetCrossRefGoogle Scholar
  28. 28.
    Zhu X, Zhang L, Huang Z (2014) A sparse embedding and least variance encoding approach to hashing. IEEE Trans Image Processing 23(9):3737–3750MathSciNetCrossRefGoogle Scholar

Copyright information

© Springer Science+Business Media, LLC 2017

Authors and Affiliations

  1. 1.Center for Future Media, School of Computer Science and EngineeringUniversity of Electronic Science and Technology of ChinaChengduChina

Personalised recommendations