Skip to main content

Fine-Tuning a Pre-trained CAE for Deep One Class Anomaly Detection in Video Footage

  • Conference paper
  • First Online:
Pattern Recognition and Artificial Intelligence (MedPRAI 2020)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1322))

Abstract

In recent years, abnormal event detection in video surveillance has become a very important task mainly treated by deep learning methods taken into account many challenges. However, these methods still not trained on an anomaly detection based objective which proves their ineffectiveness in such a problem. In this paper, we propose an unsupervised method based on a new architecture for deep one class of convolutional auto-encoders (CAEs) for representing a compact Spatio-temporal feature for anomaly detection. Our CAEs are constructed by added deconvolutions layers to the CNN VGG 16. Then, we train our CAEs for a one-class training objective by fine-tuning our model to properly exploit the richness of the dataset with which CNN was trained. The first CAE is trained on the original frames to extract a good descriptor of shapes and the second CAE is learned using optical flow representations to provide a strength description of motion between frames. For this purpose, we define two loss functions, compactness loss and representativeness loss for training our CAEs architectures not only to maximize the inter-classes distance and to minimize the intra-class distance but also to ensure the tightness and the representativeness of features of normal images. We reduce features dimensions by applying a PCA (Principal Component Analyser) to combine our two descriptors with a Gaussian classifier for abnormal Spatio-temporal events detection. Our method has a high performance in terms of reliability and accuracy. It achieved abnormal event detection with good efficiency in challenging datasets compared to state-of-the-art methods.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Beltramelli, T.: Generating code from a graphical user interface screenshot. In: Proceedings of the ACM SIGCHI Symposium on Engineering Interactive Computing Systems, pp. 329–343 (2018)

    Google Scholar 

  2. Xu, D., Ricci, E., Yan, Y., Song, J., Sebe, N.: Learning deep representations of appearance and motion for anomalous event detection (2015)

    Google Scholar 

  3. Gutoski, M., Aquino, N.M.R., Ribeiro, M., Lazzaretti, E., Lopes, S.: Detection of video anomalies using convolutional autoencoders and one-class support vector machines (2017)

    Google Scholar 

  4. Hasan, M., Choi, J., Neumann, J., Roy-Chowdhury, A.K., Davis, L.S.: Learning Temporal Regularity in Video Sequences, pp. 733–742 (2016)

    Google Scholar 

  5. He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016)

    Google Scholar 

  6. Taigman, Y., Yang, M., Ranzato, M., Wolf, L.: Deepface: closing the gap to human- level performance in face verification. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1701–1708 (2014)

    Google Scholar 

  7. Beltramelli, T.: Generating code from a graphical user interface screenshot. In: Proceedings of the ACM SIGCHI Symposium on Engineering Interactive Computing (2018)

    Google Scholar 

  8. Conneau, A., Schwenk, H., Barrault, L., Lecun, Y.: Very deep convolutional networks for natural language processing, vol. 2 (2016). arXiv preprint. arXiv:1606.01781

  9. Amodei, D., et al.: Deep speech 2: end-to-end speech recognition in English and Mandarin. In: International Conference on Machine Learning, pp. 173–182 (2016)

    Google Scholar 

  10. Yen, S., Wang, C.: Abnormal Event Detection Using HOSF, pp. 1–4 (2013)

    Google Scholar 

  11. Reddy, V., Sanderson, C., Lovell, B.C.: Improved anomaly detection in crowded scenes via cell-based analysis of foreground speed, size and texture, pp. 55–61 (2011)

    Google Scholar 

  12. Wang, T., Snoussi, H.: Detection of abnormal visual events via global optical flow orientation histogram. IEEE Trans. Inf. Forensics Secur. 9(6), 988–998 (2014)

    Article  Google Scholar 

  13. Zhao, B., Fei-Fei, L., Xing, E.P.: Online detection of unusual events in videos via dynamic sparse coding. In: CVPR 2011, pp. 3313–3320 (2011)

    Google Scholar 

  14. Zhou, S., Shen, W., Zeng, D., Zhang, Z.: Unusual event detection in crowded scenes by trajectory analysis. In: 2015 IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP), pp. 1300–1304 (2015)

    Google Scholar 

  15. Piciarelli, C., Micheloni, C., Foresti, G.L.: Trajectory-based anomalous event detection. IEEE Trans. Circuits Syst. Video Technol. 18, 1544–1554

    Google Scholar 

  16. Johnson, N., Hogg, D.: Learning the distribution of object trajectories for event recognition. Image Vis. Comput. 14(8), 609–615 (1996). ISSN 0262–8856. https://doi.org/10.1016/0262-8856(96)01101-8

  17. Redmon, J., Farhadi, A.: YOLOv3: An Incremental Improvement (2018)

    Google Scholar 

  18. Tran, D., Bourdev, L., Fergus, R., Torresani, L., Paluri, M.: Learning spatiotemporal features with 3D convolutional networks. In: The IEEE International Conference on Computer Vision (ICCV) (2015)

    Google Scholar 

  19. Schroff, F., Kalenichenko, D., Philbin, J.: FaceNet: a unified embedding for face recognition and clustering. In: The IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2015)

    Google Scholar 

  20. Zhou, S., Shen, W., Zeng, D., Fang, M., Wei, Y., Zhang, Z.: Spatial-temporal convolutional neural networks for anomaly detection and localization in crowded scenes. Sig. Process. Image Commun. 47, 358–368 (2016)

    Google Scholar 

  21. Ravanbakhsh, M., Nabi, M., Mousavi, H., Sangineto, E., Sebe, N.: Plug-and-Play CNN for crowd motion analysis: an application in abnormal event detection. In: 2018 IEEE Winter Conference on Applications of Computer Vision (WACV) (2018)

    Google Scholar 

  22. Sabokrou, M., et al.: Avid: adversarial visual irregularity detection. arXiv preprint arXiv:1805.09521 (2018)

  23. Mehran, R., Oyama, A., Shah, M.: Abnormal crowd behavior detection using social force model. In: Computer Vision and Pattern Recognition, pp. 935–942 (2009)

    Google Scholar 

  24. Kim, J., Grauma, K.: Observe locally, infer globally: a space-time MRF for detecting abnormal activities with incremental updates. In: Computer Vision and Pattern Recognition, pp. 2921–2928 (2009)

    Google Scholar 

  25. Bertini, M., Del Bimbo, A., Seidenari, L.: Multi-scale and real-time non-parametric approach for anomaly detection and localization. Computer Vis. Image Underst. 116(3), 320–329 (2012)

    Article  Google Scholar 

  26. Zhou, S., Shen, W., Zeng, D., Fang, M., Wei, Y., Zhang, Z.: Spatial-temporal convolutional neural networks for anomaly detection and localization in crowded scenes. Signal Process. Image Commun. 47, 358–368 (2016)

    Article  Google Scholar 

  27. Bouindour, S., Hittawe, M.M., Mahfouz, S., Snoussi, H.: Abnormal event detection using convolutional neural networks and 1-class SVM classifier. In: 8th International Conference on Imaging for Crime Detection and Prevention (ICDP 2017) (2017)

    Google Scholar 

  28. Hamdi, S., Bouindour, S., Loukil, K., Snoussi, H., Abid, M.: Hybrid deep learning and HOF for Anomaly Detection. In: 2019 6th International Conference on Control, Decision and Information Technologies (CoDIT), pp. 575–580 (2019)

    Google Scholar 

  29. Li, W., Mahadevan, V., Vasconcelos, N.: Anomaly detection and localization in crowded scenes. IEEE Trans. Pattern Anal. Mach. Intell. 36(1), 18–32 (2014)

    Article  Google Scholar 

  30. Chong, Y.S., Tay, Y.H.: Abnormal event detection in videos using spatiotemporal autoencoder. In: Proceedings CVPRR in International Symposium on Neural Networks, pp. 189–196 (2017)

    Google Scholar 

  31. Xiao, T., Zhang, C., Zha, H.: Learning to detect anomalies in surveillance video. IEEE Sig. Process. Lett. I 22(9), 1477–1481 (2015)

    Article  Google Scholar 

  32. Wu, S., et al.: Chaotic invariants of Lagrangian particle trajectories for anomaly detection in crowded scenes. In: IEEE Conference on Computer Vision Pattern Recognition, pp. 2054–2060 (2010)

    Google Scholar 

  33. Saligrama, V., Chen, Z.: Chaotic invariants based on local statistical aggregates. J. IEEE Conf. Comput. Vis. Pattern Recogn. 2112–2119 (2012)

    Google Scholar 

  34. Cong, Y., et al.: Sparse reconstruction cost for abnormal event detection. In: IEEE Conference on Computer Vision Pattern Recognition, pp. 3449–3456 (2011)

    Google Scholar 

  35. Perera, P., Patel, V.M.: Learning deep features for one-class classification. In: IEEE Conference on Computer Vision Pattern Recognition, pp. 3449–3456 (2011)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Slim Hamdi .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Hamdi, S., Snoussi, H., Abid, M. (2021). Fine-Tuning a Pre-trained CAE for Deep One Class Anomaly Detection in Video Footage. In: Djeddi, C., Kessentini, Y., Siddiqi, I., Jmaiel, M. (eds) Pattern Recognition and Artificial Intelligence. MedPRAI 2020. Communications in Computer and Information Science, vol 1322. Springer, Cham. https://doi.org/10.1007/978-3-030-71804-6_1

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-71804-6_1

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-71803-9

  • Online ISBN: 978-3-030-71804-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics