Fine-Tuning a Pre-trained CAE for Deep One Class Anomaly Detection in Video Footage

Hamdi, Slim; Snoussi, Hichem; Abid, Mohamed

doi:10.1007/978-3-030-71804-6_1

Slim Hamdi^9,10,
Hichem Snoussi⁹ &
Mohamed Abid¹⁰

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1322))

Included in the following conference series:

Mediterranean Conference on Pattern Recognition and Artificial Intelligence

582 Accesses
2 Citations

Abstract

In recent years, abnormal event detection in video surveillance has become a very important task mainly treated by deep learning methods taken into account many challenges. However, these methods still not trained on an anomaly detection based objective which proves their ineffectiveness in such a problem. In this paper, we propose an unsupervised method based on a new architecture for deep one class of convolutional auto-encoders (CAEs) for representing a compact Spatio-temporal feature for anomaly detection. Our CAEs are constructed by added deconvolutions layers to the CNN VGG 16. Then, we train our CAEs for a one-class training objective by fine-tuning our model to properly exploit the richness of the dataset with which CNN was trained. The first CAE is trained on the original frames to extract a good descriptor of shapes and the second CAE is learned using optical flow representations to provide a strength description of motion between frames. For this purpose, we define two loss functions, compactness loss and representativeness loss for training our CAEs architectures not only to maximize the inter-classes distance and to minimize the intra-class distance but also to ensure the tightness and the representativeness of features of normal images. We reduce features dimensions by applying a PCA (Principal Component Analyser) to combine our two descriptors with a Gaussian classifier for abnormal Spatio-temporal events detection. Our method has a high performance in terms of reliability and accuracy. It achieved abnormal event detection with good efficiency in challenging datasets compared to state-of-the-art methods.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Abnormal Event Detection in Videos Using Spatiotemporal Autoencoder

Multi-Stream 3D latent feature clustering for abnormality detection in videos

Article 16 May 2021

Anomaly detection in video surveillance: a supervised inception encoder approach

Article 26 February 2024

References

Beltramelli, T.: Generating code from a graphical user interface screenshot. In: Proceedings of the ACM SIGCHI Symposium on Engineering Interactive Computing Systems, pp. 329–343 (2018)
Google Scholar
Xu, D., Ricci, E., Yan, Y., Song, J., Sebe, N.: Learning deep representations of appearance and motion for anomalous event detection (2015)
Google Scholar
Gutoski, M., Aquino, N.M.R., Ribeiro, M., Lazzaretti, E., Lopes, S.: Detection of video anomalies using convolutional autoencoders and one-class support vector machines (2017)
Google Scholar
Hasan, M., Choi, J., Neumann, J., Roy-Chowdhury, A.K., Davis, L.S.: Learning Temporal Regularity in Video Sequences, pp. 733–742 (2016)
Google Scholar
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016)
Google Scholar
Taigman, Y., Yang, M., Ranzato, M., Wolf, L.: Deepface: closing the gap to human- level performance in face verification. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1701–1708 (2014)
Google Scholar
Beltramelli, T.: Generating code from a graphical user interface screenshot. In: Proceedings of the ACM SIGCHI Symposium on Engineering Interactive Computing (2018)
Google Scholar
Conneau, A., Schwenk, H., Barrault, L., Lecun, Y.: Very deep convolutional networks for natural language processing, vol. 2 (2016). arXiv preprint. arXiv:1606.01781
Amodei, D., et al.: Deep speech 2: end-to-end speech recognition in English and Mandarin. In: International Conference on Machine Learning, pp. 173–182 (2016)
Google Scholar
Yen, S., Wang, C.: Abnormal Event Detection Using HOSF, pp. 1–4 (2013)
Google Scholar
Reddy, V., Sanderson, C., Lovell, B.C.: Improved anomaly detection in crowded scenes via cell-based analysis of foreground speed, size and texture, pp. 55–61 (2011)
Google Scholar
Wang, T., Snoussi, H.: Detection of abnormal visual events via global optical flow orientation histogram. IEEE Trans. Inf. Forensics Secur. 9(6), 988–998 (2014)
Article Google Scholar
Zhao, B., Fei-Fei, L., Xing, E.P.: Online detection of unusual events in videos via dynamic sparse coding. In: CVPR 2011, pp. 3313–3320 (2011)
Google Scholar
Zhou, S., Shen, W., Zeng, D., Zhang, Z.: Unusual event detection in crowded scenes by trajectory analysis. In: 2015 IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP), pp. 1300–1304 (2015)
Google Scholar
Piciarelli, C., Micheloni, C., Foresti, G.L.: Trajectory-based anomalous event detection. IEEE Trans. Circuits Syst. Video Technol. 18, 1544–1554
Google Scholar
Johnson, N., Hogg, D.: Learning the distribution of object trajectories for event recognition. Image Vis. Comput. 14(8), 609–615 (1996). ISSN 0262–8856. https://doi.org/10.1016/0262-8856(96)01101-8
Redmon, J., Farhadi, A.: YOLOv3: An Incremental Improvement (2018)
Google Scholar
Tran, D., Bourdev, L., Fergus, R., Torresani, L., Paluri, M.: Learning spatiotemporal features with 3D convolutional networks. In: The IEEE International Conference on Computer Vision (ICCV) (2015)
Google Scholar
Schroff, F., Kalenichenko, D., Philbin, J.: FaceNet: a unified embedding for face recognition and clustering. In: The IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2015)
Google Scholar
Zhou, S., Shen, W., Zeng, D., Fang, M., Wei, Y., Zhang, Z.: Spatial-temporal convolutional neural networks for anomaly detection and localization in crowded scenes. Sig. Process. Image Commun. 47, 358–368 (2016)
Google Scholar
Ravanbakhsh, M., Nabi, M., Mousavi, H., Sangineto, E., Sebe, N.: Plug-and-Play CNN for crowd motion analysis: an application in abnormal event detection. In: 2018 IEEE Winter Conference on Applications of Computer Vision (WACV) (2018)
Google Scholar
Sabokrou, M., et al.: Avid: adversarial visual irregularity detection. arXiv preprint arXiv:1805.09521 (2018)
Mehran, R., Oyama, A., Shah, M.: Abnormal crowd behavior detection using social force model. In: Computer Vision and Pattern Recognition, pp. 935–942 (2009)
Google Scholar
Kim, J., Grauma, K.: Observe locally, infer globally: a space-time MRF for detecting abnormal activities with incremental updates. In: Computer Vision and Pattern Recognition, pp. 2921–2928 (2009)
Google Scholar
Bertini, M., Del Bimbo, A., Seidenari, L.: Multi-scale and real-time non-parametric approach for anomaly detection and localization. Computer Vis. Image Underst. 116(3), 320–329 (2012)
Article Google Scholar
Zhou, S., Shen, W., Zeng, D., Fang, M., Wei, Y., Zhang, Z.: Spatial-temporal convolutional neural networks for anomaly detection and localization in crowded scenes. Signal Process. Image Commun. 47, 358–368 (2016)
Article Google Scholar
Bouindour, S., Hittawe, M.M., Mahfouz, S., Snoussi, H.: Abnormal event detection using convolutional neural networks and 1-class SVM classifier. In: 8th International Conference on Imaging for Crime Detection and Prevention (ICDP 2017) (2017)
Google Scholar
Hamdi, S., Bouindour, S., Loukil, K., Snoussi, H., Abid, M.: Hybrid deep learning and HOF for Anomaly Detection. In: 2019 6th International Conference on Control, Decision and Information Technologies (CoDIT), pp. 575–580 (2019)
Google Scholar
Li, W., Mahadevan, V., Vasconcelos, N.: Anomaly detection and localization in crowded scenes. IEEE Trans. Pattern Anal. Mach. Intell. 36(1), 18–32 (2014)
Article Google Scholar
Chong, Y.S., Tay, Y.H.: Abnormal event detection in videos using spatiotemporal autoencoder. In: Proceedings CVPRR in International Symposium on Neural Networks, pp. 189–196 (2017)
Google Scholar
Xiao, T., Zhang, C., Zha, H.: Learning to detect anomalies in surveillance video. IEEE Sig. Process. Lett. I 22(9), 1477–1481 (2015)
Article Google Scholar
Wu, S., et al.: Chaotic invariants of Lagrangian particle trajectories for anomaly detection in crowded scenes. In: IEEE Conference on Computer Vision Pattern Recognition, pp. 2054–2060 (2010)
Google Scholar
Saligrama, V., Chen, Z.: Chaotic invariants based on local statistical aggregates. J. IEEE Conf. Comput. Vis. Pattern Recogn. 2112–2119 (2012)
Google Scholar
Cong, Y., et al.: Sparse reconstruction cost for abnormal event detection. In: IEEE Conference on Computer Vision Pattern Recognition, pp. 3449–3456 (2011)
Google Scholar
Perera, P., Patel, V.M.: Learning deep features for one-class classification. In: IEEE Conference on Computer Vision Pattern Recognition, pp. 3449–3456 (2011)
Google Scholar

Download references

Author information

Authors and Affiliations

LM2S University of Technology of Troyes, 12, rue Marie Curie - CS 42060, 10004, Troyes Cedex, France
Slim Hamdi & Hichem Snoussi
CES Laboratory ENIS National Engineering School University of Sfax, B.P. 3038, Sfax, Tunisia
Slim Hamdi & Mohamed Abid

Authors

Slim Hamdi
View author publications
You can also search for this author in PubMed Google Scholar
Hichem Snoussi
View author publications
You can also search for this author in PubMed Google Scholar
Mohamed Abid
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Slim Hamdi .

Editor information

Editors and Affiliations

Larbi Tebessi University, Tebessa, Algeria
Chawki Djeddi
Digital Research Center of Sfax, Sfax, Tunisia
Yousri Kessentini
Bahria University, Islamabad, Pakistan
Imran Siddiqi
Digital Research Centre of Sfax, Sfax, Tunisia
Mohamed Jmaiel

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Hamdi, S., Snoussi, H., Abid, M. (2021). Fine-Tuning a Pre-trained CAE for Deep One Class Anomaly Detection in Video Footage. In: Djeddi, C., Kessentini, Y., Siddiqi, I., Jmaiel, M. (eds) Pattern Recognition and Artificial Intelligence. MedPRAI 2020. Communications in Computer and Information Science, vol 1322. Springer, Cham. https://doi.org/10.1007/978-3-030-71804-6_1

Download citation

DOI: https://doi.org/10.1007/978-3-030-71804-6_1
Published: 18 March 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-71803-9
Online ISBN: 978-3-030-71804-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Fine-Tuning a Pre-trained CAE for Deep One Class Anomaly Detection in Video Footage

Abstract

Access this chapter

Similar content being viewed by others

Abnormal Event Detection in Videos Using Spatiotemporal Autoencoder

Multi-Stream 3D latent feature clustering for abnormality detection in videos

Anomaly detection in video surveillance: a supervised inception encoder approach

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Fine-Tuning a Pre-trained CAE for Deep One Class Anomaly Detection in Video Footage

Abstract

Access this chapter

Similar content being viewed by others

Abnormal Event Detection in Videos Using Spatiotemporal Autoencoder

Multi-Stream 3D latent feature clustering for abnormality detection in videos

Anomaly detection in video surveillance: a supervised inception encoder approach

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation