Video anomaly detection with memory-guided multilevel embedding

Zhou, Liuping; Yang, Jing

doi:10.1007/s13735-023-00272-x

Video anomaly detection with memory-guided multilevel embedding

Regular Paper
Published: 15 March 2023

Volume 12, article number 6, (2023)
Cite this article

International Journal of Multimedia Information Retrieval Aims and scope Submit manuscript

Liuping Zhou¹ &
Jing Yang^1,2

450 Accesses
Explore all metrics

Abstract

Playing a vitally important role in the operation of intelligent video surveillance system and smart city, video anomaly detection (VAD) has been widely practiced and studied in both industrial circles and academia. In the present study, a new anomaly detection method is proposed for multi-level memory embedding. According to the novel method, the feature prototype of the sample is stored in the memory pool, which enhances the diversity of the sample feature prototype paradigm. Besides, the memory is embedded in the decoder in a hierarchical integrating manner, which makes the feature information of the object more complete and improves the quality of features. At the end of the model, modeling is performed for the channel relationship between the features of the object in the channel dimension, thus making the model capable of more efficient anomaly detection. This method is verified by conducting evaluation on three publicly available datasets: UCSD Ped2, CUHK Avenue, ShanghaiTech.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Video steganography: recent advances and challenges

Article Open access 04 April 2023

Jayakanth Kunhoth, Nandhini Subramanian, … Ahmed Bouridane

Video summarization using deep learning techniques: a detailed analysis and investigation

Article 15 March 2023

Parul Saini, Krishan Kumar, … Alok Negi

Attention-based residual autoencoder for video anomaly detection

Article Open access 25 May 2022

Viet-Tuan Le & Yong-Guk Kim

Data availability

Not applicable.

References

Tao X, Gong X, Zhang X et al (2022) Deep learning for unsupervised anomaly localization in industrial images: a survey. IEEE Trans Instrum Meas 71:1–21
Google Scholar
Suarez JJP, Naval Jr P C (2020) A survey on deep learning techniques for video anomaly detection. arXiv preprint arXiv:2009.14146
Saligrama V, Konrad J, Jodoin PM (2010) Video anomaly identification. IEEE Signal Process Mag 27(5):18–33
Article Google Scholar
Gong MG, Zeng HM, Xie Y et al (2020) Local distinguishability aggrandizing network for human anomaly detection. Neural Netw 122:364–373
Article Google Scholar
Redmon J, Divvala S, Girshick R, et al (2016) You only look once: unified, real-time object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition. pp 779–788
Srivastava N, Mansimov E, Salakhudinov R (2015) Unsupervised learning of video representations using lstms, In: International conference on machine learning, pp 843–852
Zou Y F (2109) Recognition and research about abnormal behavior of human based on video, Yunnan University, Kunming
Ristea NC, Madan N, Ionescu RT, et al (2022) Self-supervised predictive convolutional attentive block for anomaly detection. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. pp 13576–13586
Peng J, Zhao Y, Wang L (2021) Research on video abnormal behavior detection based on deep learning. Prog Laser Optoelectron 58(6):0600004
Article Google Scholar
Liu W, Luo W, Lian D, et al (2018) Future frame prediction for anomaly detection–a new baseline. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 6536–6545
Wang G, Wang Y, Qin J, et al (2022) Video anomaly detection by solving decoupled spatio-temporal jigsaw puzzles. In: Proceedings of the ieee conference on european conference on computer vision, pp 494–511
Zaheer MZ, Mahmood A, Khan MH, et al (2022) Generative cooperative learning for unsupervised video anomaly detection. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. pp 14744–14754
Yang J, Shi Y, Qi Z (2022) Learning deep feature correspondence for unsupervised anomaly detection and segmentation. Patt Recogn 132:108874
Article Google Scholar
Zong B, Song Q, Min MR, et al (2018). Deep autoencoding gaussian mixture model for unsupervised anomaly detection, In: International conference on learning representations
Lu Y, Kumar KM, Shahabeddin Nabavi S, et al (2019). Future frame prediction using convolutional vrnn for anomaly detection. In: 2019 16th IEEE international conference on advanced video and signal based surveillance, pp 1–8
Yu G, Wang S, Cai Z, et al (2020) Cloze test helps: effective video anomaly detection via learning to complete video events. In: Proceedings of the 28th ACM international conference on multimedia, pp 583–591
Paffenroth R, Du Toit P, Nong R et al (2013) Space-time signal processing for distributed pattern detection in sensor networks. IEEE J Select Topics Sign Process 7(1):38–49
Article Google Scholar
Hoffmann H (2007) Kernel PCA for novelty detection. Pattern Recogn 40(3):863–874
Article MATH Google Scholar
Laxhammar R, Falkman G, Sviestins E (2009) Anomaly detection in sea traffic-a comparison of the gaussian mixture model and the kernel density estimator In: 2009 12th international conference on information fusion. pp 756–763
Latecki LJ, Lazarevic A, Pokrajac D (2007) Outlier detection with kernel density functions. MLDM 7:61–75
Google Scholar
Ma J, Perkins S (2003) Time-series novelty detection using one-class support vector machines. In: Proceedings of the international joint conference on neural networks, pp 1741–1745
Krizhevsky A, Sutskever I, Hinton GE (2017) Imagenet classification with deep convolutional neural networks. Commun ACM 60(6):84–90
Article Google Scholar
Hasan M, Choi J, Neumann J, et al (2016) Learning temporal regularity in video sequences In: Proceedings of the IEEE conference on computer vision and pattern recognition. pp 733–742
Ronneberger O, Fischer P, Brox T (2015) U-net: convolutional networks for biomedical image segmentation. In: International conference on medical image computing and computer-assisted intervention. pp 234–241
Goodfellow IJ, Pouget-Abadie J, Mirza M et al (2014) Generative adversarial networks. Adv Neural Inf Process Syst 3:2672–2680
Google Scholar
Hasan M, Choi J, Neumann J, et al (2016) Learning temporal regularity in video sequences. In: Proceedings of the IEEE conference on computer vision and pattern recognition. pp 733–742
Zhao Y, Deng B, Shen C, et al (2017) Spatio temporal autoencoder for video anomaly detection In: Proceedings of the 25th ACM international conference on multimedia. pp 1933–1941
Zaheer MZ, Lee JH, Astrid M, et al (2020) Old is gold: redefining the adversarially learned one-class classifier training paradigm. In: 2020 IEEE/CVF Conference on computer vision and pattern recognition, pp 14183–14193
Zhang C, Song D, Chen Y et al (2019) A deep neural network for unsupervised anomaly detection and diagnosis in multivariate time series data. Proceed AAAI Conf Artif Intell 33(1):1409–1416
Google Scholar
Park H, Noh J, Ham B (2020) Learning memory-guided normality for anomaly detection. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. pp 14372–14381
Medel J R, Savakis A (2016) Anomaly detection in video using predictive convolutional long short-term memory networks. arXiv preprint arXiv:1612.00390
Zhao Y, Deng B, Shen C, et al (2017) Spatio-temporal autoencoder for video anomaly detection. In: Proceedings of the 25th ACM international conference on Multimedia. pp 1933–1941
Le VT, Kim YG (2022). Attention-based residual autoencoder for video anomaly detection. Applied Intelligence, pp 1–15
Liu Z, Nie Y, Long C, et al (2021) A hybrid video anomaly detection framework via memory-augmented flow reconstruction and flow-guided frame prediction. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 13588–13597
Chen D, Wang P, Yue L et al (2020) Anomaly detection in surveillance video based on bidirectional prediction. Image Vis Comput 98:103915
Article Google Scholar
Zhang XX, Zhu Z, Zhao Y, Chang DX (2017) Learning a general assignment model for video analytics. IEEE Trans Circuits Syst Video Technol 28(10):3066–3076
Article Google Scholar
Xia GY, Chen BJ, Sun HJ, Liu QS (2020) Nonconvex low-rank kernel sparse subspace learning for keyframe extraction and motion segmentation. IEEE Trans Neural Netw Learn Sys 32(4):1612–1626
Article MathSciNet Google Scholar
Meng J, Wang H, Yuan J, et al (2016) From keyframes to key objects: video summarization by representative object proposal selection. In: Proceedings of the IEEE conference on computer vision and pattern recognition. pp 1039–1048
Wang H, Kawahara Y, Weng C et al (2017) Representative selection with structured sparsity. Pattern Recogn 63:268–278
Article Google Scholar
Gong D, Liu L, Le V, et al (2019) Memorizing normality to detect anomaly: memory- augmented deep autoencoder for unsupervised anomaly detection. In: Proceedings of the IEEE/CVF international conference on computer vision. pp 1705–1714
Graves A, Wayne G, Reynolds M et al (2016) Hybrid computing using a neural network with dynamic external memory. Nature 538(7626):471–476
Article Google Scholar
Lv H, Chen C, Cui Z, et al (2021) Learning normal dynamics in videos with meta prototype network. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. pp 15425–15434
Zhang Y, Wang J, Chen Y et al (2022) Adaptive memory networks with self-supervised learning for unsupervised anomaly detection. IEEE Trans Knowl Data Eng. https://doi.org/10.1109/TKDE.2021.3139916
Article Google Scholar
Hendrycks D, Gimpel K (2016) Gaussian error linear units (gelus). arXiv preprint arXiv:1606.08415
Hu J, Shen L, Sun G (2018) Squeeze-and-excitation networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition. pp 7132–7141
Mahadevan V, Li W, Bhalodia V, et al (2010) Anomaly detection in crowded scenes. In: 2010 IEEE computer society conference on computer vision and pattern recognition. pp 1975–1981
Lu C, Shi J, Jia J (2013) Abnormal event detection at 150 fps in matlab. In: Proceedings of the IEEE international conference on computer vision. pp 2720–2727
Luo W, Liu W, Gao S (2017) A revisit of sparse coding based anomaly detection in stacked rnn framework. In: Proceedings of the IEEE international conference on computer vision. pp 341–349
Paszke A, Gross S, Chintala S, et al (2017) Automatic differentiation in pytorch. In: Advances in neural information processing systems 30
Kim J, Grauman K (2009) Observe locally, infer globally: a space-time MRF for detecting abnormal activities with incremental updates. In: 2009 IEEE conference on computer vision and pattern recognition, pp 2921–2928
Xu D, Ricci E, Yan Y, et al (2015) Learning deep representations of appearance and motion for anomalous event detection. arXiv preprint arXiv:1510.01553
Tudor Ionescu R, Smeureanu S, Alexe B, et al (2017) Unmasking the abnormal events in video. In: Proceedings of the IEEE international conference on computer vision. pp 2895–2903
Hinami R, Mei T, Satoh S (2017) Joint detection and recounting of abnormal events by learning deep generic knowledge. In: Proceedings of the IEEE international conference on computer vision. pp 3619–3627
Nguyen T N, Meunier J (2019) Anomaly detection in video sequence with appearance-motion correspondence. In: Proceedings of the IEEE/CVF international conference on computer vision. pp 1273–1283.
Chang Y, Tu Z, Xie W, et al (2020) Clustering driven deep autoencoder for video anomaly detection. In: European conference on computer vision. pp 329–345
Tang Y, Zhao L, Zhang S et al (2020) Integrating prediction and reconstruction for anomaly detection. Pattern Recogn Lett 129:123–130
Article Google Scholar
Kanu-Asiegbu A M, Vasudevan R, Du X (2021) Leveraging trajectory prediction for pedestrian video anomaly detection. In: 2021 IEEE symposium series on computational intelligence, pp 01–08
Li B, Leroux S, Simoens P (2021) Decoupled appearance and motion learning for efficient anomaly detection in surveillance video. Comput Vis Image Underst 210:103249
Article Google Scholar

Download references

Acknowledgements

Not applicable.

Funding

The Young Innovative Talents Project of Guangdong Province (No.2020KQNCX198); Basic and Applied Basic Research Project of Guangzhou Basic Research Program.

Author information

Authors and Affiliations

School of Information Engineering, Guang Zhou Railway Ploytechnic, Guangzhou, 510430, China
Liuping Zhou & Jing Yang
St. Paul University Phillippines, Tuguegarao City, Cagayan, 3500, Philippines
Jing Yang

Authors

Liuping Zhou
View author publications
You can also search for this author in PubMed Google Scholar
Jing Yang
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

All authors contributed to the study conception and design. Material preparation, datacollection and analysis were performed by Liuping Zhou. The first draft of the manuscript was written by Jing Yang and all authors commented on previous versions of the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Liuping Zhou.

Ethics declarations

Conflict of interest

The author declare that they have no conflict of interest.

Consent for publication

Yes.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Zhou, L., Yang, J. Video anomaly detection with memory-guided multilevel embedding. Int J Multimed Info Retr 12, 6 (2023). https://doi.org/10.1007/s13735-023-00272-x

Download citation

Received: 03 January 2023
Revised: 11 February 2023
Accepted: 23 February 2023
Published: 15 March 2023
DOI: https://doi.org/10.1007/s13735-023-00272-x

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Video anomaly detection with memory-guided multilevel embedding

Abstract

Access this article

Similar content being viewed by others

Video steganography: recent advances and challenges

Video summarization using deep learning techniques: a detailed analysis and investigation

Attention-based residual autoencoder for video anomaly detection

Data availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Consent for publication

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Video anomaly detection with memory-guided multilevel embedding

Abstract

Access this article

Similar content being viewed by others

Video steganography: recent advances and challenges

Video summarization using deep learning techniques: a detailed analysis and investigation

Attention-based residual autoencoder for video anomaly detection

Data availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Consent for publication

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation