Abstract
Video anomaly detection (VAD) plays a vital role in intelligent video surveillance systems and smart cities, and it has been widely studied in both industry and academia. This study proposes a new anomaly detection method based on multi-level memory embedding. The method stores feature prototypes of the samples in a memory pool, which increases the diversity of the stored prototypes. The memory is embedded in the decoder hierarchically, which makes the feature information of each object more complete and improves feature quality. At the end of the model, the relationships between object features are modeled along the channel dimension, which makes anomaly detection more efficient. The method is evaluated on three publicly available datasets: UCSD Ped2, CUHK Avenue, and ShanghaiTech.
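The two mechanisms named in the abstract, reading from a memory pool of feature prototypes and reweighting features along the channel dimension, can be illustrated with a small sketch. This is not the authors' implementation. The function names (`read_memory`, `channel_gate`), the cosine-similarity addressing, the squeeze-and-excitation style gating, and all toy values are assumptions chosen for illustration. In a multi-level setting, a read of this kind would presumably be applied at each decoder level.

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb + 1e-12)

def read_memory(query, memory):
    """Hypothetical memory read: score each stored prototype against the
    query, normalize the scores, and return the weighted sum of prototypes."""
    weights = softmax([cosine(query, item) for item in memory])
    dim = len(query)
    read = [sum(w * item[d] for w, item in zip(weights, memory)) for d in range(dim)]
    return weights, read

def channel_gate(channels, w1, w2):
    """Assumed squeeze-and-excitation style gating: squeeze each channel to
    its mean, pass the result through a two-layer bottleneck (ReLU, then
    sigmoid), and rescale every channel by its gate."""
    squeezed = [sum(c) / len(c) for c in channels]
    hidden = [max(0.0, sum(w * s for w, s in zip(row, squeezed))) for row in w1]
    gates = [1.0 / (1.0 + math.exp(-sum(w * h for w, h in zip(row, hidden)))) for row in w2]
    scaled = [[g * v for v in c] for g, c in zip(gates, channels)]
    return gates, scaled

# Toy data (illustrative only): two stored prototypes, one query near the first.
memory = [[1.0, 0.0], [0.0, 1.0]]
weights, read = read_memory([0.9, 0.1], memory)

# Toy feature map with 2 channels of 3 values each, and fixed example weights.
channels = [[1.0, 2.0, 3.0], [0.0, 0.0, 0.0]]
gates, scaled = channel_gate(channels, w1=[[1.0, 1.0]], w2=[[1.0], [-1.0]])
```

In this toy run the query is closest to the first prototype, so the first attention weight is the larger one, and the channel with the stronger mean response receives the larger gate. In a trained model the prototypes and the gating weights would be learned rather than fixed.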
Data availability
Not applicable.
Acknowledgements
Not applicable.
Funding
This work was supported by the Young Innovative Talents Project of Guangdong Province (No. 2020KQNCX198) and the Basic and Applied Basic Research Project of the Guangzhou Basic Research Program.
Author information
Contributions
All authors contributed to the study conception and design. Material preparation, data collection, and analysis were performed by Liuping Zhou. The first draft of the manuscript was written by Jing Yang, and all authors commented on previous versions of the manuscript. All authors read and approved the final manuscript.
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Consent for publication
Yes.
About this article
Cite this article
Zhou, L., Yang, J. Video anomaly detection with memory-guided multilevel embedding. Int J Multimed Info Retr 12, 6 (2023). https://doi.org/10.1007/s13735-023-00272-x
DOI: https://doi.org/10.1007/s13735-023-00272-x