Skip to main content

Advertisement

Log in

CNN-BiLSTM Model for Violence Detection in Smart Surveillance

  • Original Research
  • Published:
SN Computer Science Aims and scope Submit manuscript

Abstract

In this paper, a lightweight computational model has been introduced for the better classification of violent and non-violent activities. Nowadays, the occurrences of violent activities get increased in public places. It has various social and economic reasons behind the growth of violent actions. Therefore, the government agencies and public administrators need to check such incidents using smart surveillance. Deep learning-based an efficient violent activity detection model can help the authorities in detecting a violent activity in real-time. The evaluated results can be henceforth sent to store and analyze the captured video to automate the crime monitoring system. Convolutional Neural Network-based Bidirectional LSTM has been used to detect violent activities and also compared with other existing approaches. Our proposed model gives 99.27%, 100% and 98.64% classification accuracies for the widely used standard Hockey Fights, Movies and Violent-Flows video datasets, respectively.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

Notes

  1. https://www.tensorflow.org/.

  2. https://keras.io/.

  3. https://www.numpy.org/.

  4. N=total number of frames/n, n=10, x=100, y=100, c=3.

  5. https://github.com/tintybot/CNN-BiLSTM-Model.

References

  1. Yuan J, Liu Z, Wu Y. Discriminative subvolume search for efficient action detection. In: 2009 IEEE Conference on Computer Vision and Pattern Recognition, 2009; pp. 2442–2449. IEEE.

  2. Chatterjee R, Bandyopadhyay T. Eeg based motor imagery classification using svm and mlp. In: 2016 2nd International Conference on Computational Intelligence and Networks (CINE), 2016 pp. 84–89. IEEE.

  3. Neshige R, Kuroda Y, Kakigi R, Fujiyama F, Matoba R, Yarita M, Lüders H, Shibasaki H. Event-related brain potentials as indicators of visual recognition and detection of criminals by their use. Forensic Sci Int. 1991;51(1):95–103.

    Article  Google Scholar 

  4. Datta A, Chatterjee R. Comparative study of different ensemble compositions in eeg signal classification problem. In: Emerging Technologies in Data Mining and Information Security. Berlin: Springer; 2019. p. 145–54.

    Google Scholar 

  5. Abootalebi V, Moradi MH, Khalilzadeh MA. A new approach for eeg feature extraction in p300-based lie detection. Comput Methods Programs Biomed. 2009;94(1):48–57.

    Article  Google Scholar 

  6. Chatterjee R, Maitra T, Islam SKH, Hassan MM, Alamri A, Fortino G. A novel machine learning based feature selection for motor imagery eeg signal classification in internet of medical things environment. Future Gener Comput Syst. 2019;98:419–34.

    Article  Google Scholar 

  7. Nam J, Alghoniemy M, Tewfik AH. Audio-visual content-based violent scene characterization. In: Proceedings 1998 International Conference on Image Processing. ICIP98 (Cat. No. 98CB36269), volume 1, 1998; pp. 353–357. IEEE,

  8. Mousavi H, Mohammadi S, Perina A, Chellali R, Murino V. Analyzing tracklets for the detection of abnormal crowd behavior. In: 2015 IEEE Winter Conference on Applications of Computer Vision, 2015; pp. 148–155. IEEE

  9. Xu L, Gong C, Yang J, Wu Q, Yao L. Violent video detection based on mosift feature and sparse coding. In: 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2014; pp. 3538–3542. IEEE.

  10. Chen D, Wactlar H, Chen M, Gao C, Bharucha A, Hauptmann A. Recognition of aggressive human behavior using binary local motion descriptors. In: 2008 30th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2008; pp. 5238–5241. IEEE.

  11. Xingjian SHI, Chen Z, Wang H, Yeung D-Y, Wong W-K, Woo W. Convolutional lstm network: a machine learning approach for precipitation nowcasting. In: Advances in neural information processing systems, 2015. pp. 802–810.

  12. Medel JR, Savakis A. Anomaly detection in video using predictive convolutional long short-term memory networks. arXiv preprint arXiv:1612.00390, 2016.

  13. Patraucean V, Handa A, Cipolla R. Spatio-temporal video autoencoder with differentiable memory. arXiv preprint arXiv:1511.06309, 2015.

  14. Sudhakaran S, Lanz O. Learning to detect violent videos using convolutional long short-term memory. In: 2017 14th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS), 2017; pp. 1–6. IEEE.

  15. Dong Z, Qin J, Wang Y. Multi-stream deep networks for person to person violence detection in videos. In: Chinese Conference on Pattern Recognition. Berlin: Springer; 2016. p. 517–31.

    Google Scholar 

  16. Nievas EB, Suarez OD, García GB, Sukthankar R. Violence detection in video using computer vision techniques. In: International conference on Computer analysis of images and patterns. Berlin: Springer; 2011. p. 332–9.

    Google Scholar 

  17. Hassner T, Itcher Y, Kliper-Gross O. Violent flows: Real-time detection of violent crowd behavior. In: 2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, 2012; pp. 1–6. IEEE

  18. Mohammadi S, Kiani H, Perina A, Murino V. Violence detection in crowded scenes using substantial derivative. In: 2015 12th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS), 2015; pp. 1–6. IEEE

  19. Zhang T, Jia W, He X, Yang J. Discriminative dictionary learning with motion weber local descriptor for violence detection. IEEE Trans Circuits Syst Video Technol. 2016;27(3):696–709.

    Article  Google Scholar 

  20. Gao Y, Liu H, Sun X, Wang C, Liu Y. Violence detection using oriented violent flows. Image Vis Comput. 2016;48:37–41.

    Article  Google Scholar 

  21. Hanson A, Pnvr K, Krishnagopal S, Davis L. Bidirectional convolutional lstm for the detection of violence in videos. In: Proceedings of the European Conference on Computer Vision (ECCV), 2018.

  22. Li J, Jiang X, Sun T, Xu K. Efficient violence detection using 3d convolutional neural networks. In: 2019 16th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS), 2019; pp. 1–8. IEEE,

  23. Laptev I. On space-time interest points. Int J Comput Vision. 2005;64(2–3):107–23.

    Article  Google Scholar 

  24. Itcher Y, Hassner T, Kliper-Gross O. Violent flows: Real-time detection of violent crowd behavior. In: 3rd IEEE International Workshop on Socially Intelligent Surveillance and Monitoring (SISM) at the IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), 2012.

  25. Deniz O, Serrano I, Bueno G, Kim T-K. Fast violence detection in video. In: 2014 International Conference on Computer Vision Theory and Applications (VISAPP), 2014. volume 2, pp. 478–485. IEEE.

  26. Gracia IS, Deniz Suarez O, Garcia GB, Kim T-K. Fast fight detection. PLoS One. 2015;10(4):e0120448.

    Article  Google Scholar 

  27. Bilinski P, Bremond F. Human violence recognition and detection in surveillance videos. In: 2016 13th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS), pp. 30–36. IEEE, 2016.

  28. Jiang B, Xu F, Tu W, Yang C. Channel-wise attention in 3d convolutional networks for violence detection. In: 2019 International Conference on Intelligent Computing and its Emerging Applications (ICEA), pp. 59–64. IEEE.

  29. Abdali A-MR, Al-Tuma RF. Robust real-time violence detection in video using cnn and lstm. In: 2019 2nd Scientific Conference of Computer Sciences (SCCS), 2019; pp. 104–108. IEEE,

  30. Sharma M, Baghel R. Video surveillance for violence detection using deep learning. In: Advances in data science and management. Berlin: Springer; 2020. p. 411–20.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Rajdeep Chatterjee.

Ethics declarations

Funding

Nil.

Conflict of interest

The authors declare that they have no conflict of interest.

Ethical Approval

All procedures performed in studies involving human participants were in accordance with the ethical standards of the institutional and/or national research committee and with the 1964 Helsinki Declaration and its later amendments or comparable ethical standards.

Informed Consent

Informed consent was obtained from all individual participants included in the study.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Halder, R., Chatterjee, R. CNN-BiLSTM Model for Violence Detection in Smart Surveillance. SN COMPUT. SCI. 1, 201 (2020). https://doi.org/10.1007/s42979-020-00207-x

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s42979-020-00207-x

Keywords

Navigation