Skip to main content
Log in

A keyframe selection for summarization of informative activities using clustering in surveillance videos

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Video informative activities summarization in surveillance video has become an important approach that leads to object detection, object classification, and multi-event detection, etc. Several approaches such as dictionary learning, representation learning, statistical approach, etc. have been used for identifying important events. However, the efficiency of these methods lack in identifying the important activities in a video effectively. To overcome this challenge, this paper presents a keyframe selection algorithm by adopting multi-level clustering using surveillance video. To efficiently identify the activities, orientation computation, Markov chain based clustering, and adjacent matrix based clustering are used. The Markov chain based clustering is used to analyze and group the adjacent activities efficiently. Adjacent matrix based clustering ensures inter-relationship among the activities and then integrates the positive activities. It leads to an efficient and informative activities summary. The experimental results are tested on the PETS 2009, VIRAT and UCLA datasets. Also, various existing video summarization algorithms are compared with the proposed method for evaluating the performance of the proposed method.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Algorithm 1:
Fig. 5
Fig. 6

Similar content being viewed by others

Data availability

1. http://cs.binghamton.edu/~mrldata/pets2009

2. https://viratdata.org

3. https://www.library.ucla.edu/social-science-data-archive/data-portals

References

  1. Benezeth Y, Jodoin P-M, Saligrama V, Rosenberger C (2009) Abnormal events detection based on spatio-temporal co-occurences. In Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 2458–2465

  2. Calic J, Thomas B (2004) Spatial analysis in key-frame extraction using video segmentation, In: Proceedings of Workshop on Image Analysis, Multimedia Interactive Services, Portugal

  3. Chen B-W, Wang J-C, Wang J-F (2009) A novel video summarization based on mining the story-structure and semantic relations among concept entities. IEEE Trans Multimedia 11(2):295–312

    Article  Google Scholar 

  4. Chen F, Delannay D, Vleeschouwer C (2011) An autonomous framework to produce and distribute personalized team-sport video summaries: a basketball case study. IEEE Trans Multimedia 13(6):1381–1394

    Article  Google Scholar 

  5. Chen DY, Huang PC (2010) Motion-based unusual event detection in human crowds. Elsevier J Vis Commun Image Represent, pp. 178–186

  6. Ciocca G, Schettini R (2006) Innovative algorithm for key frame extraction in video summarization. J Real-Time Image Process 1(1):69–88

    Article  Google Scholar 

  7. De Avila SE, Lopes ABP, da Luz Jr AA, de Albuquerque Araújo A (2011) VSUMM: a mechanism designed to produce static video summaries and a novel evaluation method. Pattern Recogn Lett 32(1):56–68

    Article  Google Scholar 

  8. De Menthon D, Kobla V, Doermann D (1998) Video summarization by curve simplification, In: Proceedings of the ACM Internaternational Conference on Multimedia, NewYork, pp. 211–218

  9. Elhoseny M (2020) “Multi-object Detection and Tracking (MODT) Machine Learning Model for Real-Time Video Surveillance Systems”, Springer, Circuits. Syst Signal Process 39(2):611–630

    Article  Google Scholar 

  10. Guironnet M, Pellerin D, Guyader N, Ladret P (2007) Video summarization based on camera motion and a subjective evaluation method, EURASIP J Image Video Process 122

  11. Hanjalic R, Langendijk L, Biemond J (1996) A new key-frame allocation method for representing stored video-streams, In: 1st International Workshop on Image Databases and Multimedia, Search, pp. 67–74.

  12. Hong R, Tang J, Tan H-K, Ngo C-W, Yan S, Chua T-S (2011) Beyond search: Event-driven summarization for Web videos. ACM Trans Multimedia Comput 7(4):35

    Google Scholar 

  13. Hoon SH, Yoon K, Kweon I (2000) A new technique for shot detection and key frames selection in histogram space, In: 12th Workshop on Image Processing and Image Understanding, pp. 217–220

  14. Kim G, Sigal L, and Xing EP (2014) Joint summarization of large-scale collections of Web images and videos for storyline reconstruction,” In Proceedings of the IEEE Conference on computer vision and pattern recognition, pp. 4225–4232

  15. Kim JS, Yeom DH, Joo YH (2011) Fast and robust algorithm of tracking multiple moving objects for intelligent video surveillance systems. IEEE Trans Consum Electron 57:1165–1170

    Article  Google Scholar 

  16. Lu Z and Grauman K (2013) Story-driven summarization for egocentric video. In Proceeding IEEE ConfComput Vis Pattern Recognit, pp. 2714–2721

  17. Luo J, Papin C, Costello K (2009) Towards extracting semantically meaningful key frames from personal video clips: from humans to computers. IEEE Trans Circuits Syst Video Technol 19(2):289–301

    Article  Google Scholar 

  18. Oh S et al (2011) A large-scale benchmark dataset for event recognition in surveillance video. In Proceeding IEEE ConfComput Vis Pattern Recognit, pp. 3153–3160

  19. Pal SK, Leigh AB (1995) Motion frame analysis and scene abstraction: discrimination ability of fuzziness measures. J Intell Fuzzy Syst 3:247–256

    Article  Google Scholar 

  20. Potapov D, Douze M, Harchaoui Z, and Schmid C, (2014) Categoryspecific video summarization, In Proc. Eur. Conf. Comput. Vis, pp. 540–555

  21. Pritch Y, Ratovitch S, Hendel A, Peleg S (2009) Clustered synopsis of surveillance video. Proceeding international conference on advanced. Video Signal Surveillance, Genova, pp 195–200

    Google Scholar 

  22. Truong BT, Venkatesh S (2007) Video abstraction: A systematic review and classification. ACM Trans Multimedia Comput Commun Appl 3(1):1–37

    Article  Google Scholar 

  23. Truong BT, Venkatesh S (2007) Video abstraction: a systematic review and classification, ACM Trans Multimedia Comput Commun Appl 3 (1)

  24. Vennila TJ and Balamurugan V (2020) A Stochastic Framework for Keyframe Extraction, In 2020 International Conference on Emerging Trends in Information Technology and Engineering (ic-ETITE), pp. 1–5, https://doi.org/10.1109/ic-ETITE47903.2020.294

  25. Yuan M, Lin Y (2006) Model selection and estimation in regression with grouped variables. J R Stat Soc B 68(1):49–67

    Article  MathSciNet  Google Scholar 

  26. Zhang XD, Liu TY, Lo KT, Feng J (2003) Dynamic selection and effective compression of key frames for video abstraction. Pattern Recogn Lett 24(9–10):1523–1532

    Article  Google Scholar 

  27. Zhang S, Zhu Y, Roy-Chowdhury AK (2016) Context-Aware Surveillance Video Summarization. IEEE Trans Image Process 25(11):5469–5478. https://doi.org/10.1109/TIP.2016.2601493

    Article  MathSciNet  Google Scholar 

  28. Zhao B, Fei-Fei L, Xing EP (2011) Online detection of unusual events in videos via dynamic sparse coding. In Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 3313–3320

  29. Zhao B and Xing EP (2014) Quasi real-time summarization for consumer videos In Proceedings IEEE Conference computer vision and pattern recognition, Columbus, pp. 2513–2520

  30. Zivkovic Z (2004) Improved adaptive Gaussian mixture model for background subtraction,” In Proceedings of the International Conference on Pattern Recognition, pp. 28–31

  31. Zhuang Y, Rui Y, Huang TS, Mehrotra S, (1998) Adaptive key frame extraction using unsupervised clustering, In: International Conference on Image Processing, pp. 866–870

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to A. Anbarasa Pandian.

Ethics declarations

Conflicts of Interest

The authors declare that they have no conflict of interest.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Pandian, A.A., Maheswari, S. A keyframe selection for summarization of informative activities using clustering in surveillance videos. Multimed Tools Appl 83, 7021–7034 (2024). https://doi.org/10.1007/s11042-023-15859-z

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-023-15859-z

Keywords

Navigation