Abstract
The expansion of growth in the generation of video data in various organizations causes an urgent requirement for effectual video summarization methods. This paper devises a novel optimization-driven deep learning technique for video summarization. The aim is to give an automated video summarization. Initially, the video data is extracted from the database. Then, the representative frame selection is done using Bayesian fuzzy clustering (BFC). After that, the frames are then temporally segmented, wherein each segment is modelled as a representative frame, which is generated by clustering the temporal segment into clusters. These segments are selected from each cluster closest to the cluster center. The next step is fine refining that is performed using Deep convolution neural network (Deep CNN), which helps to refine the final frame set. The Deep CNN is trained using the proposed Lion deer hunting (LDH) algorithm. The LDH algorithm is the integration of the Deer hunting optimization algorithm (DHOA) and Lion optimization algorithm (LOA). Thus, the final frames obtained by the proposed LDH-based Deep CNN are employed for video summarization. Here, the final frames are adapted to play as a continuous output video. The developed LDH-based Deep CNN offered enhanced performance than other techniques with the highest precision of 0.841, highest recall of 0.810, and highest F1-Score of 0.888.
Similar content being viewed by others
Data availability
The datasets analyzed during the current study are available in the VIOLENT-FLOWS repository, https://www.openu.ac.il/home/hassner/data/violentflows/, SumMe repository, https://paperswithcode.com/dataset/summe, and TvSum repository, https://paperswithcode.com/dataset/tvsum-1.
References
Abonyi J, Feil B, Nemeth S, Arva P (2003) Fuzzy clustering based segmentation of time-series. In: International Symposium on Intelligent Data Analysis, Springer, pp. 275–285
Acha AR, Pritch Y, Peleg S (2006) Making a long video short: Dynamic video synopsis. In: Proceedings CVPR, pp. 435–441
Aote SS, Potnurwar A (2018) An automatic video annotation framework based on two level keyframe extraction mechanism. Multimed Tools Appl:1–20
Boothalingam R (2018) Optimization using lion algorithm: a biological inspiration from lion’s social behavior. Evol Intel 11(1):31–52
Brammya G, Praveena S, NinuPreetha NS, Ramya R, Rajakumar BR, Binu D (2019) Deer hunting optimization algorithm: A new nature-inspired meta-heuristic paradigm. Comput J
Chen K, Franko K, Sang R (2021) Structured model pruning of convolutional networks on tensor processing units. arXiv preprint arXiv:2107.04191
Choi T-M, Chan HK, Yue X (2016) Recent development in big data analytics for business operations and risk management. IEEE Trans Cybern 47:81–92
Cong Y, Yuan JS, Luo JB (2012) Towards scalable summarization of consumer videos via sparsedictionary selection. IEEE Trans Multimed 14(1):66–75
Ejaz N, Mehmood I, Baik SW (2014) Feature aggregation based visual attention model for videosummarization. ComputElectrEng 40(3):993–1005
Fei M, Jiang W, Mao W (2017) A novel compact yet rich key frame creation method for compressed video summarization. Multimed Tools Appl 77(10):11957–11977
Glenn TC, Zare A, Gader PD (2014) Bayesian fuzzy clustering. IEEE Trans Fuzzy Syst 23(5):1545–1561
Hannane R, Elboushaki A, Afdel K (2018) MSKVS: Adaptive mean shift-based keyframe extraction for video summarization and a new objective verification approach. J Vis Commun Image Represent 55:179–200
He Y, Gao C, Sang N, Qu Z, Han J (2017) Graph coloring based surveillance video synopsis. Neurocomputing 225:64–79
Huang C, Wang H (2019) Novel key-frames selection framework for comprehensive video summarization. IEEE Trans Circuit Syst Video Technol:1–1
Hussain T, Muhammad K, Ullah A, Cao Z, Baik SW, de Albuquerque VHC (2019) Cloud-assisted multiview video summarization using CNN and bidirectional LSTM. IEEE Trans Indus Inform 16(1):77–86
Jadhav JN, Arunkumar B (2018) Web page recommendation system using laplace correction dependent probability and Chronological dragonfly-based clustering. Int J Eng Technol (UAE) 7(3.27):290–302
Jog VV, Pande V (2014) Security of outsourced data in cloud using Dynamic Auditing. Int J Sci, Eng Comput Technol 4(12):392
Li L, Zhou K, Xue GR, Video summarization via transferrable structured learning. In: Proceedings of International conference on world wide web, WWW 2011, hyderabad, India, pp 287–296, March 28 – April 2011.
Muhammad K, Tanveer H, Del Ser J, Palade V, De Albuquerque VHC (2019) DeepReS: a deep learning-based video summarization strategy for resource-constrained industrial surveillance scenarios. IEEE Trans Indus Inform:1–1
Mundur P, Rao Y, Yesha Y (2006) Keyframe-based video summarization using Delaunay clustering. Int J Digit Libr 6(2):219–232
Ngo CW, Pong TC, Zhang HJ (2002) Motion-based video representation for scene change detection. IntJ Comput Vis 50(2):127–142
Pande D, Jog VV (2014) Enhancing Security of outsourced data in cloud using Dynamic Auditing.
Puttaswamy MR (2020) Improved deer hunting optimization algorithm for video based salient object detection. Multimed Res 3(3)
Senthil Murugan T, Jog VV (2019) Systematic investigation and performance study of authentication and authorization techniques of Internet of Things. Int J Knowledge-based Intel Eng Syst 23(2):61–76
Song J, Gao L, Liu L, Zhu X, Sebe N (2018) Quantization-based hashing: a general framework for scalable image and video retrieval. Pattern Recogn 75:175–187
SumMe database taken from, (n.d.) “https://paperswithcode.com/dataset/summe”.
Taha M, Ali A, Lloret J, Gondim PRL, Canovas A (2021) An automated model for the assessment of QoE of adaptive video streaming over wireless networks. Multimed Tools Appl 80:26833–26854
Taha M, Canovas A, Lloret J, Ali J (2021) A QoE adaptive management system for high definition video streaming over wireless networks. Telecommun Syst 77(1):63–81
Thomas SS, Gupta S, Subramanian VK (2017) Perceptual video summarization—a new framework for video summarization. IEEE Trans Circuit Syst Video Technol 27(8):1790–1802
Tu F, Yin S, Ouyang P, Tang S, Liu L, Wei S (2017) Deep convolutional neural network architecture with reconfigurable computation patterns. IEEE Trans Very Large Scale Integ (VLSI) Syst 25(8):2220–2233
TvSum database taken from, (n.d.) “https://paperswithcode.com/dataset/tvsum-1”.
Ullah A, Muhammad K, Del Ser J, Baik SW, Albuquerque V (2018) Activity recognition using temporal optical flow convolutional features and multi-layer LSTM. IEEE Trans Ind Electron
VIOLENT-FLOWS DATABASE taken from, “https://www.openu.ac.il/home/hassner/data/violentflows/” Accessed on February 2021.
Wang M, Hong R, Li G (2012) Event driven web video summarization by tag localization and key-shot identification. IEEE Trans Multimed 14(4):975–985
Wang X, Nie X, Liu X, Wang B, Yin Y (2020) Modality correlation-based video summarization. Multimed Tools Appl:1–16
Wu J, Zhong S, Ma Z, Heinen SJ, Jiang J, (2019) Foveated convolutional neural networks for video summarization. Multimed Tools Appl
Zhang L, Jing P, Su Y, Zhang C, Shaoz L (2016) SnapVideo: personalized video generation for a sightseeing trip. IEEE Trans Cybern 47:3866–3878
Zhu X, Guo K, Fang H, Chen L, Ren S, Bin H (2021) Cross view capture for stereo image super-resolution. IEEE Trans Multimed 24:3074–3086
Zhu X, Guo K, Ren S, Bin H, Min H, Fang H (2021) Lightweight image super-resolution with expectation-maximization attention mechanism. IEEE Trans Circuit Syst Video Technol 32(3):1273–1284
Funding
None.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
None.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Singh, A., Kumar, M. Bayesian fuzzy clustering and deep CNN-based automatic video summarization. Multimed Tools Appl 83, 963–1000 (2024). https://doi.org/10.1007/s11042-023-15431-9
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-023-15431-9