Abstract
Video thumbnails enable users to see quick snapshots of video collections. To display the video thumbnails, the first frame or a frame selected by using simple low level features in each video clip has been set to the default thumbnail for the sake of computational efficiency and implementation simplicity. However, such methods often fail to represent the gist of the clip. To overcome this limitation, we present a new framework for both static and dynamic video thumbnail extraction. First, we formulate energy functions using the features which incorporate mid-level information to obtain superior thumbnailing. Since it is considered that frames whose layouts are similar to others in the clip are relevant in video thumbnail extraction, scene layouts are also considered in computing overall energy. For dynamic thumbnail generation, a time slot is determined by finding the duration showing the minimum energy. Experimental results show that the proposed method achieves comparable performance on a variety of challenging videos, and the subjective evaluation demonstrates the effectiveness of our method.
Similar content being viewed by others
References
Almeida J, Leite NJ, Torres RdS (2012) Vison: Video summarization for online applications. Pattern Recogn Lett 33(4):397–409
Al-Hajri A, Fong M, Miller G, Fels S (2014) Fast forward with your vcr: Visualizing single-video viewing statistics for navigation and sharing. In: Proceedings of the 2014 Graphics Interface Conference, pp 123–128
Benini S, Migliorati P, Leonardi R (2007) Hidden markov models for video skim generation. In: Eighth International Workshop on Image Analysis for Multimedia Interactive Services. IEEE, pp 6–6
Choi J, Jung C, Lee J, Kim C (2014) Determining the existence of objects in an image and its application to image thumbnailing. IEEE Signal Process Lett 21 (8):957–961
Christel MG (2006) Evaluation and user studies with respect to video summarization and browsing. In: Electronic Imaging 2006. International Society for Optics and Photonics, pp 60730M–60730M
Cotsaces C, Nikolaidis N, Pitas I (2006) Video shot detection and condensed representation: a review. IEEE Signal Process Mag 23(2):28–37
Craggs B, Scott MK, Alexander J (2014) ThumbReels: query sensitive web video previews based on temporal, crowdsourced, semantic tagging. In: Proceedings the 32nd annual ACM Conference on Human Factors in Computing Systems, pp 1217–1220
Gao Y, Zhang T, Xiao J (2009) Thematic video thumbnail selection. In: Proceedings of IEEE International Conference on Image Processing (ICIP). IEEE, pp 4333–4336
Gong Y, Liu X (2000) Generating optimal video summaries. In: Proceedings of IEEE International Conference on Multimedia and Expo (ICME), vol 3. IEEE, pp 1559–1562
Jiang J, Zhang X-P (2010) A novel video thumbnail extraction method using spatiotemporal vector quantization , Proc. of the 3rd International Workshop on Automated Information Extraction in Media Production. ACM, pp. 9–14
Jiang J, Zhang X-P (2011) Video thumbnail extraction using video time density function and independent component analysis mixture model. In: Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, pp 1417–1420
Lee K-J., Lee W-J., Jeong J-C. (2014) An enhanced error compensation method for thumbnail generation in H. 264/AVC bitstreams. In: Proceedings of the 4th IEEE International Conference on Network Infrastructure and Digital Content (IC-NIDC), pp 236–239
Li H, Ngan KN (2007) Unsupervized video segmentation with low depth of field. IEEE Trans Circuits Syst Video Technol 17(12):1742–1751
Liu C, Huang Q, Jiang S (2011) Query sensitive dynamic web video thumbnail generation. In: Proceedings of IEEE International Conference on Image Processing (ICIP). IEEE, pp 2449–2452
Ma Y-F, Hua X-S, Lu L, Zhang H-J (2005) A generic framework of user attention model and its application in video summarization. IEEE Trans Multimed 7 (5):907–919
Ma Y-F, Lu L, Zhang H-J, Li M (2002) A user attention model for video summarization. In: Proceedings of the tenth ACM international conference on Multimedia. ACM, pp 533–542
Money AG, Agius H (2008) Video summarisation: A conceptual framework and survey of the state of the art. J Vis Commun Image Represent 19(2):121–143
Ngo C-W, Ma Y-F, Zhang H-J (2005) Video summarization and scene detection by graph modeling. IEEE Trans Circuits Syst Video Technol 15(2):296–305
Ojala T, Pietikainen M, Maenpaa T (2002) Multiresolution gray-scale and rotation invariant texture classification with local binary patterns. IEEE Trans Pattern Anal Mach Intell 24(7):971–987
Sundaram S, Velisavljevic V, Qin Y (2011) Hotflashes: Thumbnailing videos of social gatherings by detecting camera flash illuminated frames. In: Proceedings of IEEE International Conference on Multimedia and Expo (ICME). IEEE, pp 1–4
Truong BT, Venkatesh S (2007) Video abstraction: A systematic review and classification. ACM Trans Multimed Comput Commun Appl (TOMCCAP) 3(1):3
Viola P, Jones MJ (2004) Robust real-time face detection. Int J Comput Vis 57(2):137–154
Wang M, Hong R, Li G, Zha Z-J, Yan S, Chua T-S (2012) Event driven web video summarization by tag localization and key-shot identification. IEEE Trans Multimed 14(4):975–985
Wang T, Mei T, Hua X-S, Liu X-L, Zhou H-Q (2007) Video collage: A novel presentation of video sequence. In: Proceedings of IEEE International Conference on Multimedia and Expo (ICME). IEEE, pp 1479–1482
Wang Y.-S, Liu F, Hsu P-S, Lee T-Y (2013) Spatially and temporally optimized video stabilization. IEEE Trans Vis Comput Graph:1
Yong S-P., Deng JD, Purvis MK (2013) Wildlife video key-frame extraction based on novelty detection in semantic context. Multimed Tools Appl 62(2):359–376
Zhang W, Liu C, Huang Q, Jiang S, Gao W (2012) A novel framework for web video thumbnail generation. In: Eighth International Conference on Intelligent Information Hiding and Multimedia Signal Processing (IIH-MSP), IEEE, pp 343–346
Zhang W, Liu C, Wang Z, Li G, Huang Q, Gao W (2013) Web video thumbnail recommendation with content-aware analysis and query-sensitive matching. Multimed Tools Appl:1–25
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Choi, J., Kim, C. A framework for automatic static and dynamic video thumbnail extraction. Multimed Tools Appl 75, 15975–15991 (2016). https://doi.org/10.1007/s11042-015-2909-6
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-015-2909-6