Abstract
The amount of digital material in video lecture archives is growing rapidly, making search and retrieval time-consuming and nearly impractical. After a search, students receive a list of videos and often must use VCR-like functions to find the specific piece of video that covers the searched topic. A more efficient method for video retrieval in digital video lecture archives is therefore needed. In this paper, we propose VLB (Video Lecture Browsing), a system designed to facilitate both the retrieval of video lectures within video archives and the identification of the segment of a video lecture that best covers a searched topic, by automatically producing an overview of the lecture's contents. To achieve these goals, the system introduces the idea of timed tag-clouds, which are produced with a combination of aural and visual analysis. Results of a mean opinion score (MOS) evaluation show that users highly appreciate the timed tag-clouds approach, and a comparison study against other popular approaches shows that 93% of users prefer to use VLB to handle video lectures.
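The abstract does not detail how timed tag-clouds are computed, so the following is only a minimal sketch of the general idea, under stated assumptions: timestamped words from a speech transcript (aural analysis) and from OCR on video frames (visual analysis) are merged into one small tag cloud per time window. The function name `timed_tag_clouds`, the window length, the extra weight given to slide words, and the stop-word list are all hypothetical choices made for illustration, not the method used by VLB.

```python
from collections import Counter, defaultdict

# Hypothetical stop-word list; a real system would use a fuller one.
STOP_WORDS = {"the", "a", "an", "of", "and", "to", "in", "is", "that", "we", "this"}


def timed_tag_clouds(speech_words, slide_words, window_s=60.0, top_k=5, slide_weight=2.0):
    """Build one small tag cloud per time window (illustrative only).

    speech_words: iterable of (time_in_seconds, word) from a speech transcript.
    slide_words:  iterable of (time_in_seconds, word) from OCR on video frames.
    Returns {window_start_seconds: [(word, weight), ...]}, heaviest terms first.
    """
    weights = defaultdict(Counter)
    # Assumption: words seen on slides count more than spoken words.
    for source, w in ((speech_words, 1.0), (slide_words, slide_weight)):
        for t, word in source:
            term = word.lower().strip(".,;:!?()\"'")
            if len(term) < 3 or term in STOP_WORDS:
                continue  # skip very short tokens and stop words
            window_start = int(t // window_s) * window_s
            weights[window_start][term] += w
    return {
        start: counter.most_common(top_k)
        for start, counter in sorted(weights.items())
    }
```

Each key of the returned dictionary is the start time of a window, so a player could display the corresponding cloud and jump to that offset when a tag is selected.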
Notes
The OCR-generated lesson description is produced by Tesseract, an open-source OCR engine.
Cite this article
Furini, M. On introducing timed tag-clouds in video lectures indexing. Multimed Tools Appl 77, 967–984 (2018). https://doi.org/10.1007/s11042-016-4282-5
DOI: https://doi.org/10.1007/s11042-016-4282-5