Abstract
Informative videos (e.g. recorded lectures) are increasingly being made available online, but they are difficult to use, browse and search. Nowadays, popular platforms let users search and navigate videos via a transcript, which, in order to guarantee a satisfactory level of word accuracy, has typically been generated using some manual inputs. The goal of our work is to try and take a step closer to the fully automatic generation of informative video transcripts based on current automatic speech recognition technology. We present a user study designed to better understand viewers’ use of video transcripts for searching a video content, with the aim of estimating what minimum word recognition accuracy is needed for video captions to be a useful search interface. We found that transcripts with 70% word recognition accuracy are as effective as 100% accuracy transcripts in supporting video search when using single word search. We also found that there are large variations in the time it takes to search a video, independently of the quality of the transcript. With adequate and adapted search strategies, even low accuracy transcripts can support quick video search.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
TED Homepage. http://www.ted.com/. Last Accessed 12 Apr 2017
edX Homepage. http://www.edx.org. Last Accessed 12 Apr 2017
Coursera Homepage. http://www.coursera.org/. Last Accessed 12 Apr 2017
Breslow, L.B., Pritchard, D.E., DeBoer, J., Stump, G.S., Ho, A.D., Seaton, D.T.: Studying learning in the worldwide classroom: Research into edX’s first MOOC. Res. Pract. Assess. 8, 13–25 (2013)
Kim, J., Li, S.W., Cai, C.J., Gajos, K.Z., Miller, R.C.: Leveraging video interaction data and content analysis to improve video learning. In: Proceedings of the CHI 2014, Learning Innovation at Scale workshop, pp. 31–40 (2014)
Guo, P.J., Kim, J., Rubin, R.: How video production affects student engagement: an empirical study of MOOC videos. In: Proceedings of the first ACM Learning@scale Conference, pp. 41–50. ACM (2014)
Pavel, A., Reed, C., Hartmann, B., Agrawala, M.: Video digests: a browsable, skimmable format for informational lecture videos. In: Proceedings of UIST 2014, 5–8 October, Honolulu, USA (2014)
Victor, B.: April 2013. http://worrydream.com/MediaForThinkingTheUnthinkable. Last Accessed 12 Apr 2017
WebAim Homepage. http://webaim.org/techniques/captions/. Last Accessed 12 Apr 2017
CaptionSync Homepage. http://www.automaticsync.com/captionsync/. Last Accessed 12 Apr 2017
PlayMedia Homepage. http://www.3playmedia.com/. Last Accessed 12 Apr 2017
YouTube Homepage. https://www.youtube.com/. Last Accessed 12 Apr 2017
GoogleSpeech Homepage. https://cloud.google.com/speech/. Last Accessed 12 Apr 2017
Miró, J.D., Silvestre-Cerdà , J.A., Civera, J., Turró, C., Juan, A.: Efficiency and usability study of innovative computer-aided transcription strategies for video lecture repositories. Speech Commun. 74, 65–75 (2015)
Ranchal, R., Taber-Doughty, T., Guo, Y., Bain, K., Martin, H., Robinson, J.P., Duerstock, B.S.: Using speech recognition for real-time captioning and lecture transcription in the classroom. IEEE Trans. Learn. Technol. 6(4), 299–311 (2013)
Sphinx Homepage. http://cmusphinx.sourceforge.net/. Last Accessed 12 Apr 2017
WhiteHouse Homepage. https://www.whitehouse.gov/. Last Accessed 12 Apr 2017
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Chao, Y., Bourguet, ML. (2017). What Speech Recognition Accuracy is Needed for Video Transcripts to be a Useful Search Interface?. In: Karpov, A., Potapova, R., Mporas, I. (eds) Speech and Computer. SPECOM 2017. Lecture Notes in Computer Science(), vol 10458. Springer, Cham. https://doi.org/10.1007/978-3-319-66429-3_82
Download citation
DOI: https://doi.org/10.1007/978-3-319-66429-3_82
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-66428-6
Online ISBN: 978-3-319-66429-3
eBook Packages: Computer ScienceComputer Science (R0)