Abstract
In this paper, we present a framework for video retrieval using caption text and keyframe similarity. To extract caption text, we applied methods detecting and extracting image areas contain caption text and we used Tesseract-OCR engine to convert into plain text, then use Hunspell library for spell words. Next, we used Clucene search engine index and query on this text. We applied shape descriptors APR and ECM to descript keyframes of the video shots and use those descriptors as a feature vector of video shots. From the feature vectors were obtained, we used ANN library to index and search. The system which is built on the web-based application using ASP.NET support keyword-based and keyframe-based query. The results obtained from experiments produced very promising.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Zhang, H., Wu, J., Zhong, D., Smoliar, S.W.: An integrated system for content-based video retrieval and browsing. Pattern Recognition, 643–658 (1997)
Jawahar, C.V., Chennupati, J.B., Paluri, B., Jammalamadaka, N.: Video retrieval based on textual queries. In: The 13th Intl Conference on Advanced Computing and Communications (2005)
Jung, K.: Text information extraction in images and video: a survey. Pattern Recognition 37(5), 977–997 (2004)
Peng, J., Xiao-Lin, Q.: Keyframe-based video summary using visual attention clues. IEEE Multimedia 17, 64–73 (2010)
Smoliar, S.W., Zhang, H.: Content-based video indexing and retrieval. IEEE MultiMedia 1(2), 62–72 (1994)
Dimitrova, N., Zhang, H.J., Shahraray, B., Sezan, I., Huang, T., Zakhor, A.: Applications of video-content analysis and retrieval. IEEE MultiMedia 9(3), 42–55 (2002)
Lienhart, R.: Video ocr: A survey and practitioner’s guide. In: Video Mining, pp. 155–184. Kluwer Academic Publisher (2003)
Anthimopoulos, M., Gatos, B., Pratikakis, I.: A two-stage scheme for text detection in video images. Image Vision Comput. 28(9), 1413–1426 (2010)
Langlois, T., Chambel, T., Oliveira, E., Carvalho, P., Marques, G., Falcão, A.: Virus: video information retrieval using subtitles. In: Proceedings of the 14th International Academic MindTrek Conference: Envisioning Future Media Environments, MindTrek 2010, pp. 197–200. ACM, New York (2010)
Pickering, M.J., Rüger, S.: Evaluation of key frame-based retrieval techniques for video. Comput. Vis. Image Underst. 92(2-3), 217–235 (2003)
Browne, P., Smeaton, A.F.: Video retrieval using dialogue, keyframe similarity and video objects. In: ICIP (3), pp. 1208–1211 (2005)
Sze, K.W., Lam, K.M., Qiu, G.: A new key frame representation for video segment retrieval. IEEE Transactions on Circuits and Systems for Video Technology 15(9), 1148–1155 (2005)
Girgensohn, A., Boreczky, J.: Time-constrained keyframe selection technique. Multimedia Tools Appl. 11(3), 347–358 (2000)
Veltkamp, R.C., Latecki, L.J.: Properties and performances of shape similarity measures. In: Content-Based Retrieval (2006)
Rautkorpi, R., Iivarinen, J.: A Novel Shape Feature for Image Classification and Retrieval. In: Campilho, A.C., Kamel, M.S. (eds.) ICIAR 2004, Part I. LNCS, vol. 3211, pp. 753–760. Springer, Heidelberg (2004)
Brandt, S., Laaksonen, J., Oja, E.: Statistical shape features for content-based image retrieval. J. Math. Imaging Vis. 17(2), 187–198 (2002)
Chalechale, A., Mertins, A., Naghdy, G.: Edge image description using angular radial partitioning. IEE Proceedings Vision, Image and Signal Processing 151(2), 93–101 (2004)
Bober, M.: Mpeg-7 visual shape descriptors. IEEE Trans. Cir. and Sys. for Video Technol. 11(6), 716–719 (2001)
Anselmi, N.: Shot boundary detection in opencv. Wiki (2011), http://mmlab.disi.unitn.it/wiki/index.php/Shot_Boundary_Detection_in_OpenCV
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Mai, D., Hoang, K. (2012). Caption Text and Keyframe Based Video Retrieval System. In: Nguyen, NT., Hoang, K., JÈ©drzejowicz, P. (eds) Computational Collective Intelligence. Technologies and Applications. ICCCI 2012. Lecture Notes in Computer Science(), vol 7654. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-34707-8_25
Download citation
DOI: https://doi.org/10.1007/978-3-642-34707-8_25
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-34706-1
Online ISBN: 978-3-642-34707-8
eBook Packages: Computer ScienceComputer Science (R0)