Caption Text and Keyframe Based Video Retrieval System

  • Conference paper
Computational Collective Intelligence. Technologies and Applications (ICCCI 2012)

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 7654)

Abstract

In this paper, we present a framework for video retrieval using caption text and keyframe similarity. To extract caption text, we detect and extract the image areas that contain caption text, use the Tesseract-OCR engine to convert them into plain text, and then use the Hunspell library to spell-check the recognized words. Next, we use the CLucene search engine to index and query this text. We apply the shape descriptors APR and ECM to describe the keyframes of the video shots and use those descriptors as the feature vectors of the shots. We then use the ANN library to index and search these feature vectors. The system, built as a web-based application using ASP.NET, supports keyword-based and keyframe-based queries. The experimental results are very promising.
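
The two query modes described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: a plain in-memory inverted index stands in for CLucene, an exhaustive Euclidean scan stands in for the ANN library, and the feature vectors are made-up numbers in place of real APR and ECM descriptors. The class name `ShotIndex`, its methods, and the sample shots are all hypothetical.

```python
import math
from collections import defaultdict


class ShotIndex:
    """Toy index over video shots: caption words plus one feature vector per shot.

    Assumption: captions are already OCR'd and spell-checked, and each shot
    already has a fixed-length keyframe feature vector.
    """

    def __init__(self):
        self.postings = defaultdict(set)  # word -> ids of shots whose caption has it
        self.features = {}                # shot id -> feature vector

    def add_shot(self, shot_id, caption_text, feature_vector):
        # Index every caption word (lowercased, whitespace-tokenized).
        for word in caption_text.lower().split():
            self.postings[word].add(shot_id)
        self.features[shot_id] = list(feature_vector)

    def search_keywords(self, query):
        """Return sorted ids of shots whose caption contains every query word."""
        words = query.lower().split()
        if not words:
            return []
        result = set(self.postings.get(words[0], set()))
        for word in words[1:]:
            result &= self.postings.get(word, set())
        return sorted(result)

    def search_keyframe(self, query_vector, k=1):
        """Return the k shot ids nearest to query_vector (exhaustive scan)."""
        ranked = sorted(
            self.features,
            key=lambda sid: math.dist(self.features[sid], query_vector),
        )
        return ranked[:k]


# Hypothetical sample data, for illustration only.
index = ShotIndex()
index.add_shot("shot1", "Breaking news flood warning", [0.10, 0.90, 0.30])
index.add_shot("shot2", "Weather forecast flood risk", [0.80, 0.20, 0.50])
index.add_shot("shot3", "Sports results tonight", [0.15, 0.85, 0.35])

print(index.search_keywords("flood warning"))           # keyword-based query
print(index.search_keyframe([0.10, 0.90, 0.30], k=2))   # keyframe-based query
```

A real system would replace the exhaustive scan with an approximate nearest-neighbor structure, since scanning every shot grows linearly with the size of the collection.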

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Mai, D., Hoang, K. (2012). Caption Text and Keyframe Based Video Retrieval System. In: Nguyen, NT., Hoang, K., Jędrzejowicz, P. (eds) Computational Collective Intelligence. Technologies and Applications. ICCCI 2012. Lecture Notes in Computer Science, vol 7654. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-34707-8_25

  • DOI: https://doi.org/10.1007/978-3-642-34707-8_25

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-34706-1

  • Online ISBN: 978-3-642-34707-8

  • eBook Packages: Computer Science, Computer Science (R0)