Skip to main content

An Effective Approach Towards Video Text Recognition

  • Conference paper
Advances in Signal Processing and Intelligent Recognition Systems

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 264))

  • 2774 Accesses

Abstract

With the rapid increase in digital video capture and editing technologies in recent times there is a great demand for Semantic-based Video Analysis and indexing algorithms. Video Text is an important semantic clue in video content analysis such as video information retrieval and summarization.In this paper, a Video text detection and localization algorithm is proposed that extracts the Eigen feature embodied in wavelet edge map. Also described is an iterative variance based threshold calculation method for Video text binarization. Experimental results on diverse videos demonstrate the robustness and efficiency of proposed method for detecting and recognizing superimposed text and multi-oriented scene text.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Jawahar Anand Mishra, C.V., Alahari, K.: An mrf model for binarization of natural scene text. In: International Conference on Document Analysis and Recognition (ICDAR) (2011)

    Google Scholar 

  2. Ofek, E., Epshtein, B., Wexler, Y.: Detecting text in natural scenes with stroke width transform. In: Proc. of Computer Vision and Pattern Recognition (2010)

    Google Scholar 

  3. Wang, C., Liu, C., Dai, R.: Text detection in images based on unsupervised classification of edge-based features. In: IEEE ICDAR (2005)

    Google Scholar 

  4. Jung, K., Lee, C.W., Kim, H.J.: Automatic text detection and removal in video sequences. Pattern Recognition Letters (2003)

    Google Scholar 

  5. Liu, W., Ma, Y., Yao, C., Bai, X., Tu, Z.: Detecting texts of arbitrary orientations in natural images. In: CVPR (2012)

    Google Scholar 

  6. Doermann, D., Li, H., Kia, O.: Automatic text detection and tracking in digital video. IEEE Transactions on Image Processing (2000)

    Google Scholar 

  7. Yang, H.S.H., Quehl, B.: Text detection in video images using adaptive edge detection and stroke width verification. In: IWSSIP (2013)

    Google Scholar 

  8. Dunn, J.C.: A fuzzy relative of the isodata process and its use in detecting compact, well separated cluster. Cybernatics (1973)

    Google Scholar 

  9. Ewerth, R., Gllavata, J., Phan, T.-Q., FreislebenP. Shivakumara, B., Tan, C.-L.: Video text detection based on filters and edge features. In: Proc. of the 2009 Int. Conf. on Multimedia and Expo (2009)

    Google Scholar 

  10. Sauvola, J.J., Pietikainen, M.: Adaptive document image binarization. Pattern Recognition (2000)

    Google Scholar 

  11. Hua, X.-S., Xi, J.: A video text detection and recognition system. In: IEEE International Conference on Multimedia and Expo (2001)

    Google Scholar 

  12. Effelsberg, W., Lienhart, R.: Automatic text segmentation and text recognition for video indexing. In: ACM/Springer Multimedia Sys. (2000)

    Google Scholar 

  13. Mariano, V.Y., Kasturi, R.: Locating uniform-colored text in video frames. In: Proc. 15th Int. Conf. Pattern Recognition (2000)

    Google Scholar 

  14. Kender., J.R., Merler, M.: Semantic keyword extraction via adaptive text binarization of unstructured unsourced video. In: 16th IEEE International Conference on Image Processing (ICIP) (2009)

    Google Scholar 

  15. Li, S., Zhao, M., Kwok, J.: Text detection in images using sparse representation with discriminative dictionaries. Journal of Image and Vision Computing (2010)

    Google Scholar 

  16. Otsu, N.: A threshold selection method from gray-level histograms. IEEE Trans. Syst., Man, Cybern. (1979)

    Google Scholar 

  17. Tan, C.L., Shivakumara, P., Huang, W.H.: Efficient video text detection using edge features. In: ICPR (2008)

    Google Scholar 

  18. Wang, W., Zeng, W., Ye, Q., Gao, W.: A robust text detection algorithm in images and video frames. Information, Communications and Signal Processing (2003)

    Google Scholar 

  19. Gao, W., Ye, Q., Huang, Q., Zhao, D.: Fast and robust text detection in images and video frames. Image and Vision Computing (2005)

    Google Scholar 

  20. Saidane, Z., Garcia, C.: Robust binarization for video text recognition. In: International Conference on Document Analysis and Recognition (ICDAR) (2007)

    Google Scholar 

  21. Hughes, E.K., Sato, T., Kanade, T., Smith, M.A.: Video ocr for digital news archive. In: Proc. IEEE International Workshop on Content Based Access of Image and Video Libraries (1998)

    Google Scholar 

  22. Niblack, W.: An introduction to digital image processing. Prentice Hill (1986)

    Google Scholar 

  23. Lam, K.K.M., Mao, W., Chung, F., Siu, W.: Hybrid chinese/english text detection in images and video frames. In: ICPR (2002)

    Google Scholar 

  24. Jain, A.K., Zhong, Y., Zhang, H.J.: Automatic caption localization in compressed video. IEEE Transactions on PAMI (2000)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Prakash Sudir .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Sudir, P., Ravishankar, M. (2014). An Effective Approach Towards Video Text Recognition. In: Thampi, S., Gelbukh, A., Mukhopadhyay, J. (eds) Advances in Signal Processing and Intelligent Recognition Systems. Advances in Intelligent Systems and Computing, vol 264. Springer, Cham. https://doi.org/10.1007/978-3-319-04960-1_29

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-04960-1_29

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-04959-5

  • Online ISBN: 978-3-319-04960-1

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics