An Effective Approach Towards Video Text Recognition

Sudir, Prakash; Ravishankar, M.

doi:10.1007/978-3-319-04960-1_29

Prakash Sudir⁵ &
M. Ravishankar⁵

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 264))

2774 Accesses

Abstract

With the rapid increase in digital video capture and editing technologies in recent times there is a great demand for Semantic-based Video Analysis and indexing algorithms. Video Text is an important semantic clue in video content analysis such as video information retrieval and summarization.In this paper, a Video text detection and localization algorithm is proposed that extracts the Eigen feature embodied in wavelet edge map. Also described is an iterative variance based threshold calculation method for Video text binarization. Experimental results on diverse videos demonstrate the robustness and efficiency of proposed method for detecting and recognizing superimposed text and multi-oriented scene text.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Jawahar Anand Mishra, C.V., Alahari, K.: An mrf model for binarization of natural scene text. In: International Conference on Document Analysis and Recognition (ICDAR) (2011)
Google Scholar
Ofek, E., Epshtein, B., Wexler, Y.: Detecting text in natural scenes with stroke width transform. In: Proc. of Computer Vision and Pattern Recognition (2010)
Google Scholar
Wang, C., Liu, C., Dai, R.: Text detection in images based on unsupervised classification of edge-based features. In: IEEE ICDAR (2005)
Google Scholar
Jung, K., Lee, C.W., Kim, H.J.: Automatic text detection and removal in video sequences. Pattern Recognition Letters (2003)
Google Scholar
Liu, W., Ma, Y., Yao, C., Bai, X., Tu, Z.: Detecting texts of arbitrary orientations in natural images. In: CVPR (2012)
Google Scholar
Doermann, D., Li, H., Kia, O.: Automatic text detection and tracking in digital video. IEEE Transactions on Image Processing (2000)
Google Scholar
Yang, H.S.H., Quehl, B.: Text detection in video images using adaptive edge detection and stroke width verification. In: IWSSIP (2013)
Google Scholar
Dunn, J.C.: A fuzzy relative of the isodata process and its use in detecting compact, well separated cluster. Cybernatics (1973)
Google Scholar
Ewerth, R., Gllavata, J., Phan, T.-Q., FreislebenP. Shivakumara, B., Tan, C.-L.: Video text detection based on filters and edge features. In: Proc. of the 2009 Int. Conf. on Multimedia and Expo (2009)
Google Scholar
Sauvola, J.J., Pietikainen, M.: Adaptive document image binarization. Pattern Recognition (2000)
Google Scholar
Hua, X.-S., Xi, J.: A video text detection and recognition system. In: IEEE International Conference on Multimedia and Expo (2001)
Google Scholar
Effelsberg, W., Lienhart, R.: Automatic text segmentation and text recognition for video indexing. In: ACM/Springer Multimedia Sys. (2000)
Google Scholar
Mariano, V.Y., Kasturi, R.: Locating uniform-colored text in video frames. In: Proc. 15th Int. Conf. Pattern Recognition (2000)
Google Scholar
Kender., J.R., Merler, M.: Semantic keyword extraction via adaptive text binarization of unstructured unsourced video. In: 16th IEEE International Conference on Image Processing (ICIP) (2009)
Google Scholar
Li, S., Zhao, M., Kwok, J.: Text detection in images using sparse representation with discriminative dictionaries. Journal of Image and Vision Computing (2010)
Google Scholar
Otsu, N.: A threshold selection method from gray-level histograms. IEEE Trans. Syst., Man, Cybern. (1979)
Google Scholar
Tan, C.L., Shivakumara, P., Huang, W.H.: Efficient video text detection using edge features. In: ICPR (2008)
Google Scholar
Wang, W., Zeng, W., Ye, Q., Gao, W.: A robust text detection algorithm in images and video frames. Information, Communications and Signal Processing (2003)
Google Scholar
Gao, W., Ye, Q., Huang, Q., Zhao, D.: Fast and robust text detection in images and video frames. Image and Vision Computing (2005)
Google Scholar
Saidane, Z., Garcia, C.: Robust binarization for video text recognition. In: International Conference on Document Analysis and Recognition (ICDAR) (2007)
Google Scholar
Hughes, E.K., Sato, T., Kanade, T., Smith, M.A.: Video ocr for digital news archive. In: Proc. IEEE International Workshop on Content Based Access of Image and Video Libraries (1998)
Google Scholar
Niblack, W.: An introduction to digital image processing. Prentice Hill (1986)
Google Scholar
Lam, K.K.M., Mao, W., Chung, F., Siu, W.: Hybrid chinese/english text detection in images and video frames. In: ICPR (2002)
Google Scholar
Jain, A.K., Zhong, Y., Zhang, H.J.: Automatic caption localization in compressed video. IEEE Transactions on PAMI (2000)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of IS, DSCE, Visvesvaraya Technological University, Bangalore, India
Prakash Sudir & M. Ravishankar

Authors

Prakash Sudir
View author publications
You can also search for this author in PubMed Google Scholar
M. Ravishankar
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Prakash Sudir .

Editor information

Editors and Affiliations

Technopark Campus, Indian Institute of Information Technology and Management – Kerala (IIITM-K), Trivandrum, Kerala, India
Sabu M. Thampi
Center for Computing Research, National Polytechnic Institute, Mexico City, Mexico
Alexander Gelbukh
Department of Computer Science and Engineering, Indian Institute of Technology, Kharagpur, India
Jayanta Mukhopadhyay

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Sudir, P., Ravishankar, M. (2014). An Effective Approach Towards Video Text Recognition. In: Thampi, S., Gelbukh, A., Mukhopadhyay, J. (eds) Advances in Signal Processing and Intelligent Recognition Systems. Advances in Intelligent Systems and Computing, vol 264. Springer, Cham. https://doi.org/10.1007/978-3-319-04960-1_29

Download citation

DOI: https://doi.org/10.1007/978-3-319-04960-1_29
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-04959-5
Online ISBN: 978-3-319-04960-1
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics