Color and Gradient Features for Text Segmentation from Video Frames

Shivakumara, P.; Guru, D. S.; Basavaraju, H. T.

doi:10.1007/978-81-322-1143-3_22

Color and Gradient Features for Text Segmentation from Video Frames

P. Shivakumara³,
D. S. Guru⁴ &
H. T. Basavaraju⁴

Conference paper
First Online: 01 January 2013

939 Accesses
3 Citations
2 Altmetric

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 213))

Abstract

Text segmentation in a video is drawing attention of researchers in the field of image processing, pattern recognition and document image analysis because it helps in annotating and labeling video events accurately. We propose a novel idea of generating an enhanced frame from the R, G, and B channels of an input frame by grouping high and low values using Min–Max clustering criteria. We also perform sliding window on enhanced frame to group high and low values from the neighboring pixel values to further enhance the frame. Subsequently, we use k-means with k = 2 clustering algorithm to separate text and non-text regions. The fully connected components will be identified in the skeleton of the frame obtained by k-means clustering. Concept of connected component analysis based on gradient feature has been adapted for the purpose of symmetry verification. The components which satisfy symmetric verification are selected to be the representatives of text regions and they are permitted to grow to cover their respective region fully containing text. The method is tested on variety of video frames to evaluate the performance of the method in terms of recall, precision and f-measure. The results show that method is promising and encouraging.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Hardcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

Sharma N, Pal U, Blumenstein M (2012) Recent advances in video based document processing: a review. In: Proceedings of DAS, pp 63–68
Google Scholar
Zang J, Kasturi J (2008) Extraction of text objects in video documents: recent progress. In: Proceedings of DAS, pp 5–17
Google Scholar
Jung K, Kim K, Jain K (2004) Text information extraction in images and video: a survey. Pattern Recogn 37:977–997
Google Scholar
Doermann D, Liang J, Li J (2003) Progress in camera-based document image analysis. In: Proceedings of ICDAR, pp 606–616
Google Scholar
Jung K (2001) Neural network-based text location in color images. Pattern Recogn Lett 22:1503–1515
Google Scholar
Ye Q, Huang Q, Gao W, Zhao D (2005) Fast and robust text detection in images and videos frames. Image Vis Comput 23:565–576
Google Scholar
Chen D, Odobez JM, Bourlard H (2004) Text detection and recognition in images and video frames. Pattern Recog 37:595–608
Google Scholar
Neumann L, Matas J (2012) Real-time scene text localization and recognition. In: Proceedings of CVPR, pp 3538–3545
Google Scholar
Yao C, Bai X, Liu W, Ma Y, Tu Z (2012) Detecting texts of arbitrary orientations in natural images. In: Proceedings of CVPR, pp 1083–1090
Google Scholar
Epshtein B, Ofek E, Wexler Y (2010) Detecting text in natural scenes with stroke width transform. In: Proceedings of CVPR, pp 2963–2970
Google Scholar
Jain AK, Yu B (1998) Automatic text location in images and video frames. Pattern Recogn 31:2055–2076 (1998)
Google Scholar
Mariano VY, Kasturi R (2000) Locating uniform-colored text in video frames. In: Proceedings of ICPR, pp 539–542
Google Scholar
Wu V, Manmatha V, Riseman EM (1999) Text finder: an automatic system to detect and recognize text in images. IEEE Trans PAMI 21:1224–1229
Google Scholar
Kim KL, Jung K, Kim JH (2003) Texture-based approach for text detection in images using support vector machines and continuous adaptive mean shift algorithm. IEEE Trans PAMI 25:1631–1639
Google Scholar
Shivakumara P, Phan TQ, Tan CL (2011) A laplacian approach to multi-oriented text detection in video. IEEE Trans PAMI 33:412–419
Google Scholar
Shivakumara P, Sreedhar RP, Phan TQ, Lu S, Tan CL (2012) Multi-oriented video scene text detection through bayesian classification and boundary growing. IEEE Trans CSVT 22:1227–1235
Google Scholar
Zhou J, Xu L, Xiao B, Dai R (2007) A robust system for text extraction in video. In: Proceedings of ICMV, pp 119–124
Google Scholar
Lu C, Wang C, Dai R (2005) Text detection in images based on unsupervised classification of edge-based features. In: Proceedings of ICDAR pp 610–614
Google Scholar
Wong EK, Chen M (2003) A new robust algorithm for video text extraction. Pattern Recogn 36:1397–1406
Google Scholar
Guru DS, Manjunath S, Shivakumara P, Tan CL (2010) An eigen value based approach for text detection in video. In: Proceedings of DAS, pp 501–506
Google Scholar
Basavanna M, Shivakumara P, Srivatsa SK, Hemantha Kumar G (2011) A new run-length based method for scene text detection. In: Proceedings of IICAI, pp 1730–1736
Google Scholar

Download references

Author information

Authors and Affiliations

Multimedia Unit, Faculty of Computer Science and Information Technology, University of Malaya, Kuala Lumpur, Malaysia
P. Shivakumara
Department of Studies in Computer Science, University of Mysore, Mysore, Karnataka, India
D. S. Guru & H. T. Basavaraju

Authors

P. Shivakumara
View author publications
You can also search for this author in PubMed Google Scholar
D. S. Guru
View author publications
You can also search for this author in PubMed Google Scholar
H. T. Basavaraju
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to P. Shivakumara .

Editor information

Editors and Affiliations

Master of Computer Applications, PES Institute of Technology, Banashankari 3rd stage, Near Hoskerehalli Cross 100 Feet, Bangalore, 560085, Karnataka, India
Punitha P. Swamy
Studies in Computer Science, University of Mysore, Manasagangotri, Mysore, 570006, Karnataka, India
Devanur S. Guru

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Shivakumara, P., Guru, D.S., Basavaraju, H.T. (2013). Color and Gradient Features for Text Segmentation from Video Frames. In: Swamy, P., Guru, D. (eds) Multimedia Processing, Communication and Computing Applications. Lecture Notes in Electrical Engineering, vol 213. Springer, New Delhi. https://doi.org/10.1007/978-81-322-1143-3_22

Download citation

DOI: https://doi.org/10.1007/978-81-322-1143-3_22
Published: 26 May 2013
Publisher Name: Springer, New Delhi
Print ISBN: 978-81-322-1142-6
Online ISBN: 978-81-322-1143-3
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics