Abstract
Text segmentation in a video is drawing attention of researchers in the field of image processing, pattern recognition and document image analysis because it helps in annotating and labeling video events accurately. We propose a novel idea of generating an enhanced frame from the R, G, and B channels of an input frame by grouping high and low values using Min–Max clustering criteria. We also perform sliding window on enhanced frame to group high and low values from the neighboring pixel values to further enhance the frame. Subsequently, we use k-means with k = 2 clustering algorithm to separate text and non-text regions. The fully connected components will be identified in the skeleton of the frame obtained by k-means clustering. Concept of connected component analysis based on gradient feature has been adapted for the purpose of symmetry verification. The components which satisfy symmetric verification are selected to be the representatives of text regions and they are permitted to grow to cover their respective region fully containing text. The method is tested on variety of video frames to evaluate the performance of the method in terms of recall, precision and f-measure. The results show that method is promising and encouraging.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Sharma N, Pal U, Blumenstein M (2012) Recent advances in video based document processing: a review. In: Proceedings of DAS, pp 63–68
Zang J, Kasturi J (2008) Extraction of text objects in video documents: recent progress. In: Proceedings of DAS, pp 5–17
Jung K, Kim K, Jain K (2004) Text information extraction in images and video: a survey. Pattern Recogn 37:977–997
Doermann D, Liang J, Li J (2003) Progress in camera-based document image analysis. In: Proceedings of ICDAR, pp 606–616
Jung K (2001) Neural network-based text location in color images. Pattern Recogn Lett 22:1503–1515
Ye Q, Huang Q, Gao W, Zhao D (2005) Fast and robust text detection in images and videos frames. Image Vis Comput 23:565–576
Chen D, Odobez JM, Bourlard H (2004) Text detection and recognition in images and video frames. Pattern Recog 37:595–608
Neumann L, Matas J (2012) Real-time scene text localization and recognition. In: Proceedings of CVPR, pp 3538–3545
Yao C, Bai X, Liu W, Ma Y, Tu Z (2012) Detecting texts of arbitrary orientations in natural images. In: Proceedings of CVPR, pp 1083–1090
Epshtein B, Ofek E, Wexler Y (2010) Detecting text in natural scenes with stroke width transform. In: Proceedings of CVPR, pp 2963–2970
Jain AK, Yu B (1998) Automatic text location in images and video frames. Pattern Recogn 31:2055–2076 (1998)
Mariano VY, Kasturi R (2000) Locating uniform-colored text in video frames. In: Proceedings of ICPR, pp 539–542
Wu V, Manmatha V, Riseman EM (1999) Text finder: an automatic system to detect and recognize text in images. IEEE Trans PAMI 21:1224–1229
Kim KL, Jung K, Kim JH (2003) Texture-based approach for text detection in images using support vector machines and continuous adaptive mean shift algorithm. IEEE Trans PAMI 25:1631–1639
Shivakumara P, Phan TQ, Tan CL (2011) A laplacian approach to multi-oriented text detection in video. IEEE Trans PAMI 33:412–419
Shivakumara P, Sreedhar RP, Phan TQ, Lu S, Tan CL (2012) Multi-oriented video scene text detection through bayesian classification and boundary growing. IEEE Trans CSVT 22:1227–1235
Zhou J, Xu L, Xiao B, Dai R (2007) A robust system for text extraction in video. In: Proceedings of ICMV, pp 119–124
Lu C, Wang C, Dai R (2005) Text detection in images based on unsupervised classification of edge-based features. In: Proceedings of ICDAR pp 610–614
Wong EK, Chen M (2003) A new robust algorithm for video text extraction. Pattern Recogn 36:1397–1406
Guru DS, Manjunath S, Shivakumara P, Tan CL (2010) An eigen value based approach for text detection in video. In: Proceedings of DAS, pp 501–506
Basavanna M, Shivakumara P, Srivatsa SK, Hemantha Kumar G (2011) A new run-length based method for scene text detection. In: Proceedings of IICAI, pp 1730–1736
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer India
About this paper
Cite this paper
Shivakumara, P., Guru, D.S., Basavaraju, H.T. (2013). Color and Gradient Features for Text Segmentation from Video Frames. In: Swamy, P., Guru, D. (eds) Multimedia Processing, Communication and Computing Applications. Lecture Notes in Electrical Engineering, vol 213. Springer, New Delhi. https://doi.org/10.1007/978-81-322-1143-3_22
Download citation
DOI: https://doi.org/10.1007/978-81-322-1143-3_22
Published:
Publisher Name: Springer, New Delhi
Print ISBN: 978-81-322-1142-6
Online ISBN: 978-81-322-1143-3
eBook Packages: EngineeringEngineering (R0)