Abstract
Before the step for text recognition, a text image needs to be segmented into foreground containing only the text area and background. In this paper, a method is proposed for segmenting colour natural scene texts which suffer from a wide range of degradations with complex background. A text image is firstly processed by two 3-means clustering operations with different distance measurements. Then, a modified connected component (CC)-based validation method is used to obtain the text area after clustering. Thirdly, a proposed objective segmentation evaluation method is utilised to choose the final segmentation result from the two segmented text images. The proposed method is compared with other existing methods based on the ICDAR2003 public database. Experimental results show the effectiveness of the proposed method.
Keywords
- natural scene text segmentation
- k-means clustering
- connected component analysis (CCA)
- segmentation evaluation
This is a preview of subscription content, access via your institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Jung, K., Kim, K.I., Jain, A.K.: Text Information Extraction in Images and Video: A Survey. Pattern Recognition 37(5), 977–997 (2004)
Kasar, T., Kumar, J., Ramakrishnan, A.G.: Font and Background Color Independent Text Binarization. In: 2nd CBDAR, pp. 3–9 (2007)
Kasar, T., Ramakrishnan, A.G.: COCOCLUST: Contour-based Color Clustering for Robust Binarization of Colored Text. In: 3rd CBDAR, pp. 11–17 (2009)
Yu, J., Huang, L., Liu, C.: Double-edge-model Based Character Stroke Extraction from Complex Backgrounds. In: 19th ICPR 2008, pp. 1–4 (2008)
Ye, X., Cheriet, M., Suen, C.Y.: Stroke-model-based Character Extraction from Gray-level Document Images. IEEE Transactions on Image Processing 10(8), 1152–1161 (2001)
Li, X., Wang, W., Huang, Q., Gao, W., Qing, L.: A Hybrid Text Segmentation Approach. In: ICME, pp. 510–513 (2009)
Zhou, Z., Li, L., Tan, C.L.: Edge Based Binarization for Video Text Images. In: 20th ICPR, pp. 133–136 (2010)
Yokobayashi, M., Wakahara, T.: Segmentation and Recognition of Characters in Scene Images Using Selective Binarization in Color Space and GAT Correlation. In: ICDAR, vol. 1, pp. 167–171 (2005)
Yokobayashi, M., Wakahara, T.: Binarization and Recognition of Degraded Characters Using A Maximum Separability Axis in Color Space and GAT Correlation. In: 18th ICPR, vol. 2, pp. 885–888 (2006)
Thillou, C.M., Gosselin, B.: Color Text Extraction from Camera-based Images: the Impact of the Choice of the Clustering Distance. In: ICDAR, vol. 1, pp. 312–316 (2005)
Thillou, C.M., Gosselin, B.: Color Text Extraction with Selective Metric-based Clustering. Computer Vision and Image Understanding 107(1-2), 97–107 (2007)
Correia, P.L., Pereira, F.: Objective Evaluation of Video Segmentation Quality. IEEE Transactions on Image Processing 12(2), 186–200 (2003)
Otsu, N.: A Threshold Selection Method from Gray-level Histograms. IEEE Transactions on Systems, Man & Cybernetics SMC-9(1), 62–66 (1979)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Zeng, C., Jia, W., He, X. (2012). An Algorithm for Colour-Based Natural Scene Text Segmentation. In: Iwamura, M., Shafait, F. (eds) Camera-Based Document Analysis and Recognition. CBDAR 2011. Lecture Notes in Computer Science, vol 7139. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-29364-1_5
Download citation
DOI: https://doi.org/10.1007/978-3-642-29364-1_5
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-29363-4
Online ISBN: 978-3-642-29364-1
eBook Packages: Computer ScienceComputer Science (R0)
