Abstract
In this article, we present a robust scheme for detection of Devanagari or Bangla texts in scene images. These are the two most popular scripts in India. The proposed scheme is primarily based on two major characteristics of such texts - (i) variations in stroke thickness for text components of a script are low compared to their non-text counterparts and (ii) presence of a headline along with a few vertical downward strokes originating from this headline. We use the Euclidean distance transform to verify the general characteristics of texts in (i). Also, we apply the probabilistic Hough line transform to detect the characteristic headline of Devanagari and Bangla texts. Further, similarity and adjacency measures are applied to identify text regions, which do not satisfy the verification in (ii). The proposed approach has been simulated on a repository of 120 images taken from Indian roads and the results are encouraging. Also, we have discussed the applicability of the proposed scheme for detection of English texts. Towards this end, we have considered the training and test samples from the image database of ICDAR 2003 Robust Reading Competition.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Liang, J., Doermann, D., Li, H.: Camera Based Analysis of Text and Documents: A Survey. Int. Journ. on Doc. Anal. and Recog. 7, 84–104 (2005)
Jung, K., Kim, K.I., Jain, A.K.: Text Information Extraction in Images and Video: a Survey. Pattern Recognition 37, 977–997 (2004)
Li, H., Doermann, D., Kia, O.: Automatic Text Detection and Tracking in Digital Video. IEEE Trans. Image Processing 9, 147–167 (2000)
Gllavata, J., Ewerth, R., Freisleben, B.: Text Detection in Images Based on Unsupervised Classification of High Frequency Wavelet Coefficients. In: Proc. of 17th Int. Conf. on Patt. Recog., vol. 1, pp. 425–428 (2004)
Saoi, T., Goto, H., Kobayashi, H.: Text Detection in Color Scene Images Based on Unsupervised Clustering of Multihannel Wavelet Features. In: Proc. of 8th Int. Conf. on Doc. Anal. and Recog., pp. 690–694 (2005)
Ezaki, N., Bulacu, M., Schomaker, L.: Text Detection From Natural Scene Images: Towards a System for Visually Impaired Persons. In: Proc. of 17th Int. Conf. on Patt. Recog., vol. II, pp. 683–686 (2004)
Ye, Q., Huang, Q., Gao, W., Zhao, D.: Fast and Robust Text Detection in Images and Video Frames. Image and Vis. Comp. 23, 565–576 (2005)
Subramanian, K., Natarajan, P., Decerbo, M., Castan̈on, D.: Character-Stroke Detection for Text-Localization and Extraction. In: Proc. of Int. Conf. on Doc. Anal. and Recog., pp. 33–37 (2005)
Epshtein, B., Ofek, E., Wexler, Y.: Detecting Text in Natural Scenes with Stroke Width Transform. In: Proc. of IEEE Conf. on Comp. Vis. and Patt. Recog., pp. 2963–2970 (2010)
Bhattacharya, U., Parui, S.K., Mondal, S.: Devanagari and Bangla Text Extraction from Natural Scene Images. In: 10th Int. Conf. on Doc. Anal. and Recog., pp. 171–175 (2009)
Kumar, S., Perrault, A.: Text Detection on Nokia N900 Using Stroke Width Transform, http://www.cs.cornell.edu/courses/cs4670/2010fa/projects/final/results/group_of_arp86_sk2357/Writeup.pdf (last accessed on October 31, 2011)
Canny, J.: A Computational Approach to Edge Detection. IEEE Trans. Patt. Anal. and Mach. Intell. 8, 679–714 (1986)
Borgefors, G.: Distance Transformations in Digital Images. Comp. Vis., Graph. and Image Proc. 34, 344–371 (1986)
Matas, J., Galambos, C., Kittler, J.: Progressive Probabilistic Hough Transform. In: Proc. of BMVC 1998, vol. 1, pp. 256–265 (1998)
Bradski, G., Kaehler, A.: Learning OpenCV. O’Reilly Media, Inc. (2008)
Lucas, S.M., et al.: ICDAR 2003 Robust Reading Competitions. In: Proc. of 7th Int. Conf. on Doc. Anal. and Recog., pp. 682–668 (2003)
Zhou, L., Lu, Y., Tan, C.L.: Bangla/English Script Identification Based on Analysis of Connected Component Profiles. In: Proc. Doc. Anal. Syst., pp. 243–254 (2006)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Roy Chowdhury, A., Bhattacharya, U., Parui, S.K. (2012). Text Detection of Two Major Indian Scripts in Natural Scene Images. In: Iwamura, M., Shafait, F. (eds) Camera-Based Document Analysis and Recognition. CBDAR 2011. Lecture Notes in Computer Science, vol 7139. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-29364-1_4
Download citation
DOI: https://doi.org/10.1007/978-3-642-29364-1_4
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-29363-4
Online ISBN: 978-3-642-29364-1
eBook Packages: Computer ScienceComputer Science (R0)