Abstract
In this paper, we propose an efficient Arabic text detection method based on the Laplacian operator in the frequency domain. The zero crossing value is computed for each pixel in the Laplacian-filtered image to found edges in four directions. K-means is then used to classify all the pixels of the filtered image into two clusters: text and non-text. For each candidate text region, the corresponding region in the canny edge map of the input image undergoes projection profile analysis to determine the boundary of the text blocks. Finally, we employ empirical rules to eliminate false positives based on geometrical properties. Experimental results show that the proposed algorithm is able to detect texts of different fonts, contrasts and backgrounds. Moreover, it outperforms four existing algorithms in terms of detection and false positive rates.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Jain, A.K., Yu, B.: Automatic Text Location in Images and Video Frames. Pattern Recotion 31(12), 2055–2076 (1998)
Liu, C., Wang, C., Dai, R.: Text Detection in Images Based on Unsupervised Classification of Edge-based Features. In: IEEE ICDAR, pp. 610–661 (2005)
Lee, C.W., Jung, K., Kim, H.J.: Automatic text detection and removal in video sequences. Pattern Recognition Letters 24, 2607–2623 (2003)
Jung, K., Kim, K.I., Jain, A.K.: Text information extraction in images and video: a survey. Pattern Recognition 37, 977–997 (2004)
Mariano, V.Y., Kasturi, R.: Locating Uniform-Colored Text in Video Frames. In: IEEE 15th ICPR, vol. 4, pp. 539–542 (2000)
Ye, Q., Huang, Q., Gao, W., Zhao, D.: Fast and robust text detection in images and video frames. Image and Vision Computing 23, 565–576 (2005)
Ye, Q., Gao, W., Wang, W., Zeng, W.: A Robust Text Detection Algorithm in Images and Video Frames. In: IEEE ICICSPCM, pp. 802–806 (2003)
Antani, S., Crandall, D., Kasturi, R.: Robust Extraction of Text in Video. In: IEEE 15th ICPR, vol. 1, pp. 831–834 (2000)
Phan, T.Q., Shivakumara, P., Tan, C.L.: A Laplacian Method for Video Text Detection. In: 2009 10th International Conference on Document Analysis and Recognition (2009)
Phan, T.Q., Shivakumara, P., Tan, C.L.: A Laplacian Approach to Multi-Oriented Text Detection in Video. In: IEEE Transactions on Pattern Analysis and Machine Intelligence (2010)
Mariano, V.Y., Kasturi, R.: Locating Uniform- Colored Text in Video Frames. In: 15th ICPR, vol. 4, pp. 539–542 (2000)
Zhong, Y., Zhang, H., Jain, A.K.: Automatic Caption Localization in Compressed Video. IEEE Trans. Pattern Analysis and Machine Intelligence 22(4), 385–392 (2000)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag GmbH Berlin Heidelberg
About this paper
Cite this paper
Ahmad, A.M.A., Alqutami, A., Atoum, J. (2012). A Robust Algorithm for Arabic Video Text Detection. In: Gaol, F., Nguyen, Q. (eds) Proceedings of the 2011 2nd International Congress on Computer Applications and Computational Science. Advances in Intelligent and Soft Computing, vol 145. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-28308-6_35
Download citation
DOI: https://doi.org/10.1007/978-3-642-28308-6_35
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-28307-9
Online ISBN: 978-3-642-28308-6
eBook Packages: EngineeringEngineering (R0)