Abstract
Localizing text in camera-captured images with complex backgrounds is a growing need in modern IT-enabled services. Most current text localization techniques are sensitive to text features such as color, size, and style, and to background clutter. Among the methods proposed in the literature, the stroke filter is considerably more effective at localizing text. However, a traditional stroke filter has a fixed width, so it can segment only strokes or text whose width falls within a predefined range. The proposed method uses an adaptive stroke filter based on the Fuzzy Distance Transform, which can effectively localize text regions in camera-captured images with complex backgrounds. The method was applied experimentally to a database of 600 images, and the visual quality of the resulting text segmentation is quite impressive. To measure accuracy, the method was applied to a set of 16 test images and the segmentation results were compared with ground-truth images, giving recall, precision, and F-measure values of 96.65%, 87.77%, and 91.89%, respectively.
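As an illustration of how a segmentation result can be scored against a ground-truth image, the sketch below computes recall, precision, and F-measure for a pair of binary masks. It assumes a pixel-wise comparison, which the abstract does not confirm; the function name `segmentation_scores` and the toy masks are hypothetical and are not taken from the paper.

```python
def segmentation_scores(predicted, ground_truth):
    """Return (recall, precision, f_measure) for two binary masks.

    Both masks are equally sized lists of rows containing 0 (background)
    or 1 (text). Pixel-wise scoring is an assumption made for this sketch.
    """
    tp = fp = fn = 0
    for pred_row, truth_row in zip(predicted, ground_truth):
        for p, t in zip(pred_row, truth_row):
            if p and t:
                tp += 1      # text pixel correctly marked as text
            elif p and not t:
                fp += 1      # background pixel wrongly marked as text
            elif t and not p:
                fn += 1      # text pixel that was missed

    recall = tp / (tp + fn) if (tp + fn) else 0.0
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    denom = recall + precision
    # F-measure is the harmonic mean of recall and precision.
    f_measure = 2 * recall * precision / denom if denom else 0.0
    return recall, precision, f_measure


# Hypothetical 2x3 masks: one background pixel is wrongly marked as text.
predicted = [[1, 1, 0],
             [0, 1, 1]]
ground_truth = [[1, 1, 0],
                [0, 0, 1]]
recall, precision, f_measure = segmentation_scores(predicted, ground_truth)
```

The harmonic mean of the reported recall (96.65%) and precision (87.77%) is about 92.0%, slightly above the reported F-measure of 91.89%. One plausible explanation, which is an inference and not something stated in the abstract, is that the three figures are averages of per-image scores over the 16 test images.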
Cite this article
Paul, S., Saha, S., Basu, S. et al. Text localization in camera captured images using fuzzy distance transform based adaptive stroke filter. Multimed Tools Appl 78, 18017–18036 (2019). https://doi.org/10.1007/s11042-019-7178-3
DOI: https://doi.org/10.1007/s11042-019-7178-3