Text localization in camera captured images using fuzzy distance transform based adaptive stroke filter

  • Shauvik Paul
  • Satadal SahaEmail author
  • Subhadip Basu
  • Punam Kumar Saha
  • Mita Nasipuri


Localization of text from camera captured images with complex background is now-a-days a growing demand of modern IT enable service. Most of the current text localization techniques are sensitive to text features like color, size, style and also to the background clutter. Among all the methods proposed in different literatures, Stroke Filter is much more effective in localization of text. The effectiveness of traditional stroke filter is limited because of its fixed width and is capable of segmenting strokes/texts of predefined range of width. The proposed method uses Fuzzy Distance Transform based adaptive stroke filter which can effectively localize text regions from camera captured images with complex background. The method is applied by experiment on a database containing 600 images and the visual response of text segmentation is quite impressive. To get the accuracy of the proposed method, it is applied on a set of 16 test images and the segmentation result is compared with the ground truth images resulting in a recall, precision and f-measure values of 96.65%, 87.77% and 91.89% respectively.


Collision impact Fuzzy distance transform Quench pixel Skeletonization Stroke filter Text localization 



  1. 1.
    Anthimopoulos M, Gatos B, Pratikakis I (2010) A two-stage scheme for text detection in video images. Image Vis Comput 28(9):1413–1426CrossRefGoogle Scholar
  2. 2.
    Bai X, Shi B, Zhang C, Cai X, Qi L (2017) Text/non-text image classification in the wild with convolutional neural networks. Pattern Recogn 66:437–446CrossRefGoogle Scholar
  3. 3.
    Bezdek JC, Pal SK (1992) Fuzzy models for pattern recognition, vol. 267. IEEE press New YorkGoogle Scholar
  4. 4.
    Borgefors G (1986) Distance transformations in digital images. Comput Vis, Graph, Image Proc 34(3):344–371CrossRefGoogle Scholar
  5. 5.
    Bušta M, Neumann L, Matas J (2017) Deep textspotter: An end-to-end trainable scene text localization and recognition framework. in Computer Vision (ICCV), 2017 IEEE International Conference on 2223–2231Google Scholar
  6. 6.
    X. Chen and AL. Yuille, “Detecting and reading text in natural scenes,” in Computer Vision and Pattern Recognition, 2004. CVPR 2004. Proc 2004 IEEE Comput Soc Conf, 2004, vol. 2, p. II–366Google Scholar
  7. 7.
    Danielsson P-E (1980) Euclidean distance mapping. Comput Graph Image Proc 14(3):227–248CrossRefGoogle Scholar
  8. 8.
    Dimitrova N, Zhang H-J, Shahraray B, Sezan I, Huang T, Zakhor A (2002) Applications of video-content analysis and retrieval. Multi Media, IEEE 9(3):42–55CrossRefGoogle Scholar
  9. 9.
    Dutta IN, Chakraborty N, Mollah AF, Basu S, Sarkar R (2019) Multi-lingual Text Localization from Camera Captured Images Based on Foreground Homogenity Analysis, in Recent Developments in Machine Learning and Data Analytics, Springer, 149–158Google Scholar
  10. 10.
    Emmanouilidis C, Batsalas C, Papamarkos N (2009) Development and Evaluation of Text Localization Techniques Based on Structural Texture Features and Neural Classifiers, 2009 10th International Conference on Document Analysis and Recognition 1270–1274Google Scholar
  11. 11.
    Gavrila DM, Davis LS (1996) 3-D model-based tracking of humans in action: a multi-view approach. in IEEE Computer Society Conference on CVPR 73–80Google Scholar
  12. 12.
    Gómez L, Karatzas D (2017) Textproposals: a text-specific selective search algorithm for word spotting in the wild. Pattern Recogn 70:60–74CrossRefGoogle Scholar
  13. 13.
    Gonzales RC, Woods RE (2002) Digital Image Processing, vol. 6. Prentice HallGoogle Scholar
  14. 14.
    Jin D, Saha PK (2013) A new fuzzy skeletonization algorithm and its applications to medical imaging. in International Conference on Image Analysis and Processing 662–671Google Scholar
  15. 15.
    Jin D, Liu Y, Saha PK (2013) Application of fuzzy skeletonization ot quantitatively assess trabecular bone micro-architecture. in Engineering in Medicine and Biology Society (EMBC), 2013 35th Annual International Conference of the IEEE. 3682–3685Google Scholar
  16. 16.
    Jin D, Chen C, Saha PK (2015) Filtering non-significant quench points using collision impact in grassfire propagation. in International Conference on Image Analysis and Processing 432–443Google Scholar
  17. 17.
    K. Jung (2004) In Kim, and A. K Jain. Text information extraction in images and video: a survey. Pattern recognition 37(5):977–997Google Scholar
  18. 18.
    Jung C, Liu Q, Kim J (2008) A new approach for text segmentation using a stroke filter. Signal Process 88(7):1907–1916CrossRefGoogle Scholar
  19. 19.
    Jung C, Liu Q, Kim J (Jan. 2009) A stroke filter and its application to text localization. Pattern Recogn Lett 30(2):114–122CrossRefGoogle Scholar
  20. 20.
    Kaufmann A, Swanson DL (1975) Introduction to the theory of fuzzy subsets, vol. 1. Academic Press New YorkGoogle Scholar
  21. 21.
    Liang J, Doermann D, Li H (2005) Camera-based analysis of text and documents: a survey. IJDAR 7(2–3):84–104CrossRefGoogle Scholar
  22. 22.
    Liu Y, Nie L, Liu L, Rosenblum DS (2016) From action to activity: sensor-based activity recognition. Neurocomputing 181:108–115CrossRefGoogle Scholar
  23. 23.
    Lyu MR, Song J, Cai M (2005) A comprehensive method for multilingual video text detection, localization, and extraction. Circ Syst Video Technol, IEEE Trans 15(2):243–255CrossRefGoogle Scholar
  24. 24.
    Ma J, Shao W, Ye H, Wang L, Wang H, Zheng Y, Xue X (2018) Arbitrary-oriented scene text detection via rotation proposals, IEEE Transactions on MultimediaGoogle Scholar
  25. 25.
    Paul S, Saha S, Basu S, Nasipuri M (2015) Text Localization in Camera Captured Images Using Adaptive Stroke Filter. in Information Systems Design and Intelligent Applications, J. Mandal, S. Satpathy, M. Sanyal, P. Sarkar, and A. Mukhopadhyay, Eds. Springer, 217–225Google Scholar
  26. 26.
    Rong X, Yi C, Tian Y (2017) Unambiguous text localization and retrieval for cluttered scenes, in Computer Vision and Pattern Recognition (CVPR), 2017 IEEE Conference on, 3279–3287Google Scholar
  27. 27.
    Rosenfeld A, Pfaltz JL (1968) Distance functions on digital pictures. Pattern Recogn 1(1):33–61MathSciNetCrossRefGoogle Scholar
  28. 28.
    Saha PK, Wehrli FW, Gomberg BR (2002) Fuzzy distance transform: theory, algorithms, and applications. Comput Vis Image Underst 86(3):171–190CrossRefGoogle Scholar
  29. 29.
    Saha S, Basu S, Nasipuri M, Basu DK (2009) Development of an automated Red Light Violation Detection System ( RLVDS ) for Indian vehicles, in National Conference on Computing and Communication Systems (COCOSYS-09). 59–64Google Scholar
  30. 30.
    Saha S, Basu S, Nasipuri M, Basu DK (2009) License plate localization from vehicle images: an edge based multi-stage approach. Int J Recent Trends Eng (Comput Sci) 1(1):284–288Google Scholar
  31. 31.
    Saha S, Basu S, Nasipuri M, Basu DK (2011) Localization of license plates from Indian vehicle images using iterative edge map generation technique. J Comput 3(6):48–57Google Scholar
  32. 32.
    S. Saha, S. Basu, and M. Nasipuri (2012) License Plate Localization Using Vertical Edge Map and Hough Transform Based Technique,” in Proceedings of the International Conference on Information Systems Design and Intelligent Applications (INDIA 2012) held in Visakhapatnam, India, pp. 649–656Google Scholar
  33. 33.
    S. Saha, S. Basu, and M. Nasipuri (2013) Development of a Stop-Line Violation Detection System for Indian Vehicles, in Handbook of Research on Computational Intelligence for Engineering, Science and Business, S. Bhattacharyya and P. Dutta, Eds. IGI Global. 200–227Google Scholar
  34. 34.
    Saha S, Basu S, Nasipuri M (2014) iLPR: an Indian license plate recognition system. Multimed Tools Appl 1–36Google Scholar
  35. 35.
    Saha PK, Borgefors G, di Baja GS (2016) A survey on skeletonization algorithms and their applications. Pattern Recogn Lett 76:3–12CrossRefGoogle Scholar
  36. 36.
    Shi C, Wang C, Xiao B, Zhang Y, Gao S (2013) Scene text detection using graph model built upon maximally stable extremal regions. Pattern Recogn Lett 34(2):107–116CrossRefGoogle Scholar
  37. 37.
    Shi B, Bai X, Belongie S (2017) Detecting oriented text in natural images by linking segments, arXiv preprint arXiv:1703.06520Google Scholar
  38. 38.
    Shi B, Bai X, Yao C (2017) An end-to-end trainable neural network for image-based sequence recognition and its application to scene text recognition. IEEE Trans Pattern Anal Mach Intell 39(11):2298–2304CrossRefGoogle Scholar
  39. 39.
    Subramanian K, Natarajan P, Decerbo M, Castañòn D (2007) Character-stroke detection for text-localization and extraction, in Document Analysis and Recognition, 2007. ICDAR 2007. Ninth Int Conf 1:33–37Google Scholar
  40. 40.
    Wei Y, Zhang Z, Shen W, Zeng D, Fang M, Zhou S (2017) Text detection in scene images based on exhaustive segmentation. Signal Process Image Commun 50:1–8CrossRefGoogle Scholar
  41. 41.
    Ye Q, Doermann D (2015) Text detection and recognition in imagery: a survey. IEEE Trans Pattern Anal Mach Intell 37(7):1480–1500CrossRefGoogle Scholar
  42. 42.
    Zadeh LA (1978) Fuzzy sets as a basis for a theory of possibility. Fuzzy Sets Syst 1(1):3–28MathSciNetCrossRefGoogle Scholar
  43. 43.
    Zhou X, Yao C, Wen H, Wang Y, Zhou S, He W, Liang J (2017) EAST: an efficient and accurate scene text detector. in Proc. CVPR 2642–2651Google Scholar

Copyright information

© Springer Science+Business Media, LLC, part of Springer Nature 2019

Authors and Affiliations

  • Shauvik Paul
    • 1
  • Satadal Saha
    • 2
    Email author
  • Subhadip Basu
    • 3
  • Punam Kumar Saha
    • 4
  • Mita Nasipuri
    • 3
  1. 1.MCA DepartmentTechno IndiaSalt LakeIndia
  2. 2.ECE DepartmentMCKV Institute of EngineeringHowrahIndia
  3. 3.CSE DepartmentJadavpur UniversityKolkataIndia
  4. 4.Department of RadiologyUniversity of IowaIowa CityUSA

Personalised recommendations