Abstract
Text localization from scene images is a challenging task that finds application in many areas. In this work, we propose a novel hybrid text localization approach that exploits Multi-resolution Maximally Stable Extremal Regions to discard false-positive detections from the text confidence maps generated by a Fast Feature Pyramid based sliding window classifier. The use of a multi-scale approach during both feature computation and connected component extraction allows our method to identify uncommon text elements that are usually not detected by competing algorithms, while the adoption of approximated features and appropriately filtered connected components assures a low overall computational complexity of the proposed system.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
References
Pan, Y.F., Hou, X., Liu, C.L.: Text localization in natural scene images based on conditional random field. In: Proceedings of the ICDAR (2009)
Coates, A., Carpenter, B., Case, C., Satheesh, S., Suresh, B., Wang, T., Wu, D.J., Ng, A.Y.: Text detection and character recognition in scene images with unsupervised feature learning. In: Proceedings of the ICDAR (2011)
Mishra, A., Alahari, K., Jawahar, C.: Scene text recognition using higher order language priors. In: Proceedings of the BVMC (2012)
Wang, K., Babenko, B., Belongie, S.: End-to-end scene text recognition. In: Proceedings of the ICCV (2011)
Koo, H.I., Kim, D.H.: Scene text detection via connected component clustering and non-text filtering. IEEE Trans. IP 22, 2296–2305 (2013)
Li, Y., Jia, W., Shen, C., Hengel, A.: Characterness: an indicator of text in the wild. IEEE Trans. IP 23, 1666–1677 (2014)
Neumann, L., Matas, J.: A method for text localization and recognition in real-world images. In: Kimmel, R., Klette, R., Sugimoto, A. (eds.) ACCV 2010, Part III. LNCS, vol. 6494, pp. 770–783. Springer, Heidelberg (2011)
Shi, C., Wang, C., Xiao, B., Zhang, Y., Gao, S.: Scene text detection using graph model built upon maximally stable extremal regions. Pattern Recogn. Lett. 34, 107–116 (2013)
Yin, X.C., Yin, X., Huang, K.: Robust text detection in natural scene images. IEEE Trans. PAMI 36, 970–983 (2013)
Epshtein, B., Ofek, E., Wexler, Y.: Detecting text in natural scenes with stroke width transform. In: Proceedings of the CVPR (2010)
Matas, J., Chum, O., Urban, M., Pajdla, T.: Robust wide baseline stereo from maximally stable extremal regions. In: Proceedings of the BMVC (2002)
Karatzas, D., Shafait, F., Uchida, S., Iwamura, M., Bigorda, L., Mestre, S., Mas, J., Mota, D., Almaz, J., Heras, L.: ICDAR 2013 robust reading competition. In: Proceedings of the ICDAR (2013)
Dollár, P., Appel, R., Belongie, S., Perona, P.: Fast feature pyramids for object detection. IEEE Trans. PAMI 36, 1532–1545 (2014)
Forssén, P.E., Lowe, D.G.: Shape descriptors for maximally stable extremal regions. In: Proceedings of the ICCV (2007)
Crimisi, A.: Microsoft Research Cambridge Object Recognition Image Database (2004)
Yao, C., Bai, X., Liu, W., Ma, Y.: Detecting texts of arbitrary orientations in natural images. In: Proceedings of the CVPR (2010)
Neumann, L., Matas, J.: Real-time scene text localization and recognition. In: Proceedings of the CVPR (2012)
Mathias, M., Timofte, R., Benenson, R., Gool, L.V.: Traffic sign recognition: how far are we from the solution? In: Proceedings of the IJCNN (2013)
Benenson, R., Mathias, M., Tuytelaars, T., Gool, L.V.: Seeking the strongest rigid detector. In: Proceedings of the CVPR (2013)
Appeal, R., Fuchs, T., Dollár, P., Perona, P.: Quickly boosting decision trees pruning underachieving features early. In: Proceedings of the ICML (2013)
Villamizar, M., Andrade-Cetto, J., Sanfeliu, A., Moreno-Noguer, F.: Bootstrapping boosted random ferns for discriminative and efficient object classification. Pattern Recogn. 45, 3141–3153 (2012)
de Campos, T.E., Babu, B.R., Varma, M.: Character recognition in natural images. In: Proceedings of the VISAPP (2009)
Alexe, B., Deselaers, T., Ferrari, V.: What is an object? In: Proceedings of the CVPR (2010)
Manen, S., Guillaumin, M., Gool, L.V.: Prime object proposals with randomized prims algorithm. In: Proceedings of the ICCV (2013)
Yi, C., Tian, Y.: Localizing text in scene images by boundary clustering, stroke segmentation, and string fragment classification. IEEE Trans. IP 21, 4256–4268 (2012)
Neumann, L., Matas, J.: On combining multiple segmentations in scene text recognition. In: Proceedings of the ICDAR (2013)
Bai, B., Yin, F., Liu, C.L.: Scene text localization using gradient local correlation. In: Proceedings of the ICDAR (2013)
Lucas, S.M., Panaretos, A., Sosa, L., Tang, A., Wong, S., Young, R.: ICDAR 2003 robust reading competition. In: Proceedings of the ICDAR (2003)
Wolf, C., Jolion, J.M.: Object count/area graphs for the evaluation of object detection and segmentation algorithms. IJDAR 8, 280–296 (2006)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Zamberletti, A., Noce, L., Gallo, I. (2015). Text Localization Based on Fast Feature Pyramids and Multi-Resolution Maximally Stable Extremal Regions. In: Jawahar, C., Shan, S. (eds) Computer Vision - ACCV 2014 Workshops. ACCV 2014. Lecture Notes in Computer Science(), vol 9009. Springer, Cham. https://doi.org/10.1007/978-3-319-16631-5_7
Download citation
DOI: https://doi.org/10.1007/978-3-319-16631-5_7
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-16630-8
Online ISBN: 978-3-319-16631-5
eBook Packages: Computer ScienceComputer Science (R0)