Text Localization Based on Fast Feature Pyramids and Multi-Resolution Maximally Stable Extremal Regions

Zamberletti, Alessandro; Noce, Lucia; Gallo, Ignazio

doi:10.1007/978-3-319-16631-5_7

Alessandro Zamberletti¹⁵,
Lucia Noce¹⁵ &
Ignazio Gallo¹⁵

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 9009))

Included in the following conference series:

Asian Conference on Computer Vision

2160 Accesses
14 Citations

Abstract

Text localization from scene images is a challenging task that finds application in many areas. In this work, we propose a novel hybrid text localization approach that exploits Multi-resolution Maximally Stable Extremal Regions to discard false-positive detections from the text confidence maps generated by a Fast Feature Pyramid based sliding window classifier. The use of a multi-scale approach during both feature computation and connected component extraction allows our method to identify uncommon text elements that are usually not detected by competing algorithms, while the adoption of approximated features and appropriately filtered connected components assures a low overall computational complexity of the proposed system.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Fast and Accurate Text Detection in Natural Scene Images

Scene Text Detection Based on Text Probability and Pruning Algorithm

TextCatcher: a method to detect curved and challenging text in natural scenes

Article 11 March 2016

Notes

1.
http://dag.cvc.uab.es/icdar2013competition/?ch=2&com=results Method: iwrr2014.
2.
http://vision.ucsd.edu/~pdollar/toolbox/doc/index.html.

References

Pan, Y.F., Hou, X., Liu, C.L.: Text localization in natural scene images based on conditional random field. In: Proceedings of the ICDAR (2009)
Google Scholar
Coates, A., Carpenter, B., Case, C., Satheesh, S., Suresh, B., Wang, T., Wu, D.J., Ng, A.Y.: Text detection and character recognition in scene images with unsupervised feature learning. In: Proceedings of the ICDAR (2011)
Google Scholar
Mishra, A., Alahari, K., Jawahar, C.: Scene text recognition using higher order language priors. In: Proceedings of the BVMC (2012)
Google Scholar
Wang, K., Babenko, B., Belongie, S.: End-to-end scene text recognition. In: Proceedings of the ICCV (2011)
Google Scholar
Koo, H.I., Kim, D.H.: Scene text detection via connected component clustering and non-text filtering. IEEE Trans. IP 22, 2296–2305 (2013)
MathSciNet Google Scholar
Li, Y., Jia, W., Shen, C., Hengel, A.: Characterness: an indicator of text in the wild. IEEE Trans. IP 23, 1666–1677 (2014)
Google Scholar
Neumann, L., Matas, J.: A method for text localization and recognition in real-world images. In: Kimmel, R., Klette, R., Sugimoto, A. (eds.) ACCV 2010, Part III. LNCS, vol. 6494, pp. 770–783. Springer, Heidelberg (2011)
Chapter Google Scholar
Shi, C., Wang, C., Xiao, B., Zhang, Y., Gao, S.: Scene text detection using graph model built upon maximally stable extremal regions. Pattern Recogn. Lett. 34, 107–116 (2013)
Article Google Scholar
Yin, X.C., Yin, X., Huang, K.: Robust text detection in natural scene images. IEEE Trans. PAMI 36, 970–983 (2013)
MathSciNet Google Scholar
Epshtein, B., Ofek, E., Wexler, Y.: Detecting text in natural scenes with stroke width transform. In: Proceedings of the CVPR (2010)
Google Scholar
Matas, J., Chum, O., Urban, M., Pajdla, T.: Robust wide baseline stereo from maximally stable extremal regions. In: Proceedings of the BMVC (2002)
Google Scholar
Karatzas, D., Shafait, F., Uchida, S., Iwamura, M., Bigorda, L., Mestre, S., Mas, J., Mota, D., Almaz, J., Heras, L.: ICDAR 2013 robust reading competition. In: Proceedings of the ICDAR (2013)
Google Scholar
Dollár, P., Appel, R., Belongie, S., Perona, P.: Fast feature pyramids for object detection. IEEE Trans. PAMI 36, 1532–1545 (2014)
Article Google Scholar
Forssén, P.E., Lowe, D.G.: Shape descriptors for maximally stable extremal regions. In: Proceedings of the ICCV (2007)
Google Scholar
Crimisi, A.: Microsoft Research Cambridge Object Recognition Image Database (2004)
Google Scholar
Yao, C., Bai, X., Liu, W., Ma, Y.: Detecting texts of arbitrary orientations in natural images. In: Proceedings of the CVPR (2010)
Google Scholar
Neumann, L., Matas, J.: Real-time scene text localization and recognition. In: Proceedings of the CVPR (2012)
Google Scholar
Mathias, M., Timofte, R., Benenson, R., Gool, L.V.: Traffic sign recognition: how far are we from the solution? In: Proceedings of the IJCNN (2013)
Google Scholar
Benenson, R., Mathias, M., Tuytelaars, T., Gool, L.V.: Seeking the strongest rigid detector. In: Proceedings of the CVPR (2013)
Google Scholar
Appeal, R., Fuchs, T., Dollár, P., Perona, P.: Quickly boosting decision trees pruning underachieving features early. In: Proceedings of the ICML (2013)
Google Scholar
Villamizar, M., Andrade-Cetto, J., Sanfeliu, A., Moreno-Noguer, F.: Bootstrapping boosted random ferns for discriminative and efficient object classification. Pattern Recogn. 45, 3141–3153 (2012)
Article Google Scholar
de Campos, T.E., Babu, B.R., Varma, M.: Character recognition in natural images. In: Proceedings of the VISAPP (2009)
Google Scholar
Alexe, B., Deselaers, T., Ferrari, V.: What is an object? In: Proceedings of the CVPR (2010)
Google Scholar
Manen, S., Guillaumin, M., Gool, L.V.: Prime object proposals with randomized prims algorithm. In: Proceedings of the ICCV (2013)
Google Scholar
Yi, C., Tian, Y.: Localizing text in scene images by boundary clustering, stroke segmentation, and string fragment classification. IEEE Trans. IP 21, 4256–4268 (2012)
MathSciNet Google Scholar
Neumann, L., Matas, J.: On combining multiple segmentations in scene text recognition. In: Proceedings of the ICDAR (2013)
Google Scholar
Bai, B., Yin, F., Liu, C.L.: Scene text localization using gradient local correlation. In: Proceedings of the ICDAR (2013)
Google Scholar
Lucas, S.M., Panaretos, A., Sosa, L., Tang, A., Wong, S., Young, R.: ICDAR 2003 robust reading competition. In: Proceedings of the ICDAR (2003)
Google Scholar
Wolf, C., Jolion, J.M.: Object count/area graphs for the evaluation of object detection and segmentation algorithms. IJDAR 8, 280–296 (2006)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department of Theoretical and Applied Science, University of Insubria, Varese, Italy
Alessandro Zamberletti, Lucia Noce & Ignazio Gallo

Authors

Alessandro Zamberletti
View author publications
You can also search for this author in PubMed Google Scholar
Lucia Noce
View author publications
You can also search for this author in PubMed Google Scholar
Ignazio Gallo
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Alessandro Zamberletti .

Editor information

Editors and Affiliations

Center for Visual Information Technology, International Institute of Information Technology, Hyderabad, India
C.V. Jawahar
Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China
Shiguang Shan

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Zamberletti, A., Noce, L., Gallo, I. (2015). Text Localization Based on Fast Feature Pyramids and Multi-Resolution Maximally Stable Extremal Regions. In: Jawahar, C., Shan, S. (eds) Computer Vision - ACCV 2014 Workshops. ACCV 2014. Lecture Notes in Computer Science(), vol 9009. Springer, Cham. https://doi.org/10.1007/978-3-319-16631-5_7

Download citation

DOI: https://doi.org/10.1007/978-3-319-16631-5_7
Published: 11 April 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-16630-8
Online ISBN: 978-3-319-16631-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Text Localization Based on Fast Feature Pyramids and Multi-Resolution Maximally Stable Extremal Regions

Abstract

Access this chapter

Similar content being viewed by others

Fast and Accurate Text Detection in Natural Scene Images

Scene Text Detection Based on Text Probability and Pruning Algorithm

TextCatcher: a method to detect curved and challenging text in natural scenes

Notes

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Text Localization Based on Fast Feature Pyramids and Multi-Resolution Maximally Stable Extremal Regions

Abstract

Access this chapter

Similar content being viewed by others

Fast and Accurate Text Detection in Natural Scene Images

Scene Text Detection Based on Text Probability and Pruning Algorithm

TextCatcher: a method to detect curved and challenging text in natural scenes

Notes

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation