Skip to main content

Text Localization Based on Fast Feature Pyramids and Multi-Resolution Maximally Stable Extremal Regions

  • Conference paper
  • First Online:
Computer Vision - ACCV 2014 Workshops (ACCV 2014)

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 9009))

Included in the following conference series:

Abstract

Text localization from scene images is a challenging task that finds application in many areas. In this work, we propose a novel hybrid text localization approach that exploits Multi-resolution Maximally Stable Extremal Regions to discard false-positive detections from the text confidence maps generated by a Fast Feature Pyramid based sliding window classifier. The use of a multi-scale approach during both feature computation and connected component extraction allows our method to identify uncommon text elements that are usually not detected by competing algorithms, while the adoption of approximated features and appropriately filtered connected components assures a low overall computational complexity of the proposed system.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

Notes

  1. 1.

    http://dag.cvc.uab.es/icdar2013competition/?ch=2&com=results Method: iwrr2014.

  2. 2.

    http://vision.ucsd.edu/~pdollar/toolbox/doc/index.html.

References

  1. Pan, Y.F., Hou, X., Liu, C.L.: Text localization in natural scene images based on conditional random field. In: Proceedings of the ICDAR (2009)

    Google Scholar 

  2. Coates, A., Carpenter, B., Case, C., Satheesh, S., Suresh, B., Wang, T., Wu, D.J., Ng, A.Y.: Text detection and character recognition in scene images with unsupervised feature learning. In: Proceedings of the ICDAR (2011)

    Google Scholar 

  3. Mishra, A., Alahari, K., Jawahar, C.: Scene text recognition using higher order language priors. In: Proceedings of the BVMC (2012)

    Google Scholar 

  4. Wang, K., Babenko, B., Belongie, S.: End-to-end scene text recognition. In: Proceedings of the ICCV (2011)

    Google Scholar 

  5. Koo, H.I., Kim, D.H.: Scene text detection via connected component clustering and non-text filtering. IEEE Trans. IP 22, 2296–2305 (2013)

    MathSciNet  Google Scholar 

  6. Li, Y., Jia, W., Shen, C., Hengel, A.: Characterness: an indicator of text in the wild. IEEE Trans. IP 23, 1666–1677 (2014)

    Google Scholar 

  7. Neumann, L., Matas, J.: A method for text localization and recognition in real-world images. In: Kimmel, R., Klette, R., Sugimoto, A. (eds.) ACCV 2010, Part III. LNCS, vol. 6494, pp. 770–783. Springer, Heidelberg (2011)

    Chapter  Google Scholar 

  8. Shi, C., Wang, C., Xiao, B., Zhang, Y., Gao, S.: Scene text detection using graph model built upon maximally stable extremal regions. Pattern Recogn. Lett. 34, 107–116 (2013)

    Article  Google Scholar 

  9. Yin, X.C., Yin, X., Huang, K.: Robust text detection in natural scene images. IEEE Trans. PAMI 36, 970–983 (2013)

    MathSciNet  Google Scholar 

  10. Epshtein, B., Ofek, E., Wexler, Y.: Detecting text in natural scenes with stroke width transform. In: Proceedings of the CVPR (2010)

    Google Scholar 

  11. Matas, J., Chum, O., Urban, M., Pajdla, T.: Robust wide baseline stereo from maximally stable extremal regions. In: Proceedings of the BMVC (2002)

    Google Scholar 

  12. Karatzas, D., Shafait, F., Uchida, S., Iwamura, M., Bigorda, L., Mestre, S., Mas, J., Mota, D., Almaz, J., Heras, L.: ICDAR 2013 robust reading competition. In: Proceedings of the ICDAR (2013)

    Google Scholar 

  13. Dollár, P., Appel, R., Belongie, S., Perona, P.: Fast feature pyramids for object detection. IEEE Trans. PAMI 36, 1532–1545 (2014)

    Article  Google Scholar 

  14. Forssén, P.E., Lowe, D.G.: Shape descriptors for maximally stable extremal regions. In: Proceedings of the ICCV (2007)

    Google Scholar 

  15. Crimisi, A.: Microsoft Research Cambridge Object Recognition Image Database (2004)

    Google Scholar 

  16. Yao, C., Bai, X., Liu, W., Ma, Y.: Detecting texts of arbitrary orientations in natural images. In: Proceedings of the CVPR (2010)

    Google Scholar 

  17. Neumann, L., Matas, J.: Real-time scene text localization and recognition. In: Proceedings of the CVPR (2012)

    Google Scholar 

  18. Mathias, M., Timofte, R., Benenson, R., Gool, L.V.: Traffic sign recognition: how far are we from the solution? In: Proceedings of the IJCNN (2013)

    Google Scholar 

  19. Benenson, R., Mathias, M., Tuytelaars, T., Gool, L.V.: Seeking the strongest rigid detector. In: Proceedings of the CVPR (2013)

    Google Scholar 

  20. Appeal, R., Fuchs, T., Dollár, P., Perona, P.: Quickly boosting decision trees pruning underachieving features early. In: Proceedings of the ICML (2013)

    Google Scholar 

  21. Villamizar, M., Andrade-Cetto, J., Sanfeliu, A., Moreno-Noguer, F.: Bootstrapping boosted random ferns for discriminative and efficient object classification. Pattern Recogn. 45, 3141–3153 (2012)

    Article  Google Scholar 

  22. de Campos, T.E., Babu, B.R., Varma, M.: Character recognition in natural images. In: Proceedings of the VISAPP (2009)

    Google Scholar 

  23. Alexe, B., Deselaers, T., Ferrari, V.: What is an object? In: Proceedings of the CVPR (2010)

    Google Scholar 

  24. Manen, S., Guillaumin, M., Gool, L.V.: Prime object proposals with randomized prims algorithm. In: Proceedings of the ICCV (2013)

    Google Scholar 

  25. Yi, C., Tian, Y.: Localizing text in scene images by boundary clustering, stroke segmentation, and string fragment classification. IEEE Trans. IP 21, 4256–4268 (2012)

    MathSciNet  Google Scholar 

  26. Neumann, L., Matas, J.: On combining multiple segmentations in scene text recognition. In: Proceedings of the ICDAR (2013)

    Google Scholar 

  27. Bai, B., Yin, F., Liu, C.L.: Scene text localization using gradient local correlation. In: Proceedings of the ICDAR (2013)

    Google Scholar 

  28. Lucas, S.M., Panaretos, A., Sosa, L., Tang, A., Wong, S., Young, R.: ICDAR 2003 robust reading competition. In: Proceedings of the ICDAR (2003)

    Google Scholar 

  29. Wolf, C., Jolion, J.M.: Object count/area graphs for the evaluation of object detection and segmentation algorithms. IJDAR 8, 280–296 (2006)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Alessandro Zamberletti .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Zamberletti, A., Noce, L., Gallo, I. (2015). Text Localization Based on Fast Feature Pyramids and Multi-Resolution Maximally Stable Extremal Regions. In: Jawahar, C., Shan, S. (eds) Computer Vision - ACCV 2014 Workshops. ACCV 2014. Lecture Notes in Computer Science(), vol 9009. Springer, Cham. https://doi.org/10.1007/978-3-319-16631-5_7

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-16631-5_7

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-16630-8

  • Online ISBN: 978-3-319-16631-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics