Skip to main content

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 8294))

Included in the following conference series:

  • 2905 Accesses

Abstract

In this paper, we present a scene text extraction approach which can realize text localization and segmentation simultaneously. Two popular paradigms (machine learning method and rule-based method) are combined to achieve competitive performance. For a given image, a sliding window is used to detect scene text. The texture feature Local Binary Pattern is extracted to represent the content of each window, and an unbalanced SVM classifier is designed to identify candidate text regions. Then, candidate text windows are further verified using color contrast and binarized by an adaptive local thresholding computation to get candidate text connected components. Further, non-text ones among them are removed utilizing some empirical rules. Finally, text connected components are linked into text lines according to their spatial relationships and appearance similarities. The evaluation results on two challenging and standard datasets ICDAR 2003 and ICDAR 2011 demonstrate that the proposed approach can effectively detect and segment scene text with different sizes, fonts, colors and arrangement directions.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Park, S.H., Kim, K.I., Jung, K., Kim, H.J.: Locating car license plates using neural networks. Electronics Letters 35(17), 1475–1477 (1999)

    Article  Google Scholar 

  2. Liu, C.-L., Koga, M., Fujisawa, H.: Lexicon-driven segmentation and recognition of handwritten character strings for japanese address reading. IEEE Transactions on Pattern Analisis and Machine Intelligence 24(11), 1425–1437 (2012)

    Google Scholar 

  3. Ezaki, N., Bulacu, M., Schomaker, L.: Text detection from natural scene images: towards a system for visually impaired persons. In: Proc. of 17th International Conference on Pattern Recognition, Cambridge, England, UK, August 23-26 (2004)

    Google Scholar 

  4. Liu, Q., Jung, C., Kim, S., Moon, Y., Kim, J.: Stroke filter for text localization in video images. In: Proc. of IEEE International Conference on Image Processing, Atlanta, GA, USA, October 8-11, pp. 1473–1476 (2006)

    Google Scholar 

  5. Li, Y., Lu, H.: Scene text detection via stroke width. In: Proc. of 21st International Conference on Pattern Recognition, Tsukuba, Japan, November 11-15, pp. 681–684 (2012)

    Google Scholar 

  6. Chen, X., Yang, J., Zhang, J., Waibel, A.: Automatic detection and recognition of signs from natural scenes. IEEE Transactions on Image Processing 13(1), 87–99 (2004)

    Article  Google Scholar 

  7. Chen, X., Yuille, A.L.: Detecting and reading text in natural scenes. In: Proc. of Computer Vision and Pattern Recognition, Washington, DC, USA, June 27-July 2, pp. 366–373 (2004)

    Google Scholar 

  8. Mancas-Thillou, C., Gosselin, B.: Spatial and color spaces combination for natural scene text extraction. In: Proc. of the 13th International Conference on Image Proceedings, Atlanta, GA, October 8-11, pp. 985–988 (2006)

    Google Scholar 

  9. Neumann, L., Matas, J.: A method for text localization and recognition in real-world images. In: Proc. of Asian Conference on Computer Vision, New Zealand, November 8-12, pp. 30–35 (2010)

    Google Scholar 

  10. Carpenter, B., Case, C., Satheesh, S., Suresh, B., Wang, T., Wu, D.J., Ng, A.Y.: Text detection and character recognition in scene images with unsupervised feature learning. In: Proc. of the 11th International Conference on Document Analysis and Recognition, Beijing, China, September 18-21, pp. 440–445 (2011)

    Google Scholar 

  11. Epshtein, B., Ofek, E., Wexler, Y.: Detecting text in natural scenes with stroke width transform. In: Proc. of the 23rd IEEE Conference on Computer Vision and Pattern Recognition, San Francisco, CA, June 13-18, pp. 2963–2970 (2010)

    Google Scholar 

  12. Neumann, L., Matas, J.: Text localization in real-world images using efficiently pruned exhaustive search. In: Proc. of the 11th International Conference on Document Analysis and Recognition, Beijing, China, September 18-21, pp. 687–691 (2011)

    Google Scholar 

  13. Lucas, S.M.: Icdar 2005 text locating competition results. In: Proc. of the 8th International Conference on Document Analysis and Recognition, Seoul, Korea, August 29-September 1, pp. 80–84 (2005)

    Google Scholar 

  14. Lucas, S.M., Panaretos, A., Sosa, L., Tang, A., Wong, S., Young, R.: ICDAR 2003 robust reading competitions. In: Proc. of the 7th International Conference on Document Analysis and Recognition, Edinburgh, Scotland, UK, August 3-6, 2003, pp. 682–687 (2003)

    Google Scholar 

  15. Neumann, L., Matas, J.: Real-time scene text localization and recognition. In: Proc. of the 25th IEEE Conference on Computer Vision and Pattern Recognition, Providence, Rhode Island, June 16-21, pp. 3538–3545 (2012)

    Google Scholar 

  16. Shahab, A., Shafait, F., Dengel, A.: ICDAR 2011 robust reading competition challenge 2: Reading text in scene images. In: Proc. of the 11th International Conference on Document Analysis and Recognition, pp. 1491–1496 (2011)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer International Publishing Switzerland

About this paper

Cite this paper

Liu, X., Wang, W. (2013). Localize and Segment Scene Text. In: Huet, B., Ngo, CW., Tang, J., Zhou, ZH., Hauptmann, A.G., Yan, S. (eds) Advances in Multimedia Information Processing – PCM 2013. PCM 2013. Lecture Notes in Computer Science, vol 8294. Springer, Cham. https://doi.org/10.1007/978-3-319-03731-8_6

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-03731-8_6

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-03730-1

  • Online ISBN: 978-3-319-03731-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics