Localize and Segment Scene Text

Liu, Xiaoqian; Wang, Weiqiang

doi:10.1007/978-3-319-03731-8_6

Xiaoqian Liu²² &
Weiqiang Wang^22,23

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 8294))

Included in the following conference series:

Pacific-Rim Conference on Multimedia

2905 Accesses

Abstract

In this paper, we present a scene text extraction approach which can realize text localization and segmentation simultaneously. Two popular paradigms (machine learning method and rule-based method) are combined to achieve competitive performance. For a given image, a sliding window is used to detect scene text. The texture feature Local Binary Pattern is extracted to represent the content of each window, and an unbalanced SVM classifier is designed to identify candidate text regions. Then, candidate text windows are further verified using color contrast and binarized by an adaptive local thresholding computation to get candidate text connected components. Further, non-text ones among them are removed utilizing some empirical rules. Finally, text connected components are linked into text lines according to their spatial relationships and appearance similarities. The evaluation results on two challenging and standard datasets ICDAR 2003 and ICDAR 2011 demonstrate that the proposed approach can effectively detect and segment scene text with different sizes, fonts, colors and arrangement directions.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Park, S.H., Kim, K.I., Jung, K., Kim, H.J.: Locating car license plates using neural networks. Electronics Letters 35(17), 1475–1477 (1999)
Article Google Scholar
Liu, C.-L., Koga, M., Fujisawa, H.: Lexicon-driven segmentation and recognition of handwritten character strings for japanese address reading. IEEE Transactions on Pattern Analisis and Machine Intelligence 24(11), 1425–1437 (2012)
Google Scholar
Ezaki, N., Bulacu, M., Schomaker, L.: Text detection from natural scene images: towards a system for visually impaired persons. In: Proc. of 17th International Conference on Pattern Recognition, Cambridge, England, UK, August 23-26 (2004)
Google Scholar
Liu, Q., Jung, C., Kim, S., Moon, Y., Kim, J.: Stroke filter for text localization in video images. In: Proc. of IEEE International Conference on Image Processing, Atlanta, GA, USA, October 8-11, pp. 1473–1476 (2006)
Google Scholar
Li, Y., Lu, H.: Scene text detection via stroke width. In: Proc. of 21st International Conference on Pattern Recognition, Tsukuba, Japan, November 11-15, pp. 681–684 (2012)
Google Scholar
Chen, X., Yang, J., Zhang, J., Waibel, A.: Automatic detection and recognition of signs from natural scenes. IEEE Transactions on Image Processing 13(1), 87–99 (2004)
Article Google Scholar
Chen, X., Yuille, A.L.: Detecting and reading text in natural scenes. In: Proc. of Computer Vision and Pattern Recognition, Washington, DC, USA, June 27-July 2, pp. 366–373 (2004)
Google Scholar
Mancas-Thillou, C., Gosselin, B.: Spatial and color spaces combination for natural scene text extraction. In: Proc. of the 13th International Conference on Image Proceedings, Atlanta, GA, October 8-11, pp. 985–988 (2006)
Google Scholar
Neumann, L., Matas, J.: A method for text localization and recognition in real-world images. In: Proc. of Asian Conference on Computer Vision, New Zealand, November 8-12, pp. 30–35 (2010)
Google Scholar
Carpenter, B., Case, C., Satheesh, S., Suresh, B., Wang, T., Wu, D.J., Ng, A.Y.: Text detection and character recognition in scene images with unsupervised feature learning. In: Proc. of the 11th International Conference on Document Analysis and Recognition, Beijing, China, September 18-21, pp. 440–445 (2011)
Google Scholar
Epshtein, B., Ofek, E., Wexler, Y.: Detecting text in natural scenes with stroke width transform. In: Proc. of the 23rd IEEE Conference on Computer Vision and Pattern Recognition, San Francisco, CA, June 13-18, pp. 2963–2970 (2010)
Google Scholar
Neumann, L., Matas, J.: Text localization in real-world images using efficiently pruned exhaustive search. In: Proc. of the 11th International Conference on Document Analysis and Recognition, Beijing, China, September 18-21, pp. 687–691 (2011)
Google Scholar
Lucas, S.M.: Icdar 2005 text locating competition results. In: Proc. of the 8th International Conference on Document Analysis and Recognition, Seoul, Korea, August 29-September 1, pp. 80–84 (2005)
Google Scholar
Lucas, S.M., Panaretos, A., Sosa, L., Tang, A., Wong, S., Young, R.: ICDAR 2003 robust reading competitions. In: Proc. of the 7th International Conference on Document Analysis and Recognition, Edinburgh, Scotland, UK, August 3-6, 2003, pp. 682–687 (2003)
Google Scholar
Neumann, L., Matas, J.: Real-time scene text localization and recognition. In: Proc. of the 25th IEEE Conference on Computer Vision and Pattern Recognition, Providence, Rhode Island, June 16-21, pp. 3538–3545 (2012)
Google Scholar
Shahab, A., Shafait, F., Dengel, A.: ICDAR 2011 robust reading competition challenge 2: Reading text in scene images. In: Proc. of the 11th International Conference on Document Analysis and Recognition, pp. 1491–1496 (2011)
Google Scholar

Download references

Author information

Authors and Affiliations

School of Computer and Control Engineering, University of Chinese Academy of Sciences, Beijing, China
Xiaoqian Liu & Weiqiang Wang
Key Lab of Intell. Info. Process., Institute of Computing Technology, CAS, Beijing, China
Weiqiang Wang

Authors

Xiaoqian Liu
View author publications
You can also search for this author in PubMed Google Scholar
Weiqiang Wang
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

EURECOM, Multimedia Department, Sophia Antipolis, France
Benoit Huet
Department of Computer Science, City University of Hong Kong, Tat Chee Ave, Kowloon, Hong Kong
Chong-Wah Ngo
Nanjing University of Science and Technology, 210093, Nanjing, China
Jinhui Tang
Department of Computer Science and Technology, Nanjing University, Xianlin Avenue No. 163, 210023, Nanjing, China
Zhi-Hua Zhou
School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA
Alexander G. Hauptmann
Department of Electrical and Computer Engineering, National University of Singapore, 4 Engineering Drive 3, 117583, Singapore, Singapore
Shuicheng Yan

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Liu, X., Wang, W. (2013). Localize and Segment Scene Text. In: Huet, B., Ngo, CW., Tang, J., Zhou, ZH., Hauptmann, A.G., Yan, S. (eds) Advances in Multimedia Information Processing – PCM 2013. PCM 2013. Lecture Notes in Computer Science, vol 8294. Springer, Cham. https://doi.org/10.1007/978-3-319-03731-8_6

Download citation

DOI: https://doi.org/10.1007/978-3-319-03731-8_6
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-03730-1
Online ISBN: 978-3-319-03731-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics