Scene Text Segmentation Method Based on MSER and MLBP

Guo, Miaomiao; Yi, Yaohua; Liu, Juhua; Li, Ying

doi:10.1007/978-981-10-3530-2_38

Miaomiao Guo⁶,
Yaohua Yi⁶,
Juhua Liu⁶ &
…
Ying Li⁶

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 417))

Included in the following conference series:

China Academic Conference on Printing & Packaging and Media Technology

2550 Accesses
1 Citations

Abstract

An effective algorithm for segmentation of scene text based on maximally stable extremal regions (MSER) and MLBP (Multiple Local Binary Patterns) is proposed to overcome the interference of uneven illumination and clutter background to scene text segmentation. Firstly, MSER algorithm is used to extract character candidates. Secondly, in the process of character classification, character candidates represented by the effective texture feature MLBP are verified using an AdaBoost trained classifier. Then, we use some heuristic rules to carry on character refinement. The final text segmentation output is obtained by combining the results from the R, G, B color channels in two polarities (bright text on dark background and dark text on bright background). The proposed method is evaluated on the ICDAR_2013 datasets and experiments show that it performs well and can achieve good segmentation results especially in case of uneven light and complex background.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 259.00; Price excludes VAT (USA)

Softcover Book: USD 329.99; Price excludes VAT (USA)

Hardcover Book: USD 329.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Sharma N, Pal U, Blumenstein M. Recent advances in video based document processing: A review [C]. Proc of the 10th IAPR Int Workshop on Document Analysis Systems. Queensland: IEEE Press, 2012: 63–68.
Google Scholar
Zhang H G, Zhao K L, Song Y Z, et al. Text extraction from natural scene image: A survey [J]. Neurocomputing, 2013, 122: 310–323.
Google Scholar
D. Kumar, M. Prasad, and A. Ramakrishnan, “ Benchmarking recognition results on camera captured word image data sets,” in Proceeding of the workshop on Document Analysis and Recognition, 2012, pp. 100–107.
Google Scholar
J. Matas, O. Chum, M. Urban, and T. Pajdla, “Robust wide-baselin stereo from maximally stable extremal regions,” Image and Vision Computing, vol. 22, no. 10, pp. 761–767, 2004.
Google Scholar
Nist’er, D., Stew’ enius, H. Scalable recognition with a vocabulary tree [C]// IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2006, 2: 2161–2168.
Google Scholar
Obdrzalek, S., Matas, J. Object recognition using local affine frames on distinguished regions [C] // British Machine Vision Conference, 2002, 1:113–122.
Google Scholar
Donoser, M., Bischof, H. Efficient maximally stable extremal region (mser) tracking [C] // IEEE Conference on Computer Vision and Pattern Recogniton (CVPR), 2006: 553–560.
Google Scholar
G. Bai, Y. Zhu, and Z. Ding, “A hierarchical face recognition method based on local binary pattern,” Proc. Congr. Image Signal Process, pp. 610–614, 2008.
Google Scholar
Mikolajczyk K, Tuytelaars T, Schmid C, et al. A comparison of affine region detectors [J]. International Journal of Computer Vision, 2005, 65 (1–2): 43–72.
Google Scholar
D. Karatzas, L. Gomez-Bigorda, A. Nicolaou, S. Ghosh, A. Bagdanov, M. Iwamura, J. Matas, L. Neumann, V. R. Chandrasekhar, S. Lu, et al. In Document Analysis and Recognition (ICDAR), 2015 13th International Conference on, pages 1156–1160. IEEE, 2015.
Google Scholar
S. M. Lucas, A. Panaretos, L. Sosa, A. Tang, S. Wong, and R. Yong, “Icdar 2003 robust reading competitions,” in Document Analysis and Recognition, 2003 International Conference on, 2003, pp. 682–687.
Google Scholar

Download references

Author information

Authors and Affiliations

School of Printing and Packaging, Wuhan University, Wuhan, China
Miaomiao Guo, Yaohua Yi, Juhua Liu & Ying Li

Authors

Miaomiao Guo
View author publications
You can also search for this author in PubMed Google Scholar
Yaohua Yi
View author publications
You can also search for this author in PubMed Google Scholar
Juhua Liu
View author publications
You can also search for this author in PubMed Google Scholar
Ying Li
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yaohua Yi .

Editor information

Editors and Affiliations

China Academy of Printing Technology, Beijing, China
Pengfei Zhao
China Academy of Printing Technology, Beijing, China
Yun Ouyang
China Academy of Printing Technology, Beijing, China
Min Xu
China Academy of Printing Technology, Beijing, China
Li Yang
China Academy of Printing Technology, Beijing, China
Yujie Ouyang

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Guo, M., Yi, Y., Liu, J., Li, Y. (2017). Scene Text Segmentation Method Based on MSER and MLBP. In: Zhao, P., Ouyang, Y., Xu, M., Yang, L., Ouyang, Y. (eds) Advanced Graphic Communications and Media Technologies . PPMT 2016. Lecture Notes in Electrical Engineering, vol 417. Springer, Singapore. https://doi.org/10.1007/978-981-10-3530-2_38

Download citation

DOI: https://doi.org/10.1007/978-981-10-3530-2_38
Published: 22 March 2017
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-3529-6
Online ISBN: 978-981-10-3530-2
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics