Abstract
Recognizing text in scene images plays a vital role especially for applications dealing with environmental interactions. For the system to recognize the environment, textual regions present in them hold a great source of information. But the text recognition task in scene images is complicated due to various unavoidable clutter and distortion in the scene images. Font styling in scene images is also not regulated, and hence, there is a lot of touching between fonts as well. Prior to the recognition of text present in the scene image, identification of correct textual regions and extracting textual edges pose as tedious tasks. Inclusion of unwanted edge features in the task will deteriorate the accuracy of the model. In this work, various methodologies which have been proposed for the identification of textual regions, extraction of textual edges, and recognition of text in scenes have been reviewed. Also, a simple implementation of the same has been done using deep learning classifiers.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
X. Wang, Y. Song, Y. Zhang, J. Xin, Natural scene text detection with multi-layer segmentation and higher order conditional random field-based analysis. Pattern Recognit. Lett. 60–61, 41–47 (2015)
Y. Wei, Z. Zhang, W. Shen, D. Zeng, M. Fang, S. Zhou, Text detection in scene images based on exhaustive segmentation. Signal Process. Image Commun. 50, 1–8 (2017)
A. Sain, A.K. Bhunia, P.P. Roy, U. Pal, Multi-oriented text detection and verification in video frames and scene images. Neurocomputing 275, 1531–1549 (2018)
Y. Zheng, Q. Li, J. Liu, H. Liu, G. Li, S. Zhang, A cascaded method for text detection in natural scene images. Neurocomputing 238, 307–315 (2017)
G.J. Ansari, J.H. Shah, M. Yasmin, M. Sharif, S.L. Fernandes, A novel machine learning approach for scene text extraction. Futur. Gener. Comput. Syst. 87, 328–340 (2018)
X. Zhang, X. Gao, C. Tian, Text detection in natural scene images based on color prior guided MSER. Neurocomputing 307, 61–71 (2018)
B. Su, S. Lu, Accurate recognition of words in scenes without character segmentation using recurrent neural network. Pattern Recognit. 63(June 2016), 397–405 (2017)
V. Khare, P. Shivakumara, P. Raveendran, M. Blumenstein, A blind deconvolution model for scene text detection and recognition in video. Pattern Recognit. 54, 128–148 (2016)
C. Yu, Y. Song, Y. Zhang, Scene text localization using edge analysis and feature pool. Neurocomputing 175, 652–661 (2016)
J.H. Seok, J.H. Kim, Scene text recognition using a Hough forest implicit shape model and semi-Markov conditional random fields. Pattern Recognit. 48, 3584–3599 (2015)
M. Šarić, Scene text segmentation using low variation extremal regions and sorting based character grouping. Neurocomputing 266, 56–65 (2017)
S. Dey et al., Script independent approach for multi-oriented text detection in scene image. Neurocomputing 242, 96–112 (2017)
L.M. Francis, N. Sreenath, TEDLESS—text detection using least-square SVM from natural scene. J. King Saud Univ.—Comput. Inf. Sci. (2017). https://doi.org/10.1016/j.jksuci.2017.09.001
C. Shi, C. Wang, B. Xiao, S. Gao, J. Hu, Author’ s accepted manuscript end-to-end scene text recognition using tree-structured models. Pattern Recognit. 47, 2853–2866 (2014)
A. Mishra, K. Alahari, C.V. Jawahar, Enhancing energy minimization framework for scene text recognition with top-down cues. Comput. Vis. Image Underst. 145, 30–42 (2016)
K. Fan, S.J. Baek, A robust proposal generation method for text lines in natural scene images. Neurocomputing 304, 47–63 (2018)
L. Sun, Q. Huo, W. Jia, K. Chen, A robust approach for text detection from natural scene images. Pattern Recognit. 48, 2906–2920 (2015)
Wahyono, K. Jo, LED Dot matrix text recognition method in natural scene. Neurocomputing 151, 1033–1041 (2015)
C. Merino-Gracia, M. Mirmehdi, J. Sigut, J.L. González-Mora, Fast perspective recovery of text in natural scenes. Image Vis. Comput. 31(10), 714–724 (2013)
D. Bazazian, R. Gómez, A. Nicolaou, L. Gómez, FAST: facilitated and accurate scene text proposals through FCN. Pattern Recognit. Lett. 0, 1–9 (2017)
S. Roy, P. Shivakumara, N. Jain, V. Khare, A. Dutta, U. Pal, T. Lu, Rough-fuzzy based scene categorization for text detection and recognition in video. Pattern Recognit. 80, 64–82 (2018)
D. NguyenVan, S. Lu, S. Tian, N. Ouarti, M. Mokhtari, A pooling-based scene text proposal technique for scene text reading in the wild. Pattern Recognit. (2018)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Bhavesh Shri Kumar, N., Reddy, D.N.B.K.S., Sairam, K., Naren, J. (2020). Scene Text Recognition: A Preliminary Investigation on Various Techniques and Implementation Using Deep Learning Classifiers. In: Khanna, A., Gupta, D., Bhattacharyya, S., Snasel, V., Platos, J., Hassanien, A. (eds) International Conference on Innovative Computing and Communications. Advances in Intelligent Systems and Computing, vol 1087. Springer, Singapore. https://doi.org/10.1007/978-981-15-1286-5_20
Download citation
DOI: https://doi.org/10.1007/978-981-15-1286-5_20
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-1285-8
Online ISBN: 978-981-15-1286-5
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)