Scene Text Recognition: A Preliminary Investigation on Various Techniques and Implementation Using Deep Learning Classifiers

Bhavesh Shri Kumar, N.; Reddy, Dasi Naga Brahma Krishna Sumanth; Sairam, K.; Naren, J.

doi:10.1007/978-981-15-1286-5_20

Part of the book series: Advances in Intelligent Systems and Computing (AISC, volume 1087)

Abstract

Recognizing text in scene images plays a vital role, especially in applications that interact with their environment. Textual regions in a scene are a rich source of information for a system that must understand its surroundings. However, text recognition in scene images is complicated by unavoidable clutter and distortion. Font styling in scene images is also unregulated, so adjacent characters often touch one another. Before the text can be recognized, the correct textual regions must be identified and the textual edges extracted, and both are tedious tasks. Including unwanted edge features degrades the accuracy of the model. This work reviews methodologies that have been proposed for the identification of textual regions, the extraction of textual edges, and the recognition of text in scenes. It also presents a simple implementation of these steps using deep learning classifiers.

Author information

Authors and Affiliations

SASTRA Deemed University, Thanjavur, Tamil Nadu, 613401, India
N. Bhavesh Shri Kumar, Dasi Naga Brahma Krishna Sumanth Reddy, K. Sairam & J. Naren

Corresponding author

Correspondence to J. Naren.

Editor information

Editors and Affiliations

Department of Computer Science and Engineering, Maharaja Agrasen Institute of Technology, New Delhi, Delhi, India
Ashish Khanna
Department of Computer Science and Engineering, Maharaja Agrasen Institute of Technology, New Delhi, Delhi, India
Deepak Gupta
Department of Computer Science and Engineering, Christ University, Bangalore, India
Siddhartha Bhattacharyya
Department of Computer Science, VŠB—Technical University of Ostrava, Ostrava, Czech Republic
Vaclav Snasel
Department of Computer Science, VŠB—Technical University of Ostrava, Ostrava, Czech Republic
Jan Platos
Faculty of Computers and Information, Cairo University, Giza, Egypt
Aboul Ella Hassanien

Cite this paper

Bhavesh Shri Kumar, N., Reddy, D.N.B.K.S., Sairam, K., Naren, J. (2020). Scene Text Recognition: A Preliminary Investigation on Various Techniques and Implementation Using Deep Learning Classifiers. In: Khanna, A., Gupta, D., Bhattacharyya, S., Snasel, V., Platos, J., Hassanien, A. (eds) International Conference on Innovative Computing and Communications. Advances in Intelligent Systems and Computing, vol 1087. Springer, Singapore. https://doi.org/10.1007/978-981-15-1286-5_20

DOI: https://doi.org/10.1007/978-981-15-1286-5_20
Published: 29 February 2020
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-1285-8
Online ISBN: 978-981-15-1286-5
eBook Packages: Intelligent Technologies and Robotics, Intelligent Technologies and Robotics (R0)
