Abstract
It is estimated that 285 million people globally are visually impaired. A majority of these people live in developing countries and are among the elderly population. Reading is essential in daily life for everyone. Visually impaired persons can read only by use of special scripts specially designed for them such as Braille language. Further, only trained people can read and understand. Since every product does not provide the product information on product cover in Braille, the present work proposes an assistive text reading framework to help visually impaired persons to read texts from various products/objects in their daily lives. The first step in implementation captures the image of the required by extracting frames from real-time video input from the camera. This is followed by preprocessing steps which includes conversion to grey scale and filtering. The text regions are further extracted using MSER followed by canny edge detection. The text regions from the captured image are then extracted and recognized by using Optical Character Recognition software (OCR). The OCR engine Tesseract is used here. This extracts the text of various fonts and then sizes can be recognized individually and then combined to form a word. Further, producing audio output by using Text to Speech module. The result obtained is very much comparable with other existing methods with better time efficiency. The real-time input is taken and passed through the algorithm which applies filters and removes noise then later image is passed through MSER, OCR, Canny edge detection to get the final audio output.
Keywords
- Maximally stable extremal regions
- Optical character recognition
- Canny edge detection
- Real-time visual aid
This is a preview of subscription content, access via your institution.
Buying options











References
Strotthe T et al (1997) Mobility of blind and elderly people interacting with computers. National Institute for the Blind, report on the MOBIC project. http://www.tiresias.org/reports/mobicf.htm
Real Time Text Detection and Recognition on Hand Held Objects to Assist Blind People. In: 2016 international conference on automatic control and dynamic optimization techniques (ICACDOT), International Institute of Information Technology (I2IT), Pune
Venkateswarlu K, Velaga SM. Text detection on scene images using MSER
Islam MR, Mondal C, Azam MK, Syed A, Islam MJ Text detection and recognition using enhanced MSER detection and a novel OCR technique
Gómez L, Karatzas D. MSER-based real-time text detection and tracking
Kim KI, Jung K, Kim JH (2003) Texture-based approach for text detection in images using support vector machines and continuously adaptive mean shift algorithm. IEEE Trans Pattern Anal Mach Intelligence 25(12):1631–1639
Koo HI, Kim DH (2013) Scene text detection via connected component clustering and nontext filtering. IEEE Trans Image Process 22(6):2296–2305
Srivastav A, Kumar J (2008) Text detection in scene images using stroke width and nearest-neighbor constraints. In: TENCON IEEE region 10 conference, pp 1–5
Zhou G, Liu Y, Tian Z, Su Y (2011) A new hybrid method to detect text in natural scene. In: 18th IEEE international conference on image processing (ICIP), pp 2605–2608
Gómez L, Karatzas D (2014) MSER-based real-time text detection and tracking. In: 22nd international conference on pattern recognition (ICPR), pp 3110–3115
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
RajithKumar, B.K., Mohana, H.S., Jamakhandi, D.A., Akshatha, K.V., Hegde, D.B., Singh, A. (2019). Real-Time Input Text Recognition System for the Aid of Visually Impaired. In: Pandian, D., Fernando, X., Baig, Z., Shi, F. (eds) Proceedings of the International Conference on ISMAC in Computational Vision and Bio-Engineering 2018 (ISMAC-CVB). ISMAC 2018. Lecture Notes in Computational Vision and Biomechanics, vol 30. Springer, Cham. https://doi.org/10.1007/978-3-030-00665-5_16
Download citation
DOI: https://doi.org/10.1007/978-3-030-00665-5_16
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-00664-8
Online ISBN: 978-3-030-00665-5
eBook Packages: EngineeringEngineering (R0)