Abstract
Text recognition at character/word level is one of the very important steps for development of automated systems for understanding low resolution display board images which facilitate several new applications such as blind assistants, tour guide systems, location aware systems and many more. In this paper, a new approach for recognition of Kannada words in low resolution natural scene images from a limited vocabulary is presented. The proposed method uses structural patterns of vertical and horizontal cuts as features, which are tolerant to font variability, uncertainty, noise and other degradations. These structural representations characterize the shape of the word image. The method works in two phases; In the training phase, several patterns of vertical and horizontal cut features that can occur generally even in the presence of uncertainty are determined from training word images and templates are constructed, one for each word under study. Further, these templates are organized into knowledge bases, one for each set of word images of equal size in terms of number of characters. During testing, a test word image is processed to obtain vertical and horizontal cut features and a newly defined pattern matching procedure that measures the maximum similarity between test sample and pre-constructed templates of word images in the knowledge base is used to recognize the word. The proposed methodology is evaluated for 1,200 Kannada word images and an overall recognition accuracy of 97.67 % is achieved. The proposed method is found to be robust and insensitive to the variations in size and style of font, thickness and spacing between characters, noise, and other degradations.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Abowd GD, Atkeson CG, Hong J, Long S, Kooper R, Pinkerton M (1997) CyberGuide: a mobile context-aware tour guide. Wireless Netw 3(5):421–433
Marmasse N, Schamandt C (2000) Location aware information delivery withcomMotion. In: Proceedings of conference on human factors in computing systems, pp 157–171
Tollmar K, Yeh T, Darrell T (2004) IDeixis—image-based deixis for finding location-based information. In: Proceedings of conference on human factors in computing systems (CHI’04), pp 781–782
Leetch G, Mangina E (2005) A multi-agent system to stream multimedia to handheld devices. In: Proceedings of the 6th international conference on computational intelligence and multimedia applications
Premchaiswadi W (2009) A mobile image search for tourist information system. In: Proceedings of 9th international conference on signal processing, computational geometry and artificial vision, pp 62–67
Ma C-J, Fang J-Y (2008) Location based mobile tour guide services towards digital dunhaung. In: International archives of phtotgrammtery, remote sensing and spatial information sciences, vol XXXVII, Part B4, Beijing
Wu S-H, Li M-X, Yanga P-C, Kub T (2010) Ubiquitous wikipedia on handheld device for mobile learning. In: 6th IEEE international conference on wireless, mobile, and ubiquitous technologies in education, pp 228–230
Yeh T, Grauman K, Tollmar K (2005) A picture is worth a thousand keywords: image-based object search on a mobile platform. In: Proceedings of conference on human factors in computing systems, pp 2025–2028
Fan X, Xie X, Li Z, Li M., Ma WY (2005) Photo-to-search: using multimodal queries to search web from mobile phones. In: Proceedings of 7th ACM SIGMM international workshop on multimedia information retrieval
Hwee LJ, Chevallet JP, Merah SN (2005) SnapToTell: Ubiquitous information access from camera. In: Mobile human computer interaction with mobile devices and services, Glasgow, Scotland
Zhang J, Chen X, Hanneman A, Yang J, Waibel A (2002) A robust approach for recognition of text embedded in natural scenes. In: Proceedings of 16th international conference on pattern recognition, vol 3, pp 204–207
Chen X, Yang J, Zhang J, Waibel A (2004) Automatic detection and recognition of signs from natural scenes. IEEE Trans Image Process 13(1):87–99
Mishra A, Alahari K, Jawahar CV (2012) Top-down and bottom-up cues for scene text recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition
Weinman JJ, Learned-Miller E, Hanson A (2008) A discriminative semi-Markov model for robust scene text recognition. In: 19th international conference on pattern recognition ICPR 2008, vol 8–11, pp 1–5
Weinman JJ, Learned-Miller E, Hanson A (2009) Scene text recognition using similarity and lexicon with sparse belief propogation. IEEE Trans Pattern Anal Mach Intell 31(10):1733–1746
Weinman JJ (2010) Typographical features for scene text recognition. In: 20th international conference on pattern recognition, vol 23–26, pp 3987–3990
Park J et al (2010) Automatic detection and recognition of Korean text in outdoor signboard images. Pattern Recogn Lett 31:1728–1739
Kobayashi T, Toyamaya T, Shafait F, Iwamura M, Kise K, Dengel A (2012) Recognizing words in scenes with a head-mounted eye-tracker. In: 10th IAPR international workshop on document analysis systems, vol 27–29, pp 333–338
Ghoshal R, Roy A, Parui SK (2011) Recognition of Bangla text from scene images through perspective correction. In: 2011 international conference on image information processing (ICIIP), vol 3–5, pp 1–6
Coates A, Carpenter B, Case C, Satheesh S, Suresh B, Wang T, Wu DJ, Ng AY (2011) Text detection and character recognition in scene images with unsupervised feature learning. In: 2011 international conference on document analysis and recognition (ICDAR), vol 18–21, pp 440–445
Wang K, Babenko B, Belongie S (2011) End-to-end scene text recognition. In: 2011 IEEE international conference on computer vision, vol 6–13, pp 1457–1464
Chung H, Sihn K-H, Hong S, Song HJ, Kim D (2011) Scene text recognition system using multigrain parallelism. Consumer. In: 2011 IEEE communications and networking conference, vol 9–12, pp 865–869
Wang, X, Ding X, Liu C (2001) Character extraction and recognition in natural scene images. In: Proceedings of 6th international conference on document analysis and recognition, pp 1084–1088
Otsu N (1978) A threshold selection method from gray-level histogram. IEEE Trans Syst Man Cybern 19(1):62–66
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer India
About this paper
Cite this paper
Angadi, S.A., Kodabagi, M.M. (2013). Recognition of Limited Vocabulary Kannada Words Through Structural Pattern Matching: An Experimentation on Low Resolution Images. In: Swamy, P., Guru, D. (eds) Multimedia Processing, Communication and Computing Applications. Lecture Notes in Electrical Engineering, vol 213. Springer, New Delhi. https://doi.org/10.1007/978-81-322-1143-3_15
Download citation
DOI: https://doi.org/10.1007/978-81-322-1143-3_15
Published:
Publisher Name: Springer, New Delhi
Print ISBN: 978-81-322-1142-6
Online ISBN: 978-81-322-1143-3
eBook Packages: EngineeringEngineering (R0)