Recognition of Limited Vocabulary Kannada Words Through Structural Pattern Matching: An Experimentation on Low Resolution Images

Angadi, S. A.; Kodabagi, M. M.

doi:10.1007/978-81-322-1143-3_15

S. A. Angadi³ &
M. M. Kodabagi³

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 213))

929 Accesses

Abstract

Text recognition at character/word level is one of the very important steps for development of automated systems for understanding low resolution display board images which facilitate several new applications such as blind assistants, tour guide systems, location aware systems and many more. In this paper, a new approach for recognition of Kannada words in low resolution natural scene images from a limited vocabulary is presented. The proposed method uses structural patterns of vertical and horizontal cuts as features, which are tolerant to font variability, uncertainty, noise and other degradations. These structural representations characterize the shape of the word image. The method works in two phases; In the training phase, several patterns of vertical and horizontal cut features that can occur generally even in the presence of uncertainty are determined from training word images and templates are constructed, one for each word under study. Further, these templates are organized into knowledge bases, one for each set of word images of equal size in terms of number of characters. During testing, a test word image is processed to obtain vertical and horizontal cut features and a newly defined pattern matching procedure that measures the maximum similarity between test sample and pre-constructed templates of word images in the knowledge base is used to recognize the word. The proposed methodology is evaluated for 1,200 Kannada word images and an overall recognition accuracy of 97.67 % is achieved. The proposed method is found to be robust and insensitive to the variations in size and style of font, thickness and spacing between characters, noise, and other degradations.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Hardcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Abowd GD, Atkeson CG, Hong J, Long S, Kooper R, Pinkerton M (1997) CyberGuide: a mobile context-aware tour guide. Wireless Netw 3(5):421–433
Article Google Scholar
Marmasse N, Schamandt C (2000) Location aware information delivery withcomMotion. In: Proceedings of conference on human factors in computing systems, pp 157–171
Google Scholar
Tollmar K, Yeh T, Darrell T (2004) IDeixis—image-based deixis for finding location-based information. In: Proceedings of conference on human factors in computing systems (CHI’04), pp 781–782
Google Scholar
Leetch G, Mangina E (2005) A multi-agent system to stream multimedia to handheld devices. In: Proceedings of the 6th international conference on computational intelligence and multimedia applications
Google Scholar
Premchaiswadi W (2009) A mobile image search for tourist information system. In: Proceedings of 9th international conference on signal processing, computational geometry and artificial vision, pp 62–67
Google Scholar
Ma C-J, Fang J-Y (2008) Location based mobile tour guide services towards digital dunhaung. In: International archives of phtotgrammtery, remote sensing and spatial information sciences, vol XXXVII, Part B4, Beijing
Google Scholar
Wu S-H, Li M-X, Yanga P-C, Kub T (2010) Ubiquitous wikipedia on handheld device for mobile learning. In: 6th IEEE international conference on wireless, mobile, and ubiquitous technologies in education, pp 228–230
Google Scholar
Yeh T, Grauman K, Tollmar K (2005) A picture is worth a thousand keywords: image-based object search on a mobile platform. In: Proceedings of conference on human factors in computing systems, pp 2025–2028
Google Scholar
Fan X, Xie X, Li Z, Li M., Ma WY (2005) Photo-to-search: using multimodal queries to search web from mobile phones. In: Proceedings of 7th ACM SIGMM international workshop on multimedia information retrieval
Google Scholar
Hwee LJ, Chevallet JP, Merah SN (2005) SnapToTell: Ubiquitous information access from camera. In: Mobile human computer interaction with mobile devices and services, Glasgow, Scotland
Google Scholar
Zhang J, Chen X, Hanneman A, Yang J, Waibel A (2002) A robust approach for recognition of text embedded in natural scenes. In: Proceedings of 16th international conference on pattern recognition, vol 3, pp 204–207
Google Scholar
Chen X, Yang J, Zhang J, Waibel A (2004) Automatic detection and recognition of signs from natural scenes. IEEE Trans Image Process 13(1):87–99
Article Google Scholar
Mishra A, Alahari K, Jawahar CV (2012) Top-down and bottom-up cues for scene text recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition
Google Scholar
Weinman JJ, Learned-Miller E, Hanson A (2008) A discriminative semi-Markov model for robust scene text recognition. In: 19th international conference on pattern recognition ICPR 2008, vol 8–11, pp 1–5
Google Scholar
Weinman JJ, Learned-Miller E, Hanson A (2009) Scene text recognition using similarity and lexicon with sparse belief propogation. IEEE Trans Pattern Anal Mach Intell 31(10):1733–1746
Article Google Scholar
Weinman JJ (2010) Typographical features for scene text recognition. In: 20th international conference on pattern recognition, vol 23–26, pp 3987–3990
Google Scholar
Park J et al (2010) Automatic detection and recognition of Korean text in outdoor signboard images. Pattern Recogn Lett 31:1728–1739
Article Google Scholar
Kobayashi T, Toyamaya T, Shafait F, Iwamura M, Kise K, Dengel A (2012) Recognizing words in scenes with a head-mounted eye-tracker. In: 10th IAPR international workshop on document analysis systems, vol 27–29, pp 333–338
Google Scholar
Ghoshal R, Roy A, Parui SK (2011) Recognition of Bangla text from scene images through perspective correction. In: 2011 international conference on image information processing (ICIIP), vol 3–5, pp 1–6
Google Scholar
Coates A, Carpenter B, Case C, Satheesh S, Suresh B, Wang T, Wu DJ, Ng AY (2011) Text detection and character recognition in scene images with unsupervised feature learning. In: 2011 international conference on document analysis and recognition (ICDAR), vol 18–21, pp 440–445
Google Scholar
Wang K, Babenko B, Belongie S (2011) End-to-end scene text recognition. In: 2011 IEEE international conference on computer vision, vol 6–13, pp 1457–1464
Google Scholar
Chung H, Sihn K-H, Hong S, Song HJ, Kim D (2011) Scene text recognition system using multigrain parallelism. Consumer. In: 2011 IEEE communications and networking conference, vol 9–12, pp 865–869
Google Scholar
Wang, X, Ding X, Liu C (2001) Character extraction and recognition in natural scene images. In: Proceedings of 6th international conference on document analysis and recognition, pp 1084–1088
Google Scholar
Otsu N (1978) A threshold selection method from gray-level histogram. IEEE Trans Syst Man Cybern 19(1):62–66
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science and Engineering, Basaveshwar Engineering College, Bagalkot, Karnataka, 587102, India
S. A. Angadi & M. M. Kodabagi

Authors

S. A. Angadi
View author publications
You can also search for this author in PubMed Google Scholar
M. M. Kodabagi
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to S. A. Angadi .

Editor information

Editors and Affiliations

Master of Computer Applications, PES Institute of Technology, Banashankari 3rd stage, Near Hoskerehalli Cross 100 Feet, Bangalore, 560085, Karnataka, India
Punitha P. Swamy
Studies in Computer Science, University of Mysore, Manasagangotri, Mysore, 570006, Karnataka, India
Devanur S. Guru

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Angadi, S.A., Kodabagi, M.M. (2013). Recognition of Limited Vocabulary Kannada Words Through Structural Pattern Matching: An Experimentation on Low Resolution Images. In: Swamy, P., Guru, D. (eds) Multimedia Processing, Communication and Computing Applications. Lecture Notes in Electrical Engineering, vol 213. Springer, New Delhi. https://doi.org/10.1007/978-81-322-1143-3_15

Download citation

DOI: https://doi.org/10.1007/978-81-322-1143-3_15
Published: 26 May 2013
Publisher Name: Springer, New Delhi
Print ISBN: 978-81-322-1142-6
Online ISBN: 978-81-322-1143-3
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics