Machine Recognition of Printed Kannada Text

Vijay Kumar, B.; Ramakrishnan, A. G.

doi:10.1007/3-540-45869-7_4

B. Vijay Kumar⁶ &
A. G. Ramakrishnan⁶

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2423))

Included in the following conference series:

International Workshop on Document Analysis Systems

1120 Accesses
16 Citations

Abstract

This paper presents the design of a full fledged OCR system for printed Kannada text. The machine recognition of Kannada characters is dificult due to similarity in the shapes of different characters, script complexity and non-uniqueness in the representation of diacritics. The document image is subject to line segmentation, word segmentation and zone detection. From the zonal information, base characters, vowel modifiers and consonant conjucts are separated. Knowledge based approach is employed for recognizing the base characters. Various features are employed for recognising the characters. These include the coefficients of the Discrete Cosine Transform, Discrete Wavelet Transform and Karhunen-Louve Transform. These features are fed to different classifiers. Structural features are used in the subsequent levels to discriminate confused characters. Use of structural features, increases recognition rate from 93% to 98%. Apart from the classical pattern classification technique of nearest neighbour, Artificial Neural Network (ANN) based classifiers like Back Propogation and Radial Basis Function (RBF) Networks have also been studied. The ANN classifiers are trained in supervised mode using the transform features. Highest recognition rate of 99% is obtained with RBF using second level approximation coefficients of Haar wavelets as the features on presegmented base characters.

Download to read the full chapter text

Chapter PDF

Performance evaluation of different features and classifiers for Gurumukhi newspaper text recognition

Article 17 January 2022

Discrete Wavelet-Based Multi-Classifier Approach for Recognition of Offline Handwritten Hindi Numerals

An Unsupervised Classification of Printed and Handwritten Telugu Words in Pre-printed Documents Using Text Discrimination Coefficient

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

Chaudhuri, B.B., Pal, U.: A Complete Printed Bangla OCR System. Pattern Recognition, Vol. 31, No. 5 (1998) 531–549
Article Google Scholar
Lu, Y.I.: Machine Printed Character Segmentation — An Overview. Pattern Recognition, Vol. 28, No. 1 (1995) 67–80
Article Google Scholar
Gonzalez, R.C., Woods, R.E.: Digital Image Processing. Addison Wesley, New York (1993)
Google Scholar
Haykin, S.: Neural Networks. A Comprehensive Foundation. Pearson Education Asia (1999)
Google Scholar
Jagadeesh, G.S., Gopinath, V.: Kantex, A Transliteration Package for Kannada. Kantex Manual. http://langmuir.eecs.berkeley.edu/venkates/KanTex 1.00.html

Download references

Author information

Authors and Affiliations

Department of Electrical Engineering, Indian Institute of Science, 560012, Bangalore, India
B. Vijay Kumar & A. G. Ramakrishnan

Authors

B. Vijay Kumar
View author publications
You can also search for this author in PubMed Google Scholar
A. G. Ramakrishnan
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Bell Labs, Lucent Technologies, 600 Mountain Avenue, 07974, Murray Hill, NJ, USA
Daniel Lopresti
Avaya Labs Research, 233 Mount Airy Road, 07920, Basking Ridge, NJ, USA
Jianying Hu & Ramanujan Kashi &

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Vijay Kumar, B., Ramakrishnan, A.G. (2002). Machine Recognition of Printed Kannada Text. In: Lopresti, D., Hu, J., Kashi, R. (eds) Document Analysis Systems V. DAS 2002. Lecture Notes in Computer Science, vol 2423. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45869-7_4

Download citation

DOI: https://doi.org/10.1007/3-540-45869-7_4
Published: 09 August 2002
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44068-0
Online ISBN: 978-3-540-45869-2
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics

Societies and partnerships

The International Association for Pattern Recognition (opens in a new tab)

Machine Recognition of Printed Kannada Text

Abstract

Chapter PDF

Similar content being viewed by others

Performance evaluation of different features and classifiers for Gurumukhi newspaper text recognition

Discrete Wavelet-Based Multi-Classifier Approach for Recognition of Offline Handwritten Hindi Numerals

An Unsupervised Classification of Printed and Handwritten Telugu Words in Pre-printed Documents Using Text Discrimination Coefficient

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Societies and partnerships

Navigation

Machine Recognition of Printed Kannada Text

Abstract

Chapter PDF

Similar content being viewed by others

Performance evaluation of different features and classifiers for Gurumukhi newspaper text recognition

Discrete Wavelet-Based Multi-Classifier Approach for Recognition of Offline Handwritten Hindi Numerals

An Unsupervised Classification of Printed and Handwritten Telugu Words in Pre-printed Documents Using Text Discrimination Coefficient

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Societies and partnerships

Search

Navigation