Abstract
This paper presents a cross language platform to recognize characters and words of low resource scripts i.e. scripts which do not have standard dataset and the datasets are not available for public access. Indic scripts come from common origin and some of the scripts have a common 3 zonal structure. Recognition of such scripts can be done with the help of other scripts having similar structure. To recognize these characters the model is trained with source language Kannada with zone-wise training and testing is done with both Kannada and the target language Telugu. An accuracy of 88% for Kannada and 62% for Telugu characters is achieved by using Inception Model which is built using Convolution Neural Networks (CNN) image classifier. The dataset consists of 10700 Kannada characters. The model is also tested for 100 words of Telugu and Kannada with an accuracy of 72% and 82% respectively.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Bhunia, A.K., Roy, P.P., Mohta, A., Pal, U.: Cross-language framework for word recognition and spotting of Indic scripts. Pattern Recogn. 79, 12–31 (2018)
Fischer, A., Keller, A., Frinken, V., Bunke, H.: Lexicon-free handwritten word spotting using character HMMs. Pattern Recogn. Lett. 33(7), 934–942 (2012)
Vishwanath, N.V., Manjunathachari, K., Satyaprasad, K.: Handwritten Telugu composite character recognition using morphological analysis. Int. J. Pure Appl. Math. 119(18), 667–676 (2018)
Mamatha, H.R., Sucharitha, S., Srikanta Murthy, K.: Multi-font and multi-size Kannada character recognition based on the curvelets and standard deviation. Int. J. Comput. Appl. 35(11), 1–8 (2011)
Das, A., Bhunia, K., Roy, P.P., Pal, U.: Handwritten word spotting in Indic scripts using foreground and background information. In: Proceedings of Asian Conference on Pattern Recognition (ACPR), Kuala Lumpur, Malaysia, pp. 426–430 (2015)
Liu, C., Liu, J., Yu, F.: Handwritten character recognition with sequential convolutional neural network. In: Proceedings of the International Conference on Machine Learning and Cybernetics, Tianjin, pp. 291–296 (2013)
Bai, J., Chen, Z., Feng, B., Xu, B.: Image character recognition using deep convolutional neural network learned from different languages. In: IEEE International Conference on Image Processing (ICIP), Paris, France, pp. 2560–2564 (2014)
Krishnan, P., Sankaran, N., Singh, A.K., Jawahar, C.V.: Towards a robust OCR system for indic scripts. In: 11th IAPR International Workshop on Document Analysis Systems, Tours, France (2014). https://doi.org/10.1109/das.2014.74
Prameela, N., Anjusha, P., Karthik, R.: Off-line Telugu handwritten characters recognition using optical character recognition. In: International Conference on Electronics, Communication and Aerospace Technology, Coimbatore, India, pp. 223–226 (2017)
Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., Erhan, D., Vanhoucke, V., Rabinovich, A.: Going deeper with convolutions. Technical report, CoRR, abs/1409.4842 (2014)
The Chars74 K dataset. http://www.ee.surrey.ac.uk/CVSSP/demos/chars74k/. Accessed 12 Mar 2018
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Hebbi, C., Mamatha, H.R., Sahana, Y.S., Dhage, S., Somayaji, S. (2020). A Convolution Neural Networks Based Character and Word Recognition System for Similar Script Languages Kannada and Telugu. In: Singh, P., Panigrahi, B., Suryadevara, N., Sharma, S., Singh, A. (eds) Proceedings of ICETIT 2019. Lecture Notes in Electrical Engineering, vol 605. Springer, Cham. https://doi.org/10.1007/978-3-030-30577-2_26
Download citation
DOI: https://doi.org/10.1007/978-3-030-30577-2_26
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-30576-5
Online ISBN: 978-3-030-30577-2
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)