Abstract
Offline handwriting recognition is an important application of pattern recognition that has attracted a lot of interest from researchers. Transforming any handwritten material into machine-readable text data by extracting hidden patterns and comprehending the texts from the documents is a complex process. There are 22 scheduled languages in India and Gujarati is one among them. There are several optical character recognition issues (OCR) in Gujarati and it is difficult to identify universal invariant patterns and irregularities in handwritten Gujarati script. The lack of a big benchmark dataset is another important issue with handwritten Gujarati script. This issue was identified, and we built a dataset with 75600 images spanning 54 Gujarati character classes. Although, this dataset is reasonably large, it is still not large enough to learn deep neural networks from scratch due to overfitting concerns. To address this problem, we have integrated transfer learning with CNN for Gujarati handwritten character recognition. We have used 5 distinct pre-trained models and have achieved approximately 97% accuracy on images of 54 different classes.
Similar content being viewed by others
References
Pal U and Chaudhuri B B 2004 Indian script character recognition: a survey. Pattern Recognition. 37(9): 1887–1899
Bag S and Harit G 2013 A survey on optical character recognition for Bangla and Devanagari scripts. Sadhana. 38(1): 133–168
Dholakia J, Negi A and Mohan S R 2009 Progress in Gujarati document processing and character recognition. In: Guide to OCR for Indic Scripts (pp. 73–95). Springer, London
Desai AA 2010 Gujarati handwritten numeral optical character reorganization through neural network. Pattern recognition. 43(7): 2582–2589
Maloo M and Kale K V 2011 Support vector machine based Gujarati numeral recognition. International Journal on Computer Science and Engineering. 3(7): 2595–2600
MJ B, Kv K A L E and Me J A D H A V 2011 Comparison of classifiers for gujarati numeral recognition. International Journal of Machine Intelligence. 3(3): 160–163
Patel C and Desai A 2013 Gujarati handwritten character recognition using hybrid method based on binary tree-classifier and k-nearest neighbour. International Journal of Engineering Research & Technology (IJERT). 2(6): 2337–2345
Shah L, Patel R, Patel S and Maniar J 2014 Handwritten character recognition using radial histogram. J Res Advent Technol E. 2321:9637
Thaker H and Kumbharana C 2014 Analysis of structural features and classification of Gujarati consonant for offline character recognition. International Journal of Scientific and Research Publications. 4(8): 1–5
Nagar R, Mitra S K 2015 Feature extraction based on stroke orientation estimation technique for handwritten numeral. In: 2015 eighth international conference on advances in pattern recognition (ICAPR) (pp. 1–6). IEEE
Goswami M M and Mitra S K 2015 Offline handwritten Gujarati numeral recognition using low-level strokes. International Journal of Applied Pattern Recognition. 2(4): 353–379
Prasad J R and Kulkarni U 2015 Gujrati character recognition using weighted k-NN and mean \(\chi ^2\) distance measure. International Journal of Machine Learning and Cybernetics. 6(1): 69–82
Sharma A K, Adhyaru D M, Zaveri T H and Thakkar P B 2015 Comparative analysis of zoning based methods for Gujarati handwritten numeral recognition. In: 2015 5th Nirma University International Conference on Engineering (NUiCONE) (pp. 1–5). IEEE
Sharma A K, Adhyaru D M and Zaveri T H 2018 A novel cross correlation-based approach for handwritten Gujarati character recognition. In: Proceedings of First International Conference on Smart System, Innovations and Computing (pp. 505–513). Springer, Singapore
Sharma A K, Thakkar P, Adhyaru D M and Zaveri T H 2019 Handwritten Gujarati Character Recognition Using Structural Decomposition Technique. Pattern Recognition and Image Analysis. 29(2): 325–338
Pareek J, Singhania D, Kumari R R and Purohit S 2020 Gujarati handwritten character recognition from text images. Procedia Computer Science. 171: 514–523
Rajyagor B and Rakholia R 2021 Isolated Gujarati Handwritten Character Recognition (HCR) using Deep Learning (LSTM). In: 2021 Fourth International Conference on Electrical, Computer and Communication Technologies (ICECCT) (pp. 1–6). IEEE
Yao G, Lei T and Zhong J 2019 A review of convolutional-neural-network-based action recognition. Pattern Recognition Letters. 118: 14–22
Dhillon A and Verma G K 2020 Convolutional neural network: a review of models, methodologies and applications to object detection. Progress in Artificial Intelligence. 9(2): 85–112
Simonyan K and Zisserman A 2014 Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556
Szegedy C, Vanhoucke V, Ioffe S, Shlens J and Wojna Z 2016 Rethinking the inception architecture for computer vision. In: Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 2818–2826)
Huang G, Liu Z, Van Der Maaten L and Weinberger K Q 2017 Densely connected convolutional networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 4700–4708)
Zoph B, Vasudevan V, Shlens J and Le Q V 2018 Learning transferable architectures for scalable image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 8697–8710)
Howard A G, Zhu M, Chen B, Kalenichenko D, Wang W, Weyand T, Andreetto M and Adam H 2017 Mobilenets: Efficient convolutional neural networks for mobile vision applications. arXiv preprint arXiv:1704.04861
Acknowledgements
The authors of this paper are grateful to the Institute of Technology, Nirma University for their support and motivation during this research.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Limbachiya, K., Sharma, A., Thakkar, P. et al. Identification of handwritten Gujarati alphanumeric script by integrating transfer learning and convolutional neural networks. Sādhanā 47, 102 (2022). https://doi.org/10.1007/s12046-022-01864-9
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s12046-022-01864-9