Abstract
The world is growing with the new technologies for machine learning. Day by day there are incredible developments in the area of pattern recognition for the applications such as institutional record keeping, ancient documents preservation, postal address sorting, signature verification, etc. Handwritten character recognition is one such application grabbing lot of attention for the machines to learn. Handwritten Devanagari script is used in the proposed work which is very difficult to understand by the machines due to many similar shapes incorporated with it. So, pre-processing before extracting the features from the character plays very important role in increasing the accuracy. By keeping this in mind, a new framework for handwritten Devanagari character recognition is proposed where entropy -based skew correction is used to correct the skew of the characters and Mask-based approach is used which efficiently removes the header line and returns the header free character. The histograms of oriented gradients (HOG) features are extracted from the header free characters and provided for non-parametric dimensionality reduction. These features are further classified using AdaBoost ensemble boosting method and achieved a very good recognition accuracy of 98.43% and 98.68% on V2DMDCHAR and ISIDCHAR databases respectively.
Similar content being viewed by others
References
Arica N, Yarman-Vural FT (2001) An overview of character recognition focused on off-line handwriting. IEEE Trans Syst Man Cybern Part C (Appl Rev) 31(2):216–233
Arvind KR, Kumar J, Ramakrishnan AG (2007) Entropy -based skew correction of document images. In International conference on Pattern recognition and Machine Intelligence, 495–502
Bhattacharya U, Chaudhuri BB (2005) Databases for research on recognition of handwritten characters of Indian scripts. In: Proceedings of the Eight International Conference on Document Analysis and Recognition (ICDAR’05). IEEE, Seoul, South Korea, pp 789–793
Dalal N, Triggs B (2005) Histograms of oriented gradients for human detection. In 2005 IEEE computer society conference on computer vision and pattern recognition (CVPR'05) 1:886–893
Deore SP, Pravin A (2019) Histogram of oriented gradients -based off-line handwritten Devanagari characters recognition using SVM, K-NN and NN classifiers. Rev d’Intelligence Artif 33(6):441–446
Dongre VJ, Mankar VH (2012) Development of comprehensive devnagari numeral and character database for offline handwritten character recognition. Appl Comput Intell Soft Comput
Freund Y, Schapire R, Abe N (1999) A short introduction to boosting. J Jpn Soc Artif Intell 14(5):771–780
Fu Q, Ding X, Liu C (2008) A new AdaBoost algorithm for large scale classification and its application to Chinese handwritten character recognition. In Int’l Conf. Frontiers in Handwriting Recognition
Gaikwad SS, Nalbalwar SL, Nandgaonkar AB (2020) An approach towards devanagari character segmentation using adaptive thresholding and polynomial. Int J Adv Sci Technol 29(10s):7888–7896
Guha R, Das N, Kundu M, Nasipuri M, Santosh KC (2020) Devnet: an efficient cnn architecture for handwritten devanagari character recognition. Int J Pattern Recognit Artif Intell 34(12):2052009
Jangid M, Srivastava S (2018) Handwritten devanagari character recognition using layer-wise training of deep convolutional neural networks and adaptive gradient methods. J Imaging 4(2):41
Jayadevan R, Kolhe SR, Patil PM, Pal U (2011) Offline recognition of Devanagari script: A survey. IEEE Trans Syst Man Cybern Part C (Appl Rev) 41(6):782–796
Jha V, Parvathi K (2019) Braille Transliteration of hindi handwritten texts using machine learning for character recognition. Int J Sci Technol Res 8(10):1188–1193
Jindal T, Bhattacharya U (2013) Recognition of offline handwritten numerals using an ensemble of MLPs combined by adaboost. In Proceedings of the 4th International Workshop on Multilingual OCR, pp 1–5
Kamble PM, Hegadi R (2015) Handwritten Marathi character recognition using R-HOG Feature. Procedia Comput Sci 45:266–274
Kaur S, Sagar BB (2019) Brahmi character recognition -based on SVM (support vector machine) classifier using image gradient features. J Discret Math Sci Cryptogr 22(8):1365–1381
Khuman YLK, Devi HM, Singh NA (2021) Entropy-based skew detection and correction for printed meitei/meetei script ocr system. Mater Today Proc 37:2666–2669
Kim K, Choi HI, Oh K (2017) Object detection using ensemble of linear classifiers with fuzzy adaptive boosting. EURASIP J Image Video Process 1:1–10
Narang SR, Jindal MK, Kumar M (2019) Devanagari ancient character recognition using DCT features with adaptive boosting and bootstrap aggregating. Soft Comput 23(24):13603–13614
Narang SR, Jindal MK, Kumar M (2020) Line segmentation of Devanagari ancient manuscripts. Proc Natl Acad Sci 90(4):717–724
Opitz D, Maclin R (1999) Popular ensemble methods: An empirical study. J Artif Intell Res 11:169–198
Sahare P, Dhok SB (2018) Multilingual character segmentation and recognition schemes for Indian document images. IEEE access 6:10603–10617
Schober P, Boer C, Schwarte LA (2018) Correlation coefficients: appropriate use and interpretation. Anesth Analg 126(5):1763–1768
Setumin S, Suandi SA (2018) Difference of gaussian oriented gradient histogram for face sketch to photo matching. IEEE Access 6:39344–39352
Spearman C (2010) The proof and measurement of association between two things. Int J Epidemiol 39(5):1137–1150
Thakur A, Kaur A (2019) Devanagari handwritten character recognition using neural network. Int J Sci Technol Res 8(10):1–9
Zhou W, Gao S, Zhang L, Lou X (2020) Histogram of oriented gradients feature extraction from raw bayer pattern images. IEEE Trans Circuits Syst II Express Briefs 67(5):946–950
Zhou T, Li J, Wang S, Tao R, Shen J (2020) MATNet: motion-attentive transition network for zero-shot video object segmentation. IEEE Trans Image Process 29:8326–8338
Zhou T, Qi S, Wang W, Shen J, Zhu S (2022) Cascaded parsing of human-object interaction recognition. IEEE Trans Pattern Anal Mach Intell 44(6):2827–2840
Zhou T, Li L, Li X, Feng CM, Li J, Shao L (2022) Group-wise learning for weakly supervised semantic segmentation. IEEE Trans Image Process 31:799–811
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflicts of interests
The authors have no conflicts of interests to declare that are relevant to the content of this article and no relevant financial support for this work to disclose.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Gaikwad, S., Nalbalwar, S. & Nandgaonkar, A. Recognition of offline handwritten Devanagari characters using new mask-based approach, histogram of oriented gradients and AdaBoost. Multimed Tools Appl 82, 43883–43902 (2023). https://doi.org/10.1007/s11042-023-15424-8
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-023-15424-8