Abstract
Offline handwritten character recognition is a conversion process of handwriting into machine-encoded text and predominantly used for digitizing handwritten texts and forensic applications. Currently, several techniques and methods are proposed to enhance accuracy of offline handwritten character recognition for many languages spoken across the globe like English, Tamil, Chinese and Arabic. In this paper, a local feature-based approach using supervised learning techniques is proposed to enhance the accuracy of handwritten offline character recognition for Thai alphabets using unsupervised learning for individual character as a class, whereas most of the existing methodologies for Thai character recognition is done with group of similarly looking characters as a class. The classification is operated by using support vector machine (SVM). The accuracy would be the percentage of correct classification for each class. For the result, the highest accuracy is 74.32% which has 144-bit shape features and uniform pattern LBP for the features.
Similar content being viewed by others
References
Tembe AU, Thombre SS (2017) Survey of copy-paste forgery detection in digital image forensic. In 2017 international conference on innovative mechanisms for industry applications (ICIMIA), pp 248–252
Singh S (2013) Optical character recognition techniques: a survey. J Emerg Trends Comput Inf Sci 4(6):545–550
Kannan RJ, Prabhakar R (2009) A comparative study of optical character recognition for tamil script. Eur J Sci Res 35(4):570–582
Ravi FJJT, Velayutham PR (2010) Effective tamil character recognition in tablet PCs using pattern recognition. In Tamil Internet Conference
Hildebrandt TH, Liu W (1993) Optical recognition of handwritten Chinese characters: advances since 1980. Pattern Recognit 26(2):205–225
Raouf AMA (2012) Offline printed Arabic character recognition. University of Nottingham, Nottingham
Ojala T, Pietikainen M, Harwood D (1996) A comparative study of texture measures with classification based on feature distributions. Pattern Recognit 29(1):51–59
Thailand Population (2018) Worldometers. http://www.worldometers.info/world-population/thailand-population/. Accessed 04 June 2018
Thai Language”, Wikipedia. https://en.wikipedia.org/wiki/Thai_language. Accessed 04 Nov 2016
Chandra MA, Bedi SS (2018) Survey on SVM and their application in image classification. Int J Inf Technol. https://doi.org/10.1007/s41870-017-0080-1
Dubey P (2019) The Hindi to Dogri machine translation system: grammatical perspective. Int J Inf Technol 11(1):171–182
Pornpanomchai C, Wongsawangtham V, Jeungudomporn S, Chatsumpun N (2011) Thai handwritten character recognition by genetic algorithm (THCRGA). Int J Eng Technol 3(2):148–153
Phokharatkul P, Sankhuangaw K, Somkuarnpanit S, Phaiboon S, Kimpan C (2005) Off-line hand written Thai character recognition using ant-miner algorithm. Int J Comput Electron Autom Control Inf Eng 8(1):276–281
Methasate I, Marukatat S, Sae-Tang S, Theeramunkong T (2005) The feature combination technique for off-line Thai character recognition system. In: Proceedings of the international conference on document analysis and recognition, ICDAR, vol 2005: 1006–1009
Joseph FJJ, Anantaprayoon P (2018) Offline handwritten Thai character recognition using single tier classifier and local features. In: 2018 international conference on information technology (InCIT), pp 1–4
Ahmad K, Sahu M, Shrivastava M, Rizvi MA, Jain V (2018) An efficient image retrieval tool: query based image management system. Int J Inf Technol. https://doi.org/10.1007/s41870-018-0198-9
Joseph FJJ, Auwatanamongkol S (2016) A crowding multi-objective genetic algorithm for image parsing. Neural Comput Appl 27(8):2217–2227
Joseph FJJ, Ravi T, Justus C (2011) Classification of correlated subspaces using HoVer representation of Census Data. In: 2011 international conference on emerging trends in electrical and computer technology, pp 906–911
Joseph FJJ (2019) Empirical dominance of features for predictive analytics of particulate matter pollution in Thailand. In: 5th Thai-Nichi Institute of Technology Academic Conference TNIAC 2019, pp 385–388
Deza MM, Deza E (eds) (2009) Encyclopedia of distances. Springer, Berlin, Heidelberg, pp 1–583
Hsu C, Chang C, Lin C (2010) A practical guide to support vector classification
Chang C, Lin C (2013) LIBSVM: a library of support vector machines
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Joseph, F.J.J. Effect of supervised learning methodologies in offline handwritten Thai character recognition. Int. j. inf. tecnol. 12, 57–64 (2020). https://doi.org/10.1007/s41870-019-00366-y
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s41870-019-00366-y