
Effect of supervised learning methodologies in offline handwritten Thai character recognition

  • Original Research
International Journal of Information Technology

Abstract

Offline handwritten character recognition converts handwriting into machine-encoded text and is used predominantly for digitizing handwritten texts and in forensic applications. Several techniques have been proposed to improve the accuracy of offline handwritten character recognition for many languages, such as English, Tamil, Chinese and Arabic. This paper proposes a local feature-based approach using supervised learning to improve the accuracy of offline handwritten character recognition for the Thai alphabet. Each individual character is treated as a class, whereas most existing methodologies for Thai character recognition treat a group of similar-looking characters as a class. Classification is performed with a support vector machine (SVM). Accuracy is measured as the percentage of correct classifications for each class. The highest accuracy obtained is 74.32%, achieved with 144-bit shape features combined with uniform-pattern LBP features.
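
The two quantities named in the abstract, a uniform-pattern LBP histogram and a per-class accuracy, can be illustrated with a minimal sketch. It is not the paper's implementation. The 8-neighbour, radius-1 neighbourhood, the `>=` thresholding convention, the 59-bin layout and all function names are assumptions made for illustration, and the 144-bit shape features and the SVM configuration are not reproduced because the abstract does not specify them.

```python
# Illustrative sketch only: neighbourhood, thresholding rule and names are assumptions.

def is_uniform(code: int) -> bool:
    """True if the circular 8-bit pattern has at most two 0/1 transitions."""
    bits = [(code >> i) & 1 for i in range(8)]
    transitions = sum(bits[i] != bits[(i + 1) % 8] for i in range(8))
    return transitions <= 2

# Each uniform code gets its own bin; every non-uniform code shares one extra bin.
UNIFORM_CODES = [c for c in range(256) if is_uniform(c)]
BIN_OF = {code: index for index, code in enumerate(UNIFORM_CODES)}
NON_UNIFORM_BIN = len(UNIFORM_CODES)

# Neighbour offsets (row, column), clockwise starting at the top-left pixel.
OFFSETS = [(-1, -1), (-1, 0), (-1, 1), (0, 1), (1, 1), (1, 0), (1, -1), (0, -1)]

def lbp_code(img, r, c):
    """8-bit LBP code of pixel (r, c): a bit is set when the neighbour >= centre."""
    center = img[r][c]
    code = 0
    for bit, (dr, dc) in enumerate(OFFSETS):
        if img[r + dr][c + dc] >= center:
            code |= 1 << bit
    return code

def uniform_lbp_histogram(img):
    """Normalised uniform-pattern LBP histogram over the interior pixels of img."""
    hist = [0] * (NON_UNIFORM_BIN + 1)
    for r in range(1, len(img) - 1):
        for c in range(1, len(img[0]) - 1):
            hist[BIN_OF.get(lbp_code(img, r, c), NON_UNIFORM_BIN)] += 1
    total = sum(hist)
    return [h / total for h in hist] if total else hist

def per_class_accuracy(y_true, y_pred):
    """Percentage of correctly classified samples, computed separately per class."""
    correct, count = {}, {}
    for truth, pred in zip(y_true, y_pred):
        count[truth] = count.get(truth, 0) + 1
        correct[truth] = correct.get(truth, 0) + (truth == pred)
    return {label: 100.0 * correct[label] / count[label] for label in count}
```

In a pipeline of this kind, the histogram would be concatenated with the other features and passed to a classifier such as an SVM, and `per_class_accuracy` would then be applied to the predictions with one label per Thai character.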



Author information

Corresponding author

Correspondence to Ferdin Joe John Joseph.

About this article

Cite this article

Joseph, F.J.J. Effect of supervised learning methodologies in offline handwritten Thai character recognition. Int. J. Inf. Technol. 12, 57–64 (2020). https://doi.org/10.1007/s41870-019-00366-y

  • DOI: https://doi.org/10.1007/s41870-019-00366-y
