Skip to main content
Log in

RATNet: A deep learning model for Bengali handwritten characters recognition

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

The Bengali language is based on a set of symbols for basic characters, modifiers, compound characters, and numerals. The recognition rates of handwritten basic characters and numerals are very high. However, the recognition rates of compound characters and modifiers are still poor. This might be due to their large class size with huge writing styles, much similarity, and unavailability of sufficient data for deep learning. In fact, there are some compound characters which appear very rare in practice. A proper selection of frequently used characters may reduce class size, and hence improving the accuracy. In this study, we performed a statistics on the frequency of compound characters, we developed two datasets for modifiers and compound characters, and finally we proposed a heterogeneous deep learning model (RATNet) for characters recognition. A statistics was performed on two daily Bengali newspapers, and characters with frequency ≥ 5% were selected. The handwriting of selected characters was collected from 130 writers of different ages and professions. The performance of RATNet model was evaluated on the proposed datasets and also three other existing datasets (i.e., ISI, CMATERdb, BanglaLekha-Isolated). In addition, the performance of RATNet was also compared with LeNet-5, VGG-16, ResNet-50, and DenseNet-121 models. We selected 87 out of 107 compound characters. The proposed RATNet model outperforms other models providing 99.66%, 99.27%, 98.78%, and 97.70% accuracy, respectively for the recognition of numerals, basic characters, modifiers, and compound characters on the CMATERdb dataset while keeping the number of parameters relatively low likely due to layer heterogeneity.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5

Similar content being viewed by others

References

  1. Akhand M, Rahman MM, Shill P et al (2015) Bangla handwritten numeral recognition using convolutional neural network. In: Proceeding of the international conference on electrical engineering and information communication technology, Dhaka, Bangladesh, pp 1–5

  2. Akhand MAH, Ahmed M, Rahman MMH (2016) Multiple convolutional neural network training for Bangla handwritten numeral recognition. In: Proceeding of the international conference on computer and communication engineering, San Francisco, Canada, pp 311–315

  3. Ali AAA, Suresha M (2019) A novel features and classifiers fusion technique for recognition of arabic handwritten character script. SN Appl Sci 1(10):1286

    Article  Google Scholar 

  4. Alom MZ, Sidike P, Taha TM et al (2017) Handwritten Bangla digit recognition using deep learning. arXiv:170502680

  5. Alom MZ, Sidike P, Hasan M et al (2018) Handwritten Bangla character recognition using the state-of-the-art deep convolutional neural networks. Comput Intel Neurosc. arXiv:1712.09872:315-320

  6. Amin MS, Yasir SM, Ahn H (2020) Recognition of pashto handwritten characters based on deep learning. Sensors 20(20):5884

    Article  Google Scholar 

  7. Ba J, Mnih V, Kavukcuoglu K (2014) Multiple object recognition with visual attention. arXiv:14127755

  8. Basu S, Das N, Sarkar R et al (2009) A hierarchical approach to recognition of handwritten Bangla characters. Pattern Recognit 42(7):1467–1484

    Article  Google Scholar 

  9. Bhattacharya U, Chaudhuri BB (2009) Handwritten numeral databases of Indian scripts and multistage recognition of mixed numerals. IEEE Trans Pattern Anal Mach Intell 31(3):444–457

    Article  Google Scholar 

  10. Bhattacharya U, Shridhar M, Parui SK et al (2012) Offline recognition of handwritten Bangla characters: An efficient two-stage approach. Pattern Anal Appl 15(4):445–458

    Article  MathSciNet  Google Scholar 

  11. Bhowmik TK, Ghanty P, Roy A et al (2009) SVM-Based hierarchical architectures for handwritten Bangla character recognition. Int J Doc Anal Recogn 12(2):97–108

    Article  Google Scholar 

  12. Biswas M, Islam R, Shom GK et al (2017) Banglalekha-isolated: A comprehensive Bangla handwritten character dataset. arXiv:170310661

  13. Breuel TM, Ul-Hasan A, Al-Azawi MA et al (2013) High-performance ocr for printed english and fraktur using lstm networks. In: 2013 12th international conference on document analysis and recognition, pp 683–687

  14. Chatterji SK (2002) The origin and development of the Bengali language. Indogermanische Forschungen 47(1):370–380

    Google Scholar 

  15. Chaudhuri B, Pal U (1998) A complete printed Bangla OCR system. Pattern Recognit 31(5):531–549

    Article  Google Scholar 

  16. Das N et al (2012) Sarkar a genetic algorithm based region sampling for selection of local features in handwritten digit recognition application. Appl Soft Comput 12(5):1592–1606

    Article  Google Scholar 

  17. Das N, Basu S, Sarkar R et al (2009) Handwritten bangla compound character recognition: Potential challenges and probable solution. In: Procedding of the international joint conference on artificial intelligence, pp 1901–1913

  18. Das N, Das B, Sarkar R et al (2010) Handwritten Bangla basic and compund characters recogntion using MLP and SVM classifiers. J Comput 2(2):109–115

    Google Scholar 

  19. Das N, Acharya K, Sarkar R, et al. (2014) A benchmark image database of isolated Bangla handwritten compound characters. Int J Doc Anal Recogn 17:413–431. https://doi.org/10.1007/s10032-014-0222-y

    Article  Google Scholar 

  20. Fardous A, Afroge S (2019) Handwritten isolated Bangla compound character recognition. In: Procedding of the international conference on electrical, computer and communication engineering (ECCE), Cox’sBazar, Bangladesh, pp 1–5

  21. Gruber I, Hlaváč M, železnỳ M, et al. (2017) Facing face recognition with resnet: round one. In: Procedding of the international conference on interactive collaborative robotics, Hatfield, Hertfordshire, UK, pp 67–74

  22. Hariharan B, Arbeláez P, Girshick R et al (2015) Hypercolumns for object segmentation and fine-grained localization. In: Proceedings of the IEEE conference on computer vision and pattern recognition, Boston, Massachusetts, pp 447–456

  23. He K, Zhang X, Ren S et al (2015) Deep residual learning for image recognition. CoRR arXiv:abs/1512.03385

  24. Huang G, Liu Z, Van Der Maaten L et al (2017) Densely connected convolutional networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, Honolulu, HI, USA, pp 4700–4708

  25. Kingma DP, Ba J (2014) Adam: A method for stochastic optimization. arXiv:14126980

  26. LeCun Y, Bottou L, Bengio Y, et al. (1998) Gradient-based learning applied to document recognition. Proc IEEE 86(11):2278–2324

    Article  Google Scholar 

  27. McHugh ML (2012) Interrater reliability: The kappa statistic. Biochemia Medica 22(3):276–282

    Article  MathSciNet  Google Scholar 

  28. Mnih V, Heess N, Graves A et al (2014) Recurrent models of visual attention. In: Procedding of the advances in neural information processing systems, Montreal Convention Centre, Canada, pp 2204–2212

  29. Modhej N, Bastanfard A, Teshnehlab M, et al. (2020) Pattern separation network based on the hippocampus activity for handwritten recognition. IEEE Access 8:212,803–212,817

    Article  Google Scholar 

  30. Pal U, Wakabayashi T, Kimura F (2007) Handwritten Bangla compound character recognition using gradient feature. In: Procedding of the international conference on information technology (ICIT), Orissa, India, IEEE, pp 208–213

  31. Park J, Woo S, Lee JY et al (2018) BAM: Bottleneck attention module. arXiv:180706514

  32. Purnamawati S, Rachmawati D, Lumanauw G et al (2018) Korean letter handwritten recognition using deep convolutional neural network on android platform. In: Journal of Physics: Conference Series

  33. Rabby ASA, Haque S, Islam S, et al. (2018) Bornonet: Bangla handwritten characters recognition using convolutional neural network. Procedia Comput Sci 143:528–535

    Article  Google Scholar 

  34. Racki D, Tomazevic D, Skocaj D (2018) A compact convolutional neural network for textured surface anomaly detection. In: Procedding of the IEEE winter conference on applications of computer vision, lake tahoe, NV, USA, pp 1331–1339

  35. Rahman MM, Akhand MAH, Islam S et al (2015) Bangla handwritten character recognition using convolutional neural network. International Journal of Image, Graphics and Signal Processing 7(8):42–49

    Article  Google Scholar 

  36. Rahman MM, Islam MS, Sassi R, et al. (2019) Convolutional neural networks performance comparison for handwritten Bengali numerals recognition. SN Appl Sci 1(12):1660

    Article  Google Scholar 

  37. Rownak AF, Rabby MF, Ismail S et al (2016) An efficient way for segmentation of bangla characters in printed document using curved scanning. In: 2016 5th international conference on informatics, electronics and vision (ICIEV), pp 938–943. https://doi.org/10.1109/ICIEV.2016.7760138

  38. Roy S, Das N, Kundu M, et al. (2017) Handwritten isolated Bangla compound character recognition: A new benchmark using a novel deep learning approach. Pattern Recognit Lett 90:15–21

    Article  Google Scholar 

  39. Saufi MM, Zamanhuri MA, Mohammad N et al (2018) Deep learning for roman handwritten character recognition. International Journal of Electrical Engineering and Computer Science 12(2):455–460

    Google Scholar 

  40. Sazal MMR, Biswas SK, Amin MF et al (2014) Bangla handwritten character recognition using deep belief network. In: Procedding of the international conference on electrical information and communication technology, Dhaka, Bangladesh, pp 1–5

  41. Shopon M, Mohammed N, Abedin MA (2016) Bangla handwritten digit recognition using autoencoder and deep convolutional neural network. In: Procedding of the international workshop on computational intelligence (IWCI), pp 64–68

  42. Shopon M, Mohammed N, Abedin MA (2017) Image augmentation by blocky artifact in deep convolutional neural network for handwritten digit recognition. In: Procedding of the IEEE international conference on imaging, vision & pattern recognition, Himeji, Japan, pp 1—6

  43. Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. arXiv:14091556

  44. Tsai C (2016) Recognizing handwritten japanese characters using deep convolutional neural networks. University of Stanford in Stanford, California, pp 405–410

  45. Wang F, Jiang M, Qian C et al (2017) Residual attention network for image classification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, Honolulu, HI, USA, pp 3156–3164

  46. Wikipedia contributors (2020) List of languages by number of native speakers — Wikipedia, the free encyclopedia. https://en.wikipedia.org/w/index.php?title=List_of_languages_by_number_of_native_speakers&oldid=978856845, [Online; accessed 9-October-2020]

  47. Woo S, Park J, Lee JY et al (2018) CBAM: Convolutional block attention module. In: Proceedings of the European conference on computer vision, Munich, Germany, pp 3–19

  48. Zhang Y (2015) Deep convolutional network for handwritten chinese character recognition. Computer Science Department, Stanford University

Download references

Acknowledgments

We thankfully acknowledge the contribution of those volunteers, who participated in writing the characters. We are grateful to Prof. Dr. Ujjal Bhattacharya, Indian Statistical Institute for providing the ISI handwritten numeral dataset. We are also grateful to the University Grants Commission (UGC) of Bangladesh and the authority of Islamic University, Kushtia-7003, Bangldesh for providing partial financial support to develop this dataset.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Md Aktaruzzaman.

Ethics declarations

Conflict of Interests

The authors declare that they have no conflict of interest.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Islam, M.S., Rahman, M.M., Rahman, M.H. et al. RATNet: A deep learning model for Bengali handwritten characters recognition. Multimed Tools Appl 81, 10631–10651 (2022). https://doi.org/10.1007/s11042-022-12070-4

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-022-12070-4

Keywords

Navigation