Skip to main content

Advertisement

Log in

Recognizing arabic handwritten characters using deep learning and genetic algorithms

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Automated techniques for Arabic content recognition are at a beginning period contrasted with their partners for the Latin and Chinese contents recognition. There is a bulk of handwritten Arabic archives available in libraries, data centers, historical centers, and workplaces. Digitization of these documents facilitates (1) to preserve and transfer the country’s history electronically, (2) to save the physical storage space, (3) to proper handling of the documents, and (4) to enhance the retrieval of information through the Internet and other mediums. Arabic handwritten character recognition (AHCR) systems face several challenges including the unlimited variations in human handwriting and the leakage of large and public databases. In the current study, the segmentation and recognition phases are addressed. The text segmentation challenges and a set of solutions for each challenge are presented. The convolutional neural network (CNN), deep learning approach, is used in the recognition phase. The usage of CNN leads to significant improvements across different machine learning classification algorithms. It facilitates the automatic feature extraction of images. 14 different native CNN architectures are proposed after a set of try-and-error trials. They are trained and tested on the HMBD database that contains 54,115 of the handwritten Arabic characters. Experiments are performed on the native CNN architectures and the best-reported testing accuracy is 91.96%. A transfer learning (TF) and genetic algorithm (GA) approach named “HMB-AHCR-DLGA” is suggested to optimize the training parameters and hyperparameters in the recognition phase. The pre-trained CNN models (VGG16, VGG19, and MobileNetV2) are used in the later approach. Five optimization experiments are performed and the best combinations are reported. The highest reported testing accuracy is 92.88%.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16
Fig. 17
Fig. 18
Fig. 19

Similar content being viewed by others

References

  1. Abuhaiba IS, Holt MJ, Datta S (1998) Recognition of off-line cursive handwriting. Comput Vis Image Underst 71(1):19–38

    Article  Google Scholar 

  2. Acharya S, Pant AK, Gyawali PK (2015) Deep learning based large scale handwritten devanagari character recognition. In: 2015 9th International conference on software, knowledge, information management and applications (SKIMA). IEEE, pp 1–6

  3. Ahmed R et al (2020) Offline arabic handwriting recognition using deep machine learning: A review of recent advances. In: Ren J et al (eds) Advances in brain inspired cognitive systems. Springer International Publishing, Cham, pp 457–468

  4. Akhand M, Ahmed M, Rahman MH (2016) Convolutional neural network based handwritten bengali and bengali-english mixed numeral recognition. Int J image Graph Signal Process 8(9):40

    Article  Google Scholar 

  5. Al-Helali BM, Mahmoud SA (2017) Arabic online handwriting recognition (aohr): A survey. ACM Comput Surv 50(3)

  6. Al-Shaher AA, Hancock ER (2003) Learning mixtures of point distribution models with the em algorithm. Pattern Recogn 36(12):2805–2818

    Article  MATH  Google Scholar 

  7. Al-Taani AT, Al-Haj S (2010) Recognition of on-line arabic handwritten characters using structural features. J Pattern Recognit Res 5(1):23–37

    Article  Google Scholar 

  8. AlKhateeb JH, Ren J, Jiang J, Al-Muhtaseb H (2011) Offline handwritten arabic cursive text recognition using hidden markov models and re-ranking. Pattern Recogn Lett 32(8):1081–1088

    Article  Google Scholar 

  9. Ali AAA, M Suresha (2019) Arabic handwritten character recognition using machine learning approaches. In: 2019 Fifth international conference on image information processing (ICIIP). pp 187–192

  10. Ali A, Zhu Y, Chen Q, Yu J, Cai H (2019) Leveraging spatio-temporal patterns for predicting citywide traffic crowd flows using deep hybrid neural networks. In: 2019 IEEE 25th international conference on parallel and distributed systems (ICPADS). IEEE, pp 125–132

  11. Ali A, Zhu Y, Zakarya M (2021) A data aggregation based approach to exploit dynamic spatio-temporal correlations for citywide crowd flows prediction in fog computing. Multimed Tools Appl :1–33

  12. Althobaiti H, Chao Lu (2017) A survey on arabic optical character recognition and an isolated handwritten arabic character recognition algorithm using encoded freeman chain code. In: 2017 51st Annual conference on information sciences and systems (CISS). pp 1–6

  13. Altwaijry N, Al-Turaiki I (2020) Arabic handwriting recognition system using convolutional neural network. Neural Comput Appl

  14. Amin A (1997) Arabic character recognition. In: Handbook of character recognition and document image analysis. World Scientific. pp 397–420

  15. Amin A (2003) Recognition of hand-printed characters based on structural description and inductive logic programming. Pattern Recognit Lett 24 (16):3187–3196

    Article  Google Scholar 

  16. Athoillah M, Putri RK (2019) Handwritten arabic numeral character recognition using multi kernel support vector machine. KINETIK: Game technology, information system, computer network, computing, electronics, and control 4(2):99–106

    Article  Google Scholar 

  17. Bafjaish SS, Azmi MS, Al-Mhiqani MN, Sheikh AA (2020) Skew correction for mushaf al-quran: a review. Indones J Electr Eng Comput Sci 17(1):516–523

    Google Scholar 

  18. Bai J, Chen Z, Feng B, Xu B (2014) Image character recognition using deep convolutional neural network learned from different languages. In: 2014 IEEE International conference on image processing (ICIP). IEEE, pp 2560–2564

  19. Balaha HM, Ali HA, Badawy M (2020) Automatic recognition of handwritten arabic characters: a comprehensive review. Neural Comput Appl :1–24

  20. Balaha HM, Ali HA, Saraya M, Badawy M (2020) A new arabic handwritten character recognition deep learning system (ahcr-dls). Neural Comput Appl :1–43

  21. Baldi P (2011) Autoencoders, unsupervised learning and deep architectures. In: Proceedings of the 2011 international conference on unsupervised and transfer learning workshop - Volume 27, UTLW’11. (JMLR.org), pp 37–50

  22. Bengio Y, Ca M (2015) Rmsprop and equilibrated adaptive learning rates for nonconvex optimization. arXiv:1502.04390

  23. Bernardin K, Stiefelhagen R (2008) Evaluating multiple object tracking performance: the clear mot metrics. EURASIP J Image Vid Process 2008:1–10

    Article  Google Scholar 

  24. Bisong E (2019) Google colaboratory. In: Building machine learning and deep learning models on google cloud platform. Springer, pp 59–64

  25. Bradski G (2008) Kaehler A. O’Reilly Media, Inc., Newton

    Google Scholar 

  26. Changming S, Deyi S (1997) Skew and slant correction for document images using gradient direction. In: Proceedings of the fourth international conference on document analysis and recognition. vol 1, pp 142–146

  27. Chollet F (2016) Xception: Deep learning with depthwise separable convolutions. arXiv:1610.02357

  28. Christlein V et al (2019) Deep generalized max pooling. In: 2019 International conference on document analysis and recognition (ICDAR). pp 1090–1096

  29. Chua LO, Roska T (1993) The cnn paradigm. IEEE Trans Circ Syst I Fund Theory Appl 40(3):147–156

    Article  MATH  Google Scholar 

  30. Ciregan D, Meier U, Schmidhuber J (2012) Multi-column deep neural networks for image classification. In: 2012 IEEE conference on computer vision and pattern recognition. IEEE, pp 3642–3649

  31. Cogswell M, Ahmed F, Girshick R, Zitnick L, Batra D (2015) Reducing overfitting in deep networks by decorrelating representations. arXiv:1511.06068

  32. Dai Y et al (2018) Fused text segmentation networks for multi-oriented scene text detection. In: 2018 24th international conference on pattern recognition (ICPR). pp 3604–3609

  33. Deng J et al (2009) Imagenet: A large-scale hierarchical image database. In: 2009 IEEE Conference on computer vision and pattern recognition. pp 248–255

  34. Duan K, Keerthi SS, Chu W, Shevade SK, Poo AN (2003) Multi-category classification by soft-max combination of binary classifiers. In: International workshop on multiple classifier systems. Springe), pp 125–134

  35. El-Desouky A, Salem M, El-Gwad AA, Arafat H (1991) A handwritten arabic character recognition technique for machine reader. In: Third international conference on software engineering for real time systems, (IET). pp 212–216

  36. El-Sawy A, EL-Bakry H, Loey M (2017) Cnn for handwritten arabic digits recognition based on lenet-5. In: Hassanien AE, Shaalan K, Gaber T, Azar AT, Tolba MF (eds) Proceedings of the international conference on advanced intelligent systems and informatics 2016. Springer International Publishing, Cham, pp 566–575

  37. ElAdel A, Ejbali R, Zaied M, Amar CB (2015) Dyadic multi-resolution analysis-based deep learning for arabic handwritten character classification. In: 2015 IEEE 27th International conference on tools with artificial intelligence (ICTAI). IEEE, pp 807–812

  38. Elarian Y, Ahmad I, Awaida S, Al-Khatib W, Zidouri A (2015) Arabic ligatures: analysis and application in text recognition. In: 2015 13th International conference on document analysis and recognition (ICDAR). IEEE, pp 896–900

  39. Elarian Y, Ahmad I, Zidouri A, Al-Khatib WG (2019) Lucidah ligative and unligative characters in a dataset for arabic handwriting. Int J Adv Comput Sci Appl 10(8)

  40. Elleuch M, Tagougui N, Kherallah M (2015) Arabic handwritten characters recognition using deep belief neural networks. In: 2015 IEEE 12th International multi-conference on systems, signals devices (SSD15). pp 1–5

  41. Elzobi M, Al-Hamadi A, Al Aghbari Z, Dings L (2013) Iesk-ardb: a database for handwritten arabic and an optimized topological segmentation approach. Int J Doc Anal Recognit (IJDAR) 16(3):295–308

    Article  Google Scholar 

  42. Farooq F, Venu Govindaraju, Perrone M (2005) Pre-processing methods for handwritten arabic documents. In: Eighth international conference on document analysis and recognition (ICDAR’05), vol 1. pp 267–271

  43. Firdaus FI, Khumaini A, Utaminingrum F (2017) Arabic letter segmentation using modified connected component labeling. In: 2017 International conference on sustainable information engineering and technology (SIET). pp 392–397

  44. Gardiner AH (1916) The egyptian origin of the semitic alphabet. J Egypt Archaeol 3(1):1–16

    Article  Google Scholar 

  45. Genlin J (2004) Survey on genetic algorithm [j]. Comput Appl Softw 2(1):69–73

    Google Scholar 

  46. Goodfellow I, Bengio Y, Courville A (2016) Deep learning. MIT Press, Cambridge

    MATH  Google Scholar 

  47. Grahl J, Bosman PA, Rothlauf F (2006) The correlation-triggered adaptive variance scaling idea. In: Proceedings of the 8th annual conference on Genetic and evolutionary computation. pp 397–404

  48. Ham FM, Han S (1996) Classification of cardiac arrhythmias using fuzzy artmap. IEEE Trans Biomed Eng 43(4):425–429

    Article  Google Scholar 

  49. Hämäläinen W (2006) Class np, np-complete, and np-hard problems

  50. Hara K, Saito D, Shouno H (2015) Analysis of function of rectified linear unit used in deep learning. In: 2015 international joint conference on neural networks (IJCNN). IEEE, pp 1–8

  51. He M, Zhang S, Mao H, Jin L (2015) Recognition confidence analysis of handwritten chinese character with cnn. In: 2015 13th International conference on document analysis and recognition (ICDAR). IEEE, pp 61–65

  52. He K, Zhang X, Ren S, Sun J (2015) Deep residual learning for image recognition. arXiv:1512.03385

  53. Hesterman JY, Caucci L, Kupinski MA, Barrett HH, Furenlid LR (2010) Maximum-likelihood estimation with a contracting-grid search algorithm. IEEE Trans Nuclear Sci 57(3):1077–1084

    Article  Google Scholar 

  54. Heutte L, Moreau JV, Paquet T, Lecourtier Y, Olivier C (1996) Combining structural and statistical features for the recognition of handwritten characters. In: Proceedings of 13th International conference on pattern recognition. IEEE, vol 2, pp 210–214

  55. Hifny Y (2019) Open vocabulary arabic diacritics restoration. IEEE Signal Process Lett 26(10):1421–1425

    Article  Google Scholar 

  56. Hinton GE (2009) Deep belief networks. Scholarpedia 4(5):5947

    Article  Google Scholar 

  57. Hirose Y, Yamashita K, Hijiya S (1991) Back-propagation algorithm which varies the number of hidden units. Neural Netw 4(1):61–66

    Article  Google Scholar 

  58. Howard AG, et al. (2017) Mobilenets: Efficient convolutional neural networks for mobile vision applications. arXiv:1704.04861

  59. Huang G, Liu Z, Weinberger KQ (2016) Densely connected convolutional networks. arXiv:1608.06993

  60. Hubara I, Courbariaux M, Soudry D, El-Yaniv R, Bengio Y (2016) Binarized neural networks. In: Advances in neural information processing systems. pp 4107–4115

  61. Jain AK, Duin RPW, Mao J (2000) Statistical pattern recognition: A review. IEEE Trans Pattern Anal Mach Intell 22(1):4–37

    Article  Google Scholar 

  62. Javed F (2013) Arabic and english phonetics: A comparative study. Criterion 4(4):1–13

    Google Scholar 

  63. Jiang Y et al (2018) Expert feature-engineering vs. deep neural networks: which is better for sensor-free affect detection?. In: International conference on artificial intelligence in education. Springer, pp 198–211

  64. Junker M, Hoch R, Dengel A (1999) On the evaluation of document analysis components by recall, precision, and accuracy. In: Proceedings of the fifth international conference on document analysis and recognition. ICDAR’99 (Cat. No. PR00318). IEEE, pp 713–716

  65. Kar R, et al. (2019) Novel approaches towards slope and slant correction for tri-script handwritten word images. Imaging Sci J 67(3):159–170

    Article  Google Scholar 

  66. Karim A, Mahdi B, Abdullah A (2019) Writer identification based on arabic handwriting recognition by using speed up robust feature and k- nearest neighbor classification. J Univ Babylon Pure Appl Sci 27:1–10

    Google Scholar 

  67. Kaye AS (2003) Arabic. In: The major languages of South Asia, the Middle East and Africa. (Routledge), pp 144–161

  68. Kef M, Chergui L, Chikhi S (2016) A novel fuzzy approach for handwritten arabic character recognition. Pattern Anal Appl 19(4):1041–1056

    Article  MathSciNet  Google Scholar 

  69. Khuman YLK, Devi HM, Singh NA (2021) Entropy-based skew detection and correction for printed meitei/meetei script ocr system. Mater Toda Proc 37:2666–2669

    Article  Google Scholar 

  70. Kim Y (2014) Convolutional neural networks for sentence classification. arXiv:1408.5882

  71. Kingma DP, Ba J (2014) Adam: A method for stochastic optimization. arXiv:1412.6980

  72. Kumar T, Verma K (2010) A theory based on conversion of rgb image to gray image. Int J Comput Appl 7(2):7–10

    Google Scholar 

  73. Lamtougui H, Moubtahij HE, Fouadi H, Yahyaouy A, Satori K (2020) Offline arabic handwriting recognition using deep learning: Comparative study. In: 2020 International conference on intelligent systems and computer vision (ISCV). pp 1–8

  74. Lawgali A, Angelova M, Bouridane A (2013) Hacdb: Handwritten arabic characters database for automatic character recognition. In: European workshop on visual information processing (EUVIP). IEEE, pp 255–259

  75. Lawgali A, Angelova M, Bouridane A (2014) A framework for arabic handwritten recognition based on segmentation. Int J Hybrid Inf Technol 7:413–428

    Google Scholar 

  76. LeCun Y, Bengio Y et al (1995) Convolutional networks for images, speech, and time series. In: The handbook of brain theory and neural networks, vol 3361, p 1995

  77. Liang Y, Wang J, Zhou S, Gong Y, Zheng N (2016) Incorporating image priors with deep convolutional neural networks for image super-resolution. Neurocomputing 194:340–347

    Article  Google Scholar 

  78. Liu B, et al. (2017) Supervised deep feature extraction for hyperspectral image classification. IEEE Trans Geosci Remote Sens 56(4):1909–1921

    Article  Google Scholar 

  79. Lu J et al (2015) Transfer learning using computational intelligence: A survey. 25th anniversary of Knowledge-Based Systems, vol 80, pp 14–23

  80. Maitra DS, Bhattacharya U, Parui SK (2015) Cnn based common approach to handwritten character recognition of multiple scripts. In: 2015 13th International conference on document analysis and recognition (ICDAR). IEEE, pp 1021–1025

  81. Marler RT, Arora JS (2010) The weighted sum method for multi-objective optimization: new insights. Struct Multidiscipl Optim 41(6):853–862

    Article  MathSciNet  MATH  Google Scholar 

  82. McMahan HB et al (2013) Ad click prediction: a view from the trenches. In: Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining. pp 1222–1230

  83. Miikkulainen R, et al. (2019) Evolving deep neural networks. In: Artificial intelligence in the age of neural networks and brain computing. Elsevier, pp 293–312

  84. Mirjalili S (2019) Genetic algorithm. In: Evolutionary algorithms and neural networks. (Springer), pp 43–55

  85. Motawa D, Amin A, Sabourin R (1997) Segmentation of arabic cursive script. In: Proceedings of the fourth international conference on document analysis and recognition, vol 2. pp 625–628

  86. Motwani MC, Gadiya MC, Motwani RC, Harris FC Jr (2004) Survey of image denoising techniques

  87. Mozaffari S, Faez K, Ziaratban M (2005) Character representation and recognition using quad tree-based fractal encoding scheme. In: Eighth international conference on document analysis and recognition (ICDAR’05). vol 2, pp 819–823

  88. Mukkamala MC, Hein M (2017) Variants of rmsprop and adagrad with logarithmic regret bounds. arXiv:1706.05507

  89. Nair V, Hinton GE (2010) Rectified linear units improve restricted boltzmann machines. In: Proceedings of the 27th international conference on machine learning (ICML-10). pp 807–814

  90. Naz S (2015) Segmentation techniques for recognition of arabic-like scripts: A comprehensive survey. Springer J Educ Inf Technol :21

  91. Nwankpa C, Ijomah W, Gachagan A, Marshall S (2018) Activation functions: Comparison of trends in practice and research for deep learning. arXiv:1811.03378

  92. Pak I, Teh PL (2018). In: Zelinka I Vasant P Duy VH (ed) Text segmentation techniques: a critical review. Springer International Publishing, Cham, pp 167–181

  93. Pan SJ, Yang Q (2010) A survey on transfer learning. IEEE Trans Knowl Data Eng 22(10):1345–1359

    Article  Google Scholar 

  94. Parvez MT, Mahmoud SA (2013) Offline arabic handwritten text recognition: a survey. ACM Comput Surv (CSUR) 45(2):1–35

    Article  MATH  Google Scholar 

  95. Pechwitz M, et al. (2002) Ifn/enit-database of handwritten arabic words. In: Proceedings of CIFED. vol 2, Citeseer, pp 127–136

  96. Pedamonti D (2018) Comparison of non-linear activation functions for deep neural networks on mnist classification task. arXiv:1804.02763

  97. Perez L, Wang J (2017) The effectiveness of data augmentation in image classification using deep learning. arXiv:1712.04621

  98. Poon PW, Carter JN (1995) Genetic algorithm crossover operators for ordering applications. Comput Oper Res 22(1):135–147

    Article  MATH  Google Scholar 

  99. Pratihar DK (2013) Soft computing: fundamentals and applications. Alpha Science International Ltd, Oxford

    Google Scholar 

  100. Retsö J (2013) What is arabic. In: The Oxford handbook of Arabic linguistics, vol 422, p 450

  101. Ruder S (2016) An overview of gradient descent optimization algorithms. arXiv:1609.04747

  102. Russakovsky O et al (2015) Imagenet large scale visual recognition challenge. Int J Comput Vis 115(3):211–252

    Article  MathSciNet  Google Scholar 

  103. Sahlol A, Suen C (2014) A novel method for the recognition of isolated handwritten arabic characters

  104. Sandler M, Howard AG, Zhu M, Zhmoginov A, Chen L (2018) Inverted residuals and linear bottlenecks: Mobile networks for classification, detection and segmentation. arXiv:1801.04381

  105. Saravanan C (2010) Color image to grayscale image conversion. In: 2010 second international conference on computer engineering and applications. IEEE, vol 2, pp 196–199

  106. Scherer D, Müller A, Behnke S (2010) Evaluation of pooling operations in convolutional architectures for object recognition. In: International conference on artificial neural networks. Springer, pp 92–101

  107. Schmitt LM (2001) Theory of genetic algorithms. Theor Comput Sci 259(1-2):1–61

    Article  MathSciNet  MATH  Google Scholar 

  108. Schuster M, Paliwal KK (1997) Bidirectional recurrent neural networks. IEEE Trans Signal Process 45(11):2673–2681

    Article  Google Scholar 

  109. Set T (2021) Precision recall f1-score auc accuracy, vol 16, p 12

  110. Shams M, Elsonbaty AZW (2020) Arabic handwritten character recognition based on convolution neural networks and support vector machine. Int J Adv Comput Sci Appl 11(6)

  111. Shams M, Elsonbaty A, ElSawy W (2020) Arabic handwritten character recognition based on convolution neural networks and support vector machine. Int J Adv Comput Sci Appl 11(8)

  112. Simard PY, Steinkraus D, Platt JC (2003) Best practices for convolutional neural networks applied to visual document analysis. In: Icdar. vol. 3

  113. Simonyan K, Zisserman A (2015) Very deep convolutional networks for large-scale image recognition. In: Bengio Y, LeCun Y (eds) 3rd International conference on learning representations, ICLR 2015, San Diego, CA, USA, May 7-9, 2015, Conference Track Proceedings

  114. Singh P, Verma A, Chaudhari NS (2016) Deep convolutional neural network classifier for handwritten devanagari character recognition. Springer, Berlin, pp 551–561

    Google Scholar 

  115. Slavik P, Govindaraju V (2001) Equivalence of different methods for slant and skew corrections in word recognition applications. IEEE Trans Pattern Anal Mach Intell 23(3):323–326

    Article  Google Scholar 

  116. Soman ST, Nandigam A, Chakravarthy VS (2013) An efficient multiclassifier system based on convolutional neural network for offline handwritten telugu character recognition. In: 2013 National Conference on Communications (NCC). IEEE, pp 1–5

  117. Sonka M, Hlavac V, Boyle R (1993) Image pre-processing. Springer US, Boston, pp 56–111

    Google Scholar 

  118. Srivastava N, Hinton G, Krizhevsky A, Sutskever I, Salakhutdinov R (2014) Dropout: A simple way to prevent neural networks from overfitting. J Mach Learn Res 15(56):1929–1958

    MathSciNet  MATH  Google Scholar 

  119. Tato A, Nkambou R (2018) Improving adam optimizer

  120. Vorontsov E, Trabelsi C, Kadoury S, Pal C (2017) On orthogonality and learning recurrent networks with long term dependencies. arXiv:1702.00071

  121. Waller RA, Duncan DB (1969) A bayes rule for the symmetric multiple comparisons problem. J Am Stat Assoc 64(328):1484–1503

    MathSciNet  Google Scholar 

  122. Ward R, Wu X, Bottou L (2019) Adagrad stepsizes: Sharp convergence over nonconvex landscapes. In: International conference on machine learning. (PMLR), pp 6677–6686

  123. Whitley D (1994) A genetic algorithm tutorial. Stat Comput 4 (2):65–85

    Article  Google Scholar 

  124. Witten IH, Bell TC, Emberson H, Inglis S, Moffat A (1994) Textual image compression: Two-stage lossy/lossless encoding of textual images. Proc IEEE 82(6):878–888

    Article  Google Scholar 

  125. Wright W, Caspari CP (2011) A grammar of the Arabic language. Cosimo Inc., New York

    Google Scholar 

  126. Wu C, Fan W, He Y, Sun J, Naoi S (2014) Handwritten character recognition by alternately trained relaxation convolutional neural network. In: 2014 14th International conference on frontiers in handwriting recognition. IEEE, pp 291–296

  127. Wu H, Gu X (2015) Towards dropout training for convolutional neural networks. Neural Netw 71:1–10

    Article  Google Scholar 

  128. Wu V, Manmatha R, Riseman EM (1999) Textfinder: an automatic system to detect and recognize text in images. IEEE Trans Pattern Anal Mach Intell 21(11):1224–1229

    Article  Google Scholar 

  129. Xiang T, Wang J, Liao X (2007) An improved particle swarm optimizer with momentum. In: 2007 IEEE congress on evolutionary computation. IEEE, pp 3341–3345

  130. Xiong H, Pandey G, Steinbach M, Kumar V (2006) Enhancing data analysis with noise removal. IEEE Trans Knowl Data Eng 18(3):304–319

    Article  Google Scholar 

  131. Yang W, Jin L, Xie Z, Feng Z (2015) Improved deep convolutional neural network for online handwritten chinese character recognition using domain-specific knowledge. In: 2015 13th international conference on document analysis and recognition (ICDAR). IEEE, pp 551–555

  132. Yuan A, Bai G, Jiao L, Liu Y (2012) Offline handwritten english character recognition based on convolutional neural network. In: 2012 10th IAPR International workshop on document analysis systems. IEEE, pp 125–129

  133. Zeiler MD (2012) Adadelta: an adaptive learning rate method. arXiv:1212.5701

  134. Zhang C, Vinyals O, Munos R, Bengio S (2018) A study on overfitting in deep reinforcement learning. arXiv:1804.06893

  135. Zhong Z, Jin L, Feng Z (2015) Multi-font printed chinese character recognition using multi-pooling convolutional neural network. In: 2015 13th International conference on document analysis and recognition (ICDAR). IEEE, pp 96–100

  136. Zhong Z, Jin L, Xie Z (2015) High performance offline handwritten chinese character recognition using googlenet and directional feature maps. In: 2015 13th International conference on document analysis and recognition (ICDAR). IEEE, pp 846–850

  137. Zhuang F et al (2020) A comprehensive survey on transfer learning. Proc IEEE :1–34

  138. Zoph B, Vasudevan V, Shlens J, Le QV (2017) Learning transferable architectures for scalable image recognition. arXiv:1707.07012

Download references

Funding

No funding was received for this work.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hossam Magdy Balaha.

Ethics declarations

Conflict of Interests

No conflict of interest exists. We wish to confirm that, there are no known conflicts of interest associated with this publication and there has been no significant financial support for this work that could have influenced its outcome.

Intellectual Property

We confirm that, we have given due consideration to the protection of intellectual property associated with this work and that there are no impediments to publication, including the timing of publication, with respect to intellectual property. In so doing we confirm that we have followed the regulations of our institutions concerning intellectual property.

Authorship

We confirm that the manuscript has been read and approved by all named authors. We confirm that the order of authors listed in the manuscript has been approved by all named authors.

Contact with the Editorial Office

The “Corresponding Author” is declared on the title page. This author submitted this manuscript using his account in the editorial submission system. (A) We understand that the “Corresponding Author” is the sole contact for the editorial process (including the editorial submission system and direct communications with the office). He is responsible for communicating with the other authors about progress, submissions of revisions, and final approval of proofs. (B) We confirm that the email address shown below is accessible by the “Corresponding Author”, is the address to which “Corresponding Author”’s editorial submission system account is linked, and has been configured to accept email from the editorial office (Email: hossam.m.balaha@mans.edu.eg).

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Author agreement statement

We the undersigned declare that this manuscript is original, has not been published before, and is not currently being considered for publication elsewhere. We confirm that the manuscript has been read and approved by all named authors and that there are no other persons who satisfied the criteria for authorship but are not listed. We further confirm that the order of authors listed in the manuscript has been approved by all of us. We understand that the “Corresponding Author” is the sole contact for the editorial process. He is responsible for communicating with the other authors about progress, submissions of revisions, and final approval of proofs.

Appendix A

Appendix A

1.1 A. 1 Table of abbreviations

Table 12 presents the abbreviations.

Table 12 Table of abbreviations

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Balaha, H.M., Ali, H.A., Youssef, E.K. et al. Recognizing arabic handwritten characters using deep learning and genetic algorithms. Multimed Tools Appl 80, 32473–32509 (2021). https://doi.org/10.1007/s11042-021-11185-4

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-021-11185-4

Keywords

Navigation