Abstract
The rapid volume of digit texts and images motivates researchers to build solid and efficient prediction models to recognize such media. The Arabic language is considered one of the difficult languages regarding the way of writing characters and digits. Recent research focuses on such language for building predictive approaches to recognize written materials. The Arabic (Indian) digit recognition task has been a challenging task and has gained more attention from researchers who build optimal predictive models from historical images that are used in many applications. However, transfer learning approaches exploit deep pre-trained models that could be re-used for similar tasks. So, in this paper, we propose an adapted deep hybrid transfer model developed using two well-known pre-trained convolutional neural networks (CNN) models. These are further adapted by adding recurrent neural networks especially long short-term memory (LSTM) architectures to detect Arabic (Indian) Handwritten Digits (AHD). The CNN model learns the relevant features of Arabic (Indian) digits, while the sequence learning process in the LSTM layers extracts long-term dependence features. The experimental results, using popular datasets, show significant performance obtained by the adapted transfer models with accuracy reached up to 98.92% as well as with precision and recall values at most cases reached to 100% with statistical t test using p-value (\(p\le 0.05\)) compared to baseline methods.
Similar content being viewed by others
References
Abdleazeem S, El-Sherif E (2008) Arabic handwritten digit recognition. IJDAR 11(3):127–141
AlKhateeb JH, Alseid M (2014) DBN-based learning for arabic handwritten digit recognition using DCT features. In: 2014 6th International conference on computer science and information technology (CSIT), pp 222–226. IEEE
Alkhawaldeh RS (2019) DGR: gender recognition of human speech using one-dimensional conventional neural network. Sci Program. https://doi.org/10.1155/2019/7213717
Alkhawaldeh RS, Khawaldeh S, Pervaiz U, Alawida M, Alkhawaldeh H (2019) NIML: non-intrusive machine learning-based speech quality prediction on VoIP networks. IET Commun 13(16):2609–2616
Alom MZ, Taha TM, Yakopcic C, Westberg S, Sidike P, Nasrin MS, Hasan M, Van Essen BC, Awwal AA, Asari VK (2019) A state-of-the-art survey on deep learning theory and architectures. Electronics 8(3):292
Bengio Y et al (2009) Learning deep architectures for ai. Foundations and trends® in Machine Learning 2(1):1–127
Bergstra J, Bengio Y (2012) Random search for hyper-parameter optimization. J Mach Learn Res 13:281–305
Canziani A, Paszke A, Culurciello E (2016) An analysis of deep neural network models for practical applications. arXiv preprint arXiv:1605.07678
Chollet F (2018) Deep Learning mit Python und Keras: Das Praxis-Handbuch vom Entwickler der Keras-Bibliothek. MITP-Verlags GmbH & Co KG, Wachtendonk
Dargan S, Kumar M, Garg A, Thakur K (2019) Writer identification system for pre-segmented offline handwritten devanagari characters using k-NN and SVM. Soft Comput. https://doi.org/10.1007/s00500-019-04525-y
Elleuch M, Maalej R, Kherallah M (2016) A new design based-SVM of the CNN classifier architecture with dropout for offline arabic handwritten recognition. Procedia Comput Sci 80:1712–1723. https://doi.org/10.1016/j.procs.2016.05.512
Goodfellow I, Bengio Y, Courville A (2016) Deep learning. MIT Press, Cambridge
He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 770–778
Hochreiter S (1998) The vanishing gradient problem during learning recurrent neural nets and problem solutions. Int J Uncertain Fuzziness Knowl Based Syst 6(02):107–116
Hochreiter S, Schmidhuber J (1997) Long short-term memory. Neural Comput 9(8):1735–1780
Khawaldeh S, Pervaiz U, Rafiq A, Alkhawaldeh RS (2018) Noninvasive grading of glioma tumor using magnetic resonance imaging with convolutional neural networks. Appl Sci 8(1):27
Kim B, Yuvaraj N, Ramasamy S, Santhosh R, Sabari A (2020) Enhanced pedestrian detection using optimized deep convolution neural network for smart building surveillance. Soft Comput. https://doi.org/10.1007/s00500-020-04999-1
Krizhevsky A, Sutskever I, Hinton G.E (2012) Imagenet classification with deep convolutional neural networks. In: Advances in neural information processing systems, pp 1097–1105
LeCun Y, Bengio Y, Hinton G (2015) Deep learning. Nature 521(7553):436
LeCun Y, Bottou L, Bengio Y, Haffner P et al (1998) Gradient-based learning applied to document recognition. Proc IEEE 86(11):2278–2324
Loey M, El-Sawy A, El-Bakry H (2017) Deep learning autoencoder approach for handwritten arabic digits recognition. arXiv preprint arXiv:1706.06720
Maheshwari M, Namdev D, Maheshwari S (2018) A systematic review of automation in handwritten character recognition. Int J Appl Eng Res 13(10):8090–8099
Mahmoud S (2008) Recognition of writer-independent off-line handwritten Arabic (Indian) numerals using hidden Markov models. Sig Process 88(4):844–857. https://doi.org/10.1016/j.sigpro.2007.10.002
Mahmoud SA (2008) Arabic (Indian) handwritten digits recognition using Gabor-based features. In: 2008 International conference on innovations in information technology, pp 683–687. IEEE
Mudhsh M, Almodfer R (2017) Arabic handwritten alphanumeric character recognition using very deep neural network. Information 8(3):105
Pan SJ, Yang Q (2009) A survey on transfer learning. IEEE Trans Knowl Data Eng 22(10):1345–1359
Perez L, Wang J (2017) The effectiveness of data augmentation in image classification using deep learning. arXiv preprint arXiv:1712.04621
Rawat W, Wang Z (2017) Deep convolutional neural networks for image classification: a comprehensive review. Neural Comput 29(9):2352–2449
Rosa GH, Papa JaP, Yang XS (2018) Handling dropout probability estimation in convolution neural networks using meta-heuristics. Soft Comput 22(18):6147–6156. https://doi.org/10.1007/s00500-017-2678-4
Salameh M (2014) Arabic digits recognition using statistical analysis for end/conjunction points and fuzzy logic for pattern recognition techniques. World Comput Sci Inf Technol J 4(4):50
Selvi P.P, Meyyappan T (2013) Recognition of Arabic numerals with grouping and ungrouping using back propagation neural network. In: 2013 International conference on pattern recognition, informatics and mobile engineering, pp 322–327. IEEE
Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556
Szegedy C, Vanhoucke V, Ioffe S, Shlens J, Wojna Z (2016) Rethinking the inception architecture for computer vision. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2818–2826
Voulodimos A, Doulamis N, Doulamis A, Protopapadakis E (2018) Deep learning for computer vision: a brief review. Comput Intell Neurosci. https://doi.org/10.1155/2018/7068349
Younis KS (2017) Arabic handwritten character recognition based on deep convolutional neural networks. Jordanian J Comput Inf Technol 3(3):186
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The author(s) declare(s) that there is no conflict of interest regarding the publication of this paper.
Funding
This work is not funded.
Ethical approval
This article does not contain any studies with human participants or animals performed by any of the authors.
Additional information
Communicated by V. Loia.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Alkhawaldeh, R.S. Arabic (Indian) digit handwritten recognition using recurrent transfer deep architecture. Soft Comput 25, 3131–3141 (2021). https://doi.org/10.1007/s00500-020-05368-8
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00500-020-05368-8