Abstract
Handwritten Digit recognition is a challenging problem these days due to the widely used Arabic language in the world, especially in the Middle East region. In this paper, sliding windows are used to enhance classification accuracies and implemented using random forests (RF) and support vector machine (SVM) classifiers for recognition of Arabic digit images. In order to study their effectiveness with and without using sliding windows, four different feature extraction techniques have been proposed which includes Mean-based, Gray-Level Co-occurrence Matrix (GLCM), Moment-based, and Edge Direction Histogram (EDH). The obtained accuracies show the significance of using sliding windows for classifying digit. The recognition rates acquired using the modified version of AHDBase dataset are 98% when Mean-based and Moment-based are applied with RF classifier, 98.33% and 99.13% when GLCM and EDH are used with linear-kernel SVM, respectively. Moreover, the performance of this study is compared against recent state-of-the-art approaches, namely Geometric-based, two-dimensional discrete cosine transform, Hierarchical features, Hetero-features, Discrete Fourier Transform and geometrical features, Gabor-based, gradient, structural, and concavity and Local Binary Convolutional Neural Networks.
Similar content being viewed by others
References
AlKhateeb JH, Alseid M (2014) DBN-Based learning for Arabic handwritten digit recognition using DCT features. In: 2014 6th international conference on computer science and information technology (CSIT). IEEE, pp 222–226
Awaida SM, Mahmoud SA (2014) Automatic check digits recognition for Arabic using multi-scale features, HMM and SVM classifiers. Br J Math Comput Sci 4:2521
Bosch A, Zisserman A, Munoz X (2007) Representing shape with a spatial pyramid kernel. In: Proceedings of the 6th ACM international conference on image and video retrieval. ACM, pp 401–408
Boser BE, Guyon IM, Vapnik VN (1992) A training algorithm for optimal margin classifiers. In: Proceedings of the fifth annual workshop on computational learning theory. ACM, pp 144–152
Breiman L (2001) Random forests. Mach Learn 45:5–32
El-Sherif EA, Abdelazeem S (2007) A two-stage system for Arabic handwritten digit recognition tested on a new large database. In: Artificial intelligence and pattern recognition. pp 237–242
Fawcett T (2006) An introduction to ROC analysis. Pattern Recognit Lett 27:861–874
Flusser J, Suk T (2006) Rotation moment invariants for recognition of symmetric objects. IEEE Trans Image Process 15:3784–3790
Gao X, Xiao B, Tao D, Li X (2008) Image categorization: graph edit distance + edge direction histogram. Pattern Recognit 41(10):3179–3191
Ghaleb MH, George LE, Mohammed FG (2013) Numeral handwritten Hindi/Arabic numeric recognition method. Int J Sci Eng Res 4(1):229–518
Gonzalez RC, Woods RE (2008) Digital image processing, 1st edn. Pearson Prentice Hall, Upper Saddle River, NJ
Haghighat M, Zonouz S, Abdel-Mottaleb M (2013) Identification using encrypted biometrics. In: Computer analysis of images and patterns. Springer, pp 440–448
Haralick RM, Shanmugam K et al (1973) Textural features for image classification. IEEE Trans Syst Man Cybern 6:610–621
Hu M-K (1962) Visual pattern recognition by moment invariants. IRE Trans Inf Theory 8:179–187
Juefei-Xu F, Boddeti VN, Savvides M (2016) Local binary convolutional neural networks. ArXiv Prepr. ArXiv: 160806049
Keerthi SS, Shevade SK, Bhattacharyya C, Murthy KRK (2001) Improvements to Platt’s SMO algorithm for SVM classifier design. Neural Comput 13:637–649
Khosravi H, Kabir E (2007) Introducing a very large dataset of handwritten Farsi digits and a study on their varieties. Pattern Recognit Lett 28:1133–1141
Lawgali A (2015) Handwritten digit recognition based on DWT and DCT. Int J Database Theory Appl 8:215–222
LeCun Y (1998) The MNIST database of handwritten digits. Httpyann Lecun Comexdbmnist
LeCun Y, Bottou L, Bengio Y, Haffner P (1998) Gradient-based learning applied to document recognition. Proc IEEE 86:2278–2324
Mahmoud SA, Al-Khatib WG (2011) Recognition of Arabic (Indian) bank check digits using log-gabor filters. Appl Intell 35:445–456
McIntosh A, Hassan S, Hindle A (2019) What can Android mobile app developers do about the energy consumption of machine learning? Empir Softw Eng 24(2):562–601
Mizutani E, Dreyfus SE (2001) On complexity analysis of supervised MLP-learning for algorithmic comparisons. In: International joint conference on neural networks, 2001, IJCNN’01. Proceedings. IEEE, pp 347–352
Mozaffari S, Faez K, Faradji F, Ziaratban M, Golzan SM (2006) A comprehensive isolated Farsi/Arabic character database for handwritten OCR research. In: Tenth international workshop on frontiers in handwriting recognition. Suvisoft
Ojala T, Pietikainen M, Harwood D (1994) Performance evaluation of texture measures with classification based on Kullback discrimination of distributions. In: Proceedings of the 12th IAPR international conference on pattern recognition, vol 1-conference a: computer vision and image processing. IEEE, pp 582–585
Oliva A, Torralba A (2001) Modeling the shape of the scene: a holistic representation of the spatial envelope. Int J Comput Vis 42:145–175
Oshiro TM, Perez PS, Baranauskas JA (2012) How many trees in a random forest? In: MLDM. Springer, pp 154–168
Partio M, Cramariuc B, Gabbouj M, Visa A (2002) Rock texture retrieval using gray level co-occurrence matrix. In: Proceedings of 5th nordic signal processing symposium. Citeseer
Plamondon R, Srihari SN (2000) Online and off-line handwriting recognition: a comprehensive survey. IEEE Trans Pattern Anal Mach Intell 22:63–84
Platt JC (1999) Fast training of support vector machines using sequential minimal optimization. In: Schölkopf B, Burges CJC, Smola AJ (eds) Advances in Kernel methods—support vector learning. MIT Press, Cambridge, pp 185–208
Radwan E (2013) Hybrid of rough neural networks for Arabic/Farsi handwriting recognition. Int J Adv Res Artif Intell 2:39–47
Rashnodi O, Sajedi H, Abadeh MS (2011) Using box approach in persian handwritten digits recognition. Int J Comput Appl 32(3)
Shilbayeh NF, Aqel MM, Alkhateeb R (2013) Recognition offline handwritten Hindi digits using multilayer Perceptron neural networks. World Sci Eng Acad Soc 94–103
Takruri M, Al-Hmouz R, Al-Hmouz A, Momani M (2015) Fuzzy C means based hybrid classifiers for offline recognition of handwritten Indian (Arabic) numerals. Int J Appl Eng Res 10:1911–1924
Trier ØD, Jain AK, Taxt T (1996) Feature extraction methods for character recognition-a survey. Pattern Recognit 29:641–662
Vapnik VN, Chervonenkis AY (1971) On the uniform convergence of relative frequencies of events to their probabilities. Theory Probab Appl 16:264–280
Yang Y, Zhang J, Kisiel B (2003) A scalability analysis of classifiers in text categorization. In: Proceedings of the 26th annual international ACM SIGIR conference on research and development in information retrieval. ACM, pp 96–103
Acknowledgements
The authors would like to thank Universiti Tun Hussein Onn Malaysia (UTHM) and Ministry of Education Malaysia for financially supporting this research under the Fundamental Research Grant Scheme (FRGS), Vote No. 1641.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Al-wajih, E., Ghazali, R. Improving the Accuracy for Offline Arabic Digit Recognition Using Sliding Window Approach. Iran J Sci Technol Trans Electr Eng 44, 1633–1644 (2020). https://doi.org/10.1007/s40998-020-00317-5
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s40998-020-00317-5