Abstract
Various Optical Character Recognition (OCR) methods, especially, machine learning models, work towards the solution of recognizing patterns in intelligent ways from data that is originally not available in digital format. These patterns are converted into data that a machine can recognize (by the proper algorithm) and can further manipulate for various manipulations. The basic characteristics of the implementation presented here are based on a balance between the complexity of the algorithm applied and the highest precision that can be obtained. In this paper, we attempt to recognize precise patterns from a set of handwritten characters without making the implementation intractably complex. Specifically, in the context of the Urdu language spoken and written by more than a hundred million people around the world, very little exploration has been carried out. This paper proposes the application of an established machine learning model, namely, using the Support Vector Machine (SVM) algorithm; and investigates the efficiency of its application for recognizing handwritten characters of the Urdu language, a subject that has never been investigated before.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Garcia, M.M.: The Urdu language reforms. Studies 26, 97 (2014)
Géron, A.: Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow: Concepts, Tools, and Techniques to Build Intelligent Systems. (O'Reilly Media, California, 2019), pp. 145
Kadhm, M.S., Hassan, A.K.A.: Handwriting word recognition based on SVM classifier. Int. J. Adv. Comput. Sci. Appl. 1, 64–68 (2015)
Garreta, R., Moncecchi, G.: Learning Scikit-Learn: Machine Learning in Python. (Packt Publishing Ltd, Birmingham, 2013), pp. 25
Bernhard, B.E., Guyon, I.M., Vapnik, V.N.: A training algorithm for optimal margin classifiers. In: COLT ’92: Proceedings of the Fifth Annual Workshop on Computational Learning Theory, ACM Press, New York, NY, USA, pp. 144–152 (1992)
Thome, A.C.G.: SVM Classifiers concepts and applications to character recognition. Advances in Character Recognition, Xiaoqing Ding, IntechOpen, (2012). https://doi.org/10.5772/52009
Liu, C.-L., Nakashima, K., Sako, H., Fujisawa, H.: Handwritten digit recognition: benchmarking of state-of-the-art techniques. Pattern Recogn. 36(10), 2271–2285 (2003)
Suen, C.Y., Kiu, K., Strathy, N.W.: Sorting and recognizing cheques and financial documents, document analysis systems: theory and practice. In: Lee, S.-W., Nakano, Y. (eds.) LNCS 1655, pp. 173–187. Springer, (1999)
Liu, C.-L., Nakashima, K., Sako, H., Fujisawa, H.: Handwritten digit recognition: investigation of normalization and feature extraction techniques. Pattern Recogn. 37(2), 265–279 (2004)
Ahmad, A.R., Viard-Gaudin, C., Khalid, M., Yusof, R.: Online handwriting recognition using support vector machine. In: Proceedings of the Second International Conference on Artificial Intelligence in Engineering & Technology, Kota Kinabalu, Sabah, Malaysia, (2004)
Pal, U., Chanda, S., Wakabayashi, T., Kimura, F.: Accuracy improvement of Devnagari character recognition combining SVM and MQDF (2008)
Ahmad, A.R., Viard-Gaudin, C., Khalid, M.: Lexicon-based word recognition using support vector machine and hidden Markov model. In: 10th International Conference on Document Analysis and Recognition, (2009)
Arora, S., et al.: Performance comparison of SVM and ANN for handwritten Devnagari character recognition. Int. J. Comput. Sci. 7(3), (2010)
Husnain, M., Missen, M.M.S., Mumtaz, S., Luqman, M.M., Coustaty, M., Ogier, J.-M.: Visualization of high-dimensional data by pairwise fusion matrices using t-SNE. Symmetry 11, 107 (2019)
Scikit-learn https://scikit-learn.org/stable/modules/generated/sklearn.metrics.confusion_matrix.html. Accessed 19 June 2020
Scikit-learn https://scikit-learn.org/stable/modules/model_evaluation.html#classification-report. Accessed 19 June 2020
Jain, A., Jain, M., Jain, G., Tayal, D.K.: “UTTAM”: an efficient spelling correction system for hindi language based on supervised learning. ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP) 18(1), 8 (2019)
Vij, S., Jain, A., Tayal, D., Castillo, O.: Fuzzy logic for inculcating significance of semantic relations in word sense disambiguation using a WordNet graph. Int. J. Fuzzy Syst. 20(2), 444–459 (2018)
Acknowledgements
This research was supported by the National Research, Development, and Innovation Office (Hungary), grant nr. K124055.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this chapter
Cite this chapter
Zargar, H., Almahasneh, R., Kóczy, L.T. (2022). Automatic Recognition of Handwritten Urdu Characters. In: Harmati, I.Á., Kóczy, L.T., Medina, J., Ramírez-Poussa, E. (eds) Computational Intelligence and Mathematics for Tackling Complex Problems 3. Studies in Computational Intelligence, vol 959. Springer, Cham. https://doi.org/10.1007/978-3-030-74970-5_19
Download citation
DOI: https://doi.org/10.1007/978-3-030-74970-5_19
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-74969-9
Online ISBN: 978-3-030-74970-5
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)