Automatic Recognition of Handwritten Urdu Characters

Zargar, Hisham; Almahasneh, Ruba; Kóczy, László T.

doi:10.1007/978-3-030-74970-5_19

Hisham Zargar⁶,
Ruba Almahasneh⁶ &
László T. Kóczy⁶

Part of the book series: Studies in Computational Intelligence ((SCI,volume 959))

260 Accesses
3 Citations

Abstract

Various Optical Character Recognition (OCR) methods, especially, machine learning models, work towards the solution of recognizing patterns in intelligent ways from data that is originally not available in digital format. These patterns are converted into data that a machine can recognize (by the proper algorithm) and can further manipulate for various manipulations. The basic characteristics of the implementation presented here are based on a balance between the complexity of the algorithm applied and the highest precision that can be obtained. In this paper, we attempt to recognize precise patterns from a set of handwritten characters without making the implementation intractably complex. Specifically, in the context of the Urdu language spoken and written by more than a hundred million people around the world, very little exploration has been carried out. This paper proposes the application of an established machine learning model, namely, using the Support Vector Machine (SVM) algorithm; and investigates the efficiency of its application for recognizing handwritten characters of the Urdu language, a subject that has never been investigated before.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 149.00; Price excludes VAT (USA)

Softcover Book: USD 199.99; Price excludes VAT (USA)

Hardcover Book: USD 199.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Optical Character Recognition Using Minimal Complexity Machine and Its Comparison with Existing Classifiers

Off-line Odia Handwritten Character Recognition: A Hybrid Approach

Automatic Text Recognition from Image Dataset Using Optical Character Recognition and Deep Learning Techniques

References

Garcia, M.M.: The Urdu language reforms. Studies 26, 97 (2014)
Google Scholar
Géron, A.: Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow: Concepts, Tools, and Techniques to Build Intelligent Systems. (O'Reilly Media, California, 2019), pp. 145
Google Scholar
Kadhm, M.S., Hassan, A.K.A.: Handwriting word recognition based on SVM classifier. Int. J. Adv. Comput. Sci. Appl. 1, 64–68 (2015)
Google Scholar
Garreta, R., Moncecchi, G.: Learning Scikit-Learn: Machine Learning in Python. (Packt Publishing Ltd, Birmingham, 2013), pp. 25
Google Scholar
Bernhard, B.E., Guyon, I.M., Vapnik, V.N.: A training algorithm for optimal margin classifiers. In: COLT ’92: Proceedings of the Fifth Annual Workshop on Computational Learning Theory, ACM Press, New York, NY, USA, pp. 144–152 (1992)
Google Scholar
Thome, A.C.G.: SVM Classifiers concepts and applications to character recognition. Advances in Character Recognition, Xiaoqing Ding, IntechOpen, (2012). https://doi.org/10.5772/52009
Article Google Scholar
Liu, C.-L., Nakashima, K., Sako, H., Fujisawa, H.: Handwritten digit recognition: benchmarking of state-of-the-art techniques. Pattern Recogn. 36(10), 2271–2285 (2003)
Article Google Scholar
Suen, C.Y., Kiu, K., Strathy, N.W.: Sorting and recognizing cheques and financial documents, document analysis systems: theory and practice. In: Lee, S.-W., Nakano, Y. (eds.) LNCS 1655, pp. 173–187. Springer, (1999)
Google Scholar
Liu, C.-L., Nakashima, K., Sako, H., Fujisawa, H.: Handwritten digit recognition: investigation of normalization and feature extraction techniques. Pattern Recogn. 37(2), 265–279 (2004)
Article Google Scholar
Ahmad, A.R., Viard-Gaudin, C., Khalid, M., Yusof, R.: Online handwriting recognition using support vector machine. In: Proceedings of the Second International Conference on Artificial Intelligence in Engineering & Technology, Kota Kinabalu, Sabah, Malaysia, (2004)
Google Scholar
Pal, U., Chanda, S., Wakabayashi, T., Kimura, F.: Accuracy improvement of Devnagari character recognition combining SVM and MQDF (2008)
Google Scholar
Ahmad, A.R., Viard-Gaudin, C., Khalid, M.: Lexicon-based word recognition using support vector machine and hidden Markov model. In: 10th International Conference on Document Analysis and Recognition, (2009)
Google Scholar
Arora, S., et al.: Performance comparison of SVM and ANN for handwritten Devnagari character recognition. Int. J. Comput. Sci. 7(3), (2010)
Google Scholar
Husnain, M., Missen, M.M.S., Mumtaz, S., Luqman, M.M., Coustaty, M., Ogier, J.-M.: Visualization of high-dimensional data by pairwise fusion matrices using t-SNE. Symmetry 11, 107 (2019)
Article Google Scholar
Scikit-learn https://scikit-learn.org/stable/modules/generated/sklearn.metrics.confusion_matrix.html. Accessed 19 June 2020
Scikit-learn https://scikit-learn.org/stable/modules/model_evaluation.html#classification-report. Accessed 19 June 2020
Jain, A., Jain, M., Jain, G., Tayal, D.K.: “UTTAM”: an efficient spelling correction system for hindi language based on supervised learning. ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP) 18(1), 8 (2019)
Google Scholar
Vij, S., Jain, A., Tayal, D., Castillo, O.: Fuzzy logic for inculcating significance of semantic relations in word sense disambiguation using a WordNet graph. Int. J. Fuzzy Syst. 20(2), 444–459 (2018)
Google Scholar

Download references

Acknowledgements

This research was supported by the National Research, Development, and Innovation Office (Hungary), grant nr. K124055.

Author information

Authors and Affiliations

Department of Telecommunication and Media Economics, Budapest University of Technology and Economics, Budapest, Hungary
Hisham Zargar, Ruba Almahasneh & László T. Kóczy

Authors

Hisham Zargar
View author publications
You can also search for this author in PubMed Google Scholar
Ruba Almahasneh
View author publications
You can also search for this author in PubMed Google Scholar
László T. Kóczy
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to László T. Kóczy .

Editor information

Editors and Affiliations

Department of Mathematics and Computational Sciences, Széchenyi István University, Győr, Hungary
István Á. Harmati
Department of Information Technology, Széchenyi István University, Győr, Hungary
László T. Kóczy
Department of Mathematics, Science Faculty, Universidad de Cádiz, Cádiz, Spain
Jesús Medina
Department of Mathematics, Faculty of Economics and Business, University of Cádiz, Cádiz, Spain
Eloísa Ramírez-Poussa

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Zargar, H., Almahasneh, R., Kóczy, L.T. (2022). Automatic Recognition of Handwritten Urdu Characters. In: Harmati, I.Á., Kóczy, L.T., Medina, J., Ramírez-Poussa, E. (eds) Computational Intelligence and Mathematics for Tackling Complex Problems 3. Studies in Computational Intelligence, vol 959. Springer, Cham. https://doi.org/10.1007/978-3-030-74970-5_19

Download citation

DOI: https://doi.org/10.1007/978-3-030-74970-5_19
Published: 26 August 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-74969-9
Online ISBN: 978-3-030-74970-5
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics

Automatic Recognition of Handwritten Urdu Characters

Abstract

Access this chapter

Similar content being viewed by others

Optical Character Recognition Using Minimal Complexity Machine and Its Comparison with Existing Classifiers

Off-line Odia Handwritten Character Recognition: A Hybrid Approach

Automatic Text Recognition from Image Dataset Using Optical Character Recognition and Deep Learning Techniques

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Publish with us

Navigation

Automatic Recognition of Handwritten Urdu Characters

Abstract

Access this chapter

Similar content being viewed by others

Optical Character Recognition Using Minimal Complexity Machine and Its Comparison with Existing Classifiers

Off-line Odia Handwritten Character Recognition: A Hybrid Approach

Automatic Text Recognition from Image Dataset Using Optical Character Recognition and Deep Learning Techniques

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Share this chapter

Publish with us

Search

Navigation