Abstract
Takri is a low-resource class of scripts, used in north-west India which include states of J&K, H.P., Punjab, and Uttarakhand. This class of script has almost 13 scripts, identified in the whole region of North-west India. The paper focuses on classifying the various challenges in the script. The challenges identified are classified for effectiveness using ML classifiers and their performance is evaluated for accuracy. We believe that this classification of challenges will further aid the researchers of NLP community in solving them more efficiently.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Sk.Md. Obaidullah, et al., Script identification from printed Indian document images and performance evaluation using different classifiers. Appl. Comput. Intell. Soft Comput. 22 (2014)
Lakshmi, C. Vasantha, C. Patvardhan, An optical character recognition system for printed Telugu text. Pattern Anal. Appl. 7(2), 190–204 (2004)
S. Naz, et al., The optical character recognition of Urdu-like cursive scripts. Pattern Recogn. 47(3), 1229–1248 (2014)
B.B. Chaudhuri, U. Pal, An OCR system to read two Indian language scripts: Bangla and Devanagari (Hindi),in Proceedings of the Fourth International Conference on Document Analysis and Recognition, 1997, vol. 2. (IEEE, 1997)
S.Mohanty, H.K. Behera, A complete OCR development system for Oriya script, in Proceedings of SIMPLE, vol. 4 (2004)
Kunte, R. Sanjeev, R.D. Sudhaker Samuel, A simple and efficient optical character recognition system for basic symbols in printed Kannada text. Sadhana 32(5) (2007)
S. Magotra, B. Kaushik, A. Kaul, A comparative analysis for identification and classification of text segmentation challenges in Takri Script. Sadhana 45(1) (2020)
S. Magotra, B. Kaushik, A. Kaul, A database for printed Takri class of North-West Indian regional scripts, in International Conference on Futuristic Trends in Networks and Computing Technologies (Springer, Singapore, 2019)
G.S. Lehal, C. Singh, A technique for segmentation of Gurmukhi text, in International Conference on Computer Analysis of Images and Patterns (Springer, Berlin, Heidelberg, 2001)
M.K. Jindal, G.S. Lehal, R.K. Sharma, Segmentation problems and solutions in printed degraded Gurmukhi script. Int. J. Sig. Process. 2(4), 258–267 (2005)
M.K. Jindal, R.K. Sharma, G.S. Lehal, Segmentation of touching characters in upper zone in printed Gurmukhi script, in Proceedings of the 2nd Bangalore Annual Compute Conference (ACM, 2009)
S. Tsujimoto, H. Asada, Resolving ambiguity in segmenting touching characters, in Structured Document Image Analysis (Springer, Berlin, Heidelberg, 1992), pp. 203–215
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Magotra, S., Kaushik, B., Kaul, A. (2021). Use of Classification Approaches for Takri Text Challenges. In: Kaiser, M.S., Xie, J., Rathore, V.S. (eds) Information and Communication Technology for Competitive Strategies (ICTCS 2020). Lecture Notes in Networks and Systems, vol 190. Springer, Singapore. https://doi.org/10.1007/978-981-16-0882-7_34
Download citation
DOI: https://doi.org/10.1007/978-981-16-0882-7_34
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-16-0881-0
Online ISBN: 978-981-16-0882-7
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)