Script Identification of South-East and South-West Asia: A Survey

Zakarde, Sandeepa V.; Rojatkar, Dinesh V.

doi:10.1007/978-981-15-1420-3_27

Sandeepa V. Zakarde³⁷ &
Dinesh V. Rojatkar³⁷

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 601))

52 Accesses

Abstract

The world population is 7.7 billion and the largest and most populous continent is Asia having 59.66% of the total world population. Southern Asia accounts for 39.49%, out of which South-East has 8.59% ranking third. Whereas, Western Asia is equivalent to 3.59% and ranks fourth in world population. This region hosts a variety of languages, playing a critical role in the polygraphia formation, sharing of one script by several languages which have applications in multilingual access to patents, business regulatory information for independently evaluating all regional market requirements. Ideographic languages in Southeast Asian scripts are left-to-right or vertically top-to-bottom is more flexible in their writing direction. This paper presents the challenges involved in analyzing handwritten and printed documents. The review work of popular scripts namely Chinese, Japanese, Thai, Sinhala, Balinese and Arabic using various methods of feature extraction and different classifiers are represented in this paper. It summarizes most of the existing methodologies in the papers published by various researchers.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 89.00; Price excludes VAT (USA)

Softcover Book: USD 119.99; Price excludes VAT (USA)

Hardcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Spitz AL (1997) Determination of the script and language content of document images. IEEE Trans Pattern Anal Mach Intell
Google Scholar
Hochberg J, Bowers K, Cannon M, Kally P (1999) Script and language identification for handwritten document images. Int J Doc Anal Recognit
Google Scholar
Pal U, Chaudhuri BB (2001) Automatic identification of English, Chinese, Arabic, Devnagari and Bangla script line. In: ieee computer vision and pattern recognition unit Indian statistical institute
Google Scholar
Touj S, Amara NB, Amiri H (2005) Arabic handwritten words recognition based on a planar Hidden Markov Model. Int Arab J Inf Technol
Google Scholar
Chanda S, Pal U, Kimura F (2007) Identification of Japanese and English script from a single document page. In: IEEE, Seventh international conference on computer and information technology
Google Scholar
Chanda S, Terrades OR, Pal U (2007) SVM based scheme for Thai and English script identification. In: Ninth international conference on document analysis and recognition ICDAR
Google Scholar
Chanda S, Pal U, Franke K, Kimura F (2010) Script identification-a Han & Roman perspective. In: IEEE international conference on pattern recognition
Google Scholar
Piao M, Cui R (2013) An approach to script identification in multi-language text image. In: IEEE sixth international conference on intelligent networks and intelligent systems
Google Scholar
Sudarma M, Ariyani S, Artana M (2016) Balinese script’s character reconstruction using linear discriminant analysis. Indones J Electr Eng Comput Sci
Google Scholar
Rojatkar DV, Chinchkhede KD, Sarate GG (2013) Handwritten Devnagari consonants recognition using MLPNN with Five-fold cross validation. In: International conference on circuits, power and computing technologies
Google Scholar
Tan TN (1998) Invariant texture features and their use in automatic script identification. IEEE Pattern Anal Mach Intell
Google Scholar
Jaegar S, Ma H, Doermann D (2005) Identifying script on word-level with informational confidence. In: IEEE international conference on document analysis and recognition
Google Scholar
Chanda S, Pal U (2009) Word-wise thai and roman script identification. ACM Trans Asian Lang Inf Process
Google Scholar
Tan GX, Viard-Gaudin C, Kot AC (2009) Information retrieval model for online handwritten script identification. In: 10th international conference on document analysis and recognition
Google Scholar

Download references

Author information

Authors and Affiliations

Government College of Engineering Amravati, Amravati, Maharashtra, India
Sandeepa V. Zakarde & Dinesh V. Rojatkar

Authors

Sandeepa V. Zakarde
View author publications
You can also search for this author in PubMed Google Scholar
Dinesh V. Rojatkar
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Sandeepa V. Zakarde .

Editor information

Editors and Affiliations

BioAxis DNA Research Centre Private Ltd, Hyderabad, Telangana, India
Amit Kumar
Polish Academy of Science, Systems Research Institute, Warsaw, Poland
Marcin Paprzycki
Department of Computer Science and Engineering, CMR Institute of Technology, Hyderabad, Telangana, India
Vinit Kumar Gunjan

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Zakarde, S.V., Rojatkar, D.V. (2020). Script Identification of South-East and South-West Asia: A Survey. In: Kumar, A., Paprzycki, M., Gunjan, V. (eds) ICDSMLA 2019. Lecture Notes in Electrical Engineering, vol 601. Springer, Singapore. https://doi.org/10.1007/978-981-15-1420-3_27

Download citation

DOI: https://doi.org/10.1007/978-981-15-1420-3_27
Published: 19 May 2020
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-1419-7
Online ISBN: 978-981-15-1420-3
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics