Improved word-level handwritten Indic script identification by integrating small convolutional neural networks

Ukil, Soumya; Ghosh, Swarnendu; Obaidullah, Sk Md; Santosh, K. C.; Roy, Kaushik; Das, Nibaran

doi:10.1007/s00521-019-04111-1

Original Article
Published: 06 March 2019

Volume 32, pages 2829–2844, (2020)

Neural Computing and Applications

Soumya Ukil¹, Swarnendu Ghosh¹, Sk Md Obaidullah², K. C. Santosh³, Kaushik Roy⁴ & Nibaran Das¹ (ORCID: orcid.org/0000-0002-2426-9915)

523 Accesses
22 Citations

Abstract

Handwritten document recognition has been an active area of computer vision research for many years; reading machines date back to 1914, when the “optophone”, a handheld scanner for reading printed text, was developed. In India, a single document page can contain several different scripts, so identifying them is a prerequisite for automated document understanding. We propose a novel technique for script identification that integrates convolutional neural networks (CNNs). We combine small, individually trainable CNNs whose architectures vary at several levels: the input image size, the depth of the CNN, and the use of a wavelet transformation. In our tests, we used a publicly available dataset of 11K words (1K per script) covering 11 Indic scripts: Bangla, Devanagari, Gujarati, Gurumukhi, Kannada, Malayalam, Oriya, Roman, Tamil, Telugu and Urdu. We implemented several ensemble strategies, such as max-voting and probabilistic voting, in addition to conventional approaches such as feature concatenation. We achieved a maximum accuracy of 95.04%, which exceeds the accuracy of state-of-the-art techniques such as AlexNet by 2.9% and, more importantly, exceeds benchmark script identification techniques on the same dataset by more than 4%.
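
The two voting strategies named in the abstract can be illustrated with a minimal sketch. The abstract does not define them, so the code below assumes the common readings: max-voting takes the class predicted by the most models, and probabilistic voting averages the per-class probabilities of all models and picks the largest. The function names, the tie-breaking rule and the three-class toy data are illustrative assumptions, not the authors' implementation.

```python
from collections import Counter
from typing import List, Sequence


def argmax(values: Sequence[float]) -> int:
    """Index of the largest value (the first one wins on ties)."""
    return max(range(len(values)), key=lambda i: values[i])


def max_voting(probs: List[Sequence[float]]) -> int:
    """Each model votes for its top class; the class with most votes wins.

    Assumption: ties are broken in favour of the lowest class index.
    """
    votes = Counter(argmax(p) for p in probs)
    best = max(votes.values())
    return min(cls for cls, count in votes.items() if count == best)


def probabilistic_voting(probs: List[Sequence[float]]) -> int:
    """Average the class probabilities over all models, then take the argmax.

    Assumption: every model outputs a probability vector of equal length.
    """
    n_classes = len(probs[0])
    mean = [sum(p[c] for p in probs) / len(probs) for c in range(n_classes)]
    return argmax(mean)


# Toy outputs of three hypothetical small CNNs over three classes.
example = [
    [0.40, 0.35, 0.25],  # model A: weakly prefers class 0
    [0.45, 0.40, 0.15],  # model B: weakly prefers class 0
    [0.05, 0.90, 0.05],  # model C: strongly prefers class 1
]

print("max-voting:", max_voting(example))
print("probabilistic voting:", probabilistic_voting(example))
```

On this toy input the two rules disagree: two of three models vote for class 0, while the averaged probabilities favour class 1 because the third model is far more confident. This is one reason an ensemble study may compare both strategies.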

Acknowledgement

This work is supported by the project order no. SB/S3/EECE/054/2016, dated 25/11/2016, sponsored by SERB (Government of India) and carried out at the Centre for Microprocessor Application for Training Education and Research, CSE Department, Jadavpur University, Kolkata, India.

Author information

Authors and Affiliations

Jadavpur University, Kolkata, WB, 700032, India
Soumya Ukil, Swarnendu Ghosh & Nibaran Das
Aliah University, Kolkata, WB, 700156, India
Sk Md Obaidullah
The University of South Dakota, Vermillion, SD, 57069, USA
K. C. Santosh
West Bengal State University, Kolkata, WB, 700126, India
Kaushik Roy

Corresponding author

Correspondence to Nibaran Das.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

About this article

Cite this article

Ukil, S., Ghosh, S., Obaidullah, S.M. et al. Improved word-level handwritten Indic script identification by integrating small convolutional neural networks. Neural Comput & Applic 32, 2829–2844 (2020). https://doi.org/10.1007/s00521-019-04111-1

Received: 16 October 2017
Accepted: 22 February 2019
Published: 06 March 2019
Issue Date: April 2020
DOI: https://doi.org/10.1007/s00521-019-04111-1
