Abstract
Although offline handwritten Indic script recognition has been studied for decades, accurately recognizing handwritten characters and digits remains challenging because Indic scripts have complex structures and many similarly shaped characters. As with other computer vision problems, deep learning-based techniques have achieved state-of-the-art results in handwritten Indic script recognition. However, designing a successful handcrafted Deep Neural Network (DNN) from scratch requires considerable domain knowledge and a significant amount of trial and error, which consumes substantial time and computational resources. To address this problem, we simplify the search process by using a meta-heuristic evolutionary technique to automatically evolve the optimal Convolutional Neural Network (CNN) architecture. More specifically, this work proposes a novel framework based on an improved, fast-converging Adaptive Particle Swarm Optimization (APSO) to design the CNN architecture without manual intervention. Computational experiments are carried out on eight datasets of isolated handwritten characters and digits belonging to three popular Indic scripts, namely Bangla, Devanagari, and Dogri. The experimental results show that the proposed APSO-CNN technique outperforms the state-of-the-art methods on all eight datasets.
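To make the idea of swarm-based search concrete, the following is a minimal sketch of a generic particle swarm optimizer in Python. It is not the APSO-CNN framework proposed in this work: the abstract does not specify the adaptation rule or the particle encoding, so the linearly decreasing inertia weight, the coefficients `c1` and `c2`, the function name `pso_minimize`, and the toy `sphere` objective are all assumptions chosen for illustration. In an architecture search setting, each particle would presumably encode CNN hyperparameters and the objective would be a validation error obtained by training the decoded network.

```python
import random


def pso_minimize(objective, bounds, n_particles=20, n_iters=100, seed=0):
    """Minimize `objective` inside box `bounds` with a basic PSO.

    Assumption: the inertia weight decreases linearly from 0.9 to 0.4,
    a common adaptive scheme, not necessarily the one used in the paper.
    """
    rng = random.Random(seed)
    dim = len(bounds)

    # Start particles at random positions with zero velocity.
    pos = [[rng.uniform(lo, hi) for lo, hi in bounds] for _ in range(n_particles)]
    vel = [[0.0] * dim for _ in range(n_particles)]

    # Personal bests and the global best seen so far.
    pbest = [p[:] for p in pos]
    pbest_val = [objective(p) for p in pos]
    g = min(range(n_particles), key=lambda i: pbest_val[i])
    gbest, gbest_val = pbest[g][:], pbest_val[g]

    c1 = c2 = 1.5            # cognitive and social coefficients (assumed values)
    w_max, w_min = 0.9, 0.4  # inertia weight range (assumed values)

    for t in range(n_iters):
        # Large inertia early (exploration), small inertia late (exploitation).
        w = w_max - (w_max - w_min) * t / max(1, n_iters - 1)
        for i in range(n_particles):
            for d in range(dim):
                r1, r2 = rng.random(), rng.random()
                vel[i][d] = (w * vel[i][d]
                             + c1 * r1 * (pbest[i][d] - pos[i][d])
                             + c2 * r2 * (gbest[d] - pos[i][d]))
                lo, hi = bounds[d]
                # Move the particle and clamp it to the search box.
                pos[i][d] = min(hi, max(lo, pos[i][d] + vel[i][d]))
            val = objective(pos[i])
            if val < pbest_val[i]:
                pbest[i], pbest_val[i] = pos[i][:], val
                if val < gbest_val:
                    gbest, gbest_val = pos[i][:], val
    return gbest, gbest_val


def sphere(x):
    """Toy objective standing in for a network's validation error."""
    return sum(v * v for v in x)


best, best_val = pso_minimize(sphere, [(-5.0, 5.0)] * 3)
```

Replacing `sphere` with a function that decodes a position into a network, trains it briefly, and returns its validation error would turn this sketch into a simple architecture search loop, at a much higher cost per evaluation.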
Acknowledgements
We express our sincere gratitude to Dr. Naveen Kumar Gondhi, Assistant Professor, Shri Mata Vaishno Devi University for providing his precious time, constant help, and guidance to carry out this research work.
Cite this article
Sharma, R., Kaushik, B. Handwritten Indic scripts recognition using neuro-evolutionary adaptive PSO based convolutional neural networks. Sādhanā 47, 30 (2022). https://doi.org/10.1007/s12046-021-01787-x
DOI: https://doi.org/10.1007/s12046-021-01787-x