Abstract
The lack of large training data in the context of deep learning applications is a serious issue investigated by many studies that deal with the current challenge. In this paper, we introduce new data augmentation methods that generate more shape and dynamic variations to improve the performance of recognition systems using small datasets. Four data augmentation strategies are employed in our work. The first strategy employs the geometric methods that include: italicity angle, change of magnitude ratio, and baseline inclination angle. The second strategy applies a frequency treatment that attenuates or amplifies the trajectory high harmonics to generate handwriting modified styles. The third strategy employs the beta-elliptic model to extract a combined static and dynamic representation of the handwritten trajectory which undergoes limited random change around its parameters in order to generate more modified samples. The hybrid strategy consists of combining these strategies to maximize variations of the online handwriting trajectory (OHT). We evaluated our approach of data augmentation in the context of multi-lingual online handwriting recognition (OHR) tasks using end-to-end CNN architecture. Four databases; ADAB, ALTEC-OnDB, and Online_KHATT for Arabic script, and UNIPEN for Latin characters, are used to validate the proposed strategy. The obtained results show the effectiveness and the advantage of the adopted strategies compared with those registered before database extension or reported in the state-of-the-art systems.
Similar content being viewed by others
References
Abdelaziz, I., Abdou, S.: “AltecOnDB: A Large-Vocabulary Arabic Online Handwriting Recognition Database”, arXiv preprint arXiv:1412.7626, (2014)
Alex K., Ilya S., Geoffrey, E.: Hinton. Imagenet classification with deep convolutional neural networks. In Advances in neural information processing systems, pages 1097–1105, (2012)
Alimi, A. M.: An evolutionary neuro-fuzzy approach to recognize online Arabic handwriting, in: Document Analysis and Recognition, 1997., Proceedings of the Fourth International Conference on, IEEE. pp. 382–386, (1997)
Al-Helali, B.M.: and Mahmoud. A survey. Submitted for publication, S. A., Arabic Online Text Recognition (AOTR) (2016)
Al-Helali, B.M., Mahmoud, S.A.: A statistical framework for online arabic character recognition. Cybern. Syst. 47(6), 478–498 (2016)
Anh, D.. Le.., Indurkhya, B., Nakagawa, M.: Pattern generation strategies for improving recognition of handwritten mathematical expression. Pattern Recognit. Lett. 128, 255–262 (2019)
Antreas A., Amos S., Harrison E.: Data augmentation generative adversarial networks. arXiv preprint arXiv: 1711.04340, (2017)
Azeem, Sherif, A., Hany A.: “Combining online and offline systems for Arabic handwriting recognition.”Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012). IEEE, (2012)
Bahlmann, C., Burkhardt, H.:“The writer independent online handwriting recognition system frog on hand and cluster generative statistical dynamic time warping,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 26, no. 3, pp. 299–310, Mar. (2004)
Berio, D., Akten, M., Fol Leymarie, F., Grierson, M., Plamondon, R.: Sequence generation with a physiologically plausible model of handwriting and Recurrent Mixture Density Networks , ICLR, (2017)
Bezine, H., Kefi, M., Alimi, A.M.: On the beta-elliptic model for the control of the human arm movement. Int. J. Pattern Recognit Artif Intell. 21, 5–19 (2007)
Boubaker, H., Elbaati, A., Tagougui, N., ElAbed, H., Kherallah, M., Alimi, A.M.: Online Arabic databases and applications gide to OCR for arabic scripts. Springer, New York (2012)
Boubaker, H., Chaabouni, A., Tagougui, N., Kherallah, M., Alimi, A.M.: “Handwriting and Hand drawing Velocity Modeling by Superposing Beta Impulses and Continuous Training Component,” International Journal of Computer Science Issues (UCS),pp S7-63, September (2013)
Boubaker, H., Chaabouni, A., Haikal El-Abed, Alimi, A.M.:“GLoBD: Geometric and Learned Logic Algorithm for Straight or Curved Handwriting Baseline Detection” The International Arab Journal of Information Technology, Vol. 15, No. 1, January (2018)
Brandão J.A., Almeida C.M.: The Italic Style: Getting It Straight. In: Rebelo F., Soares M. (eds) Advances in Ergonomics in Design. AHFE 2018. Advances in Intelligent Systems and Computing, vol 777. Springer, (2019)
Chatfield, K., Simonyan, K., Vedaldi, A., Zisserman, A.: Return of the devil in the details: Delving deep into convolutional nets. Proc, BMVC (2014)
Derrode, S., Ghorbel, F.: Robust and efficient fourier-mellin transform approximations for gray-level image reconstruction and complete invariant description. Comput. Vis. Image Underst. 83(1), 57–78 (2001)
Elleuch, M., Maalej, R., Kherallah, M.: A new design based-SVM of the CNN classifier architecture with dropout for offline arabic handwritten recognition. Procedia - Procedia Comput. Sci. 80, 1712–1723 (2016)
Ghosh, R., Chirumavila, V., Prabhat, K.: RNN based online handwritten word recognition in Devanagari and Bengali scripts using horizontal zoning. Pattern Recogn. 92, 203–218 (2019)
Graves, A., Liwicki, M., Bunke, H., Schmidhuber, J., Fernández, S.: Unconstrained on-line handwriting recognition with recurrent neural networks. In: Advances in Neural Information Processing Systems, pp. 577–584, (2008)
Graves, A., Schmidhuber, J.: Offline handwriting recognition with multidimensional recurrent neural networks. Advances in Neural Information Processing Systems 21,(2008)
Graves, A., Mohamed, A., Hinton. G.: Speech recognition with deep recurrent neural networks. In Proc. ICASSP, (2013)
Graves, A.: Generating Sequences With Recurrent Neural Networks, arXiv:1308.0850, 5 Jun (2014)
Guyon, I., Schomaker, L.R.B., Plamondon, R., Liberman, M. Janet, S.: “UNIPEN Project of On-Line Data Exchange and Recognizer Benchmarks,”Proc. 12th Int’l Conf
Hamdi., Y., Boubaker H., Dhieb., T., Elbaati., A, Alimi, M.: Adel. Hybrid DBLSTM-SVM based Beta-elliptic-CNN Models for Online Arabic Characters Recognition. In International Conference on Document Analysis and Recognition (ICDAR), pages 803-808, (2019)
Hamdi Y., Chaabouni A., Boubaker H., Alimi A.M.: Hybrid Neural Network and Genetic Algorithm for off-Lexicon Online Arabic Handwriting Recognition. In: Proceedings of the 16th International Conference on Hybrid Intelligent Systems (HIS 2016). HIS 2016. Advances in Intelligent Systems and Computing, vol 552. (2017)
Keysers, D., Deselaers, T., Rowley, H., Wang, L.L.: Fast multi-language LSTM-based online handwriting recognition. International Journal on Document Analysis and Recognition (IJDAR) 89–102,(2020)
Kherallah, M.,Najiba, T., Alimi,A.M., El Abed, H., Margner, M.: “Online Arabic Handwriting Recognition Competition,” International Conference on Document Analysis and Recognition, (2011)
Krishnan, P., Jawahar, C. V.: “Matching handwritten document images,” The 14th European Conference on Computer Vision (ECCV), (2016)
Ltaief, M., Njah, S., Bezine, H., Alimi, M.A.: Genetic algorithms for perceptual codes extraction. J. Intell. Learn. Syst. Appl. JILSA 4, 256–265 (2012)
Lin, T.-Y., Maire, M., Belongie, S., Hays, J., Perona, P., Ramanan, D., Dollar, P., Zitnick, C. L.:“Microsoft COCO: Common objects in context,” in Proc. Eur. Conf. Comp. Vis., (2014)
Mahmoud, S., Luqman, H., Al-Helali, B., BinMakhashen, G., Tanvir Parvez, M.: ”Online-KHATT: An Open-Vocabulary Database for Arabic OnlineText Processing”. The Open Cybernetics & Systemics Journal, vol. 12, pp. 42-59, March. (2018)
Patrice, J. P., Simard, Y., Dave S.: “Best practices for convolutional neural networks applied to visual document analysis.” Institute of Electrical and Electronics Engineers, Inc., August (2003)
Plamondon, R.: A delta lognormal model for handwriting generation, in: Proceedings of the Biennal Conference of the International Graphonomics Society, pp. 126–127, (1995)
Plamondon, R., Srihari, S. N.: “On-line and off-line handwriting recognition: A comprehensive survey,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 22, no. 1, pp. 63–84, Jan. (2000)
Plamondon, R., Pirlo, G., Anquetil, E., Rémi, C., Teulings, H., Nakagawa, M.: Personal digital bodyguards for e-Security, e-Learning and e-Health: a prospective surve. Pattern Recogn. 81, 633–659 (2018)
Poznanski, A., Wolf, L.:“Cnn-n-gram for handwriting word recognition,” Proc. CVPR, (2016)
Sen, S., Shaoo, D., Paul, S., Sarkar, R., Roy, K.: Online Handwritten Bangla Character Recognition Using CNN: A Deep Learning. Intelligent Engineering Informatics 695,(2018)
Shen, X., Messina, R.: “A method of synthesizing handwritten chinese images for data augmentation,” The 15th International Conference on Frontiers in Handwriting Recognition (ICFHR), pp. 114–119, (2016)
Smach, F., Lemaître, C., Gauthier, J.-P., Mitéran, J., Atri, M.: Generalized fourier descriptors with applications to objects recognition in svm context. J. Math. Imaging Vision 30(1), 43–71 (2008)
Sennrich, R., Haddow, H.,Birch, A.: Neural machine translation of rare words with subword units. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 1715–1725. Association for Computational Linguistics, (2016)
Shorten, C., Khoshgoftaar, T.M.: A survey on image data augmentation for deep learning. J. Big Data 6, 60 (2019)
T. Lin, M. Maire, S. Belongie, et al., CoCo: Microsoft COCO: common objects in context, in: Proceedings of the ECCV, (2014)
Wigington, C., Stewart, S., Davis, B., Barrett, B., Price, B, Cohen, S.:“Data Augmentation for Recognition of Handwritten Words and Lines Using a CNN-LSTM Network,” 14th IAPR International Conference on Document Analysis and Recognition (ICDAR), Kyoto, pp. 639-645, (2017)
Wilson-Nunn, D., Lyons, T., Papavasiliou, A., Ni, H.: “A Path Signature Approach to Online Arabic Handwriting Recognition”, International Workshop on Arabic and Derived Script Analysis and Recognition (ASAR): 135-139, (2018)
Wu, Y.C., Yin, F.: Liu, CL: & #x201C;Improving handwritten Chinese text recognition using neural network language models and convolutional neural network shape models. Pattern Recogn. 65, 251–264 (2017)
Xiwen, Qu, Wang, W., Lu, K., Zhou, J.: Data augmentation and directional feature maps extraction for in-air handwritten Chinese character recognition based on convolutional neural network. Pattern Recogn, Lett (2018)
Yuan, A., Bai, G., Yang, P., Guo, Y., Zhao, X.:“ Handwritten English word recognition based on Convolutional Neural Networks” Proceedings of the 13th International Conference on Frontiers in Handwriting Recognition2, Bari, Italy, pp. 207–(2012)
Online and offline handwritten chinese character recognition: Zhang, X.-Y., Bengio, Y. C.-L. Liu. A comprehensive study and new benchmark, Pattern Recognition 61, 348–360 (2017)
Zitouni1, R., Bezine, H., Arous, N.:“Online Handwritten Arabic Scripts Recognition Using Stroke-Based Class Labeling Scheme”, International Journal of Computational Intelligence Systems (IJCIS), (2020)
Acknowledgements
The research leading to these results has received funding from the Ministry of Higher Education and Scientific Research of Tunisia under the grant agreement number LR11ES48.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Hamdi, Y., Boubaker, H. & Alimi, A.M. Data Augmentation using Geometric, Frequency, and Beta Modeling approaches for Improving Multi-lingual Online Handwriting Recognition. IJDAR 24, 283–298 (2021). https://doi.org/10.1007/s10032-021-00376-2
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10032-021-00376-2