A Database for Handwritten Yoruba Characters

  • Samuel Ojumah
  • Sanjay Misra
  • Adewole Adewumi
Conference paper
Part of the Communications in Computer and Information Science book series (CCIS, volume 799)


This paper describes a novel, publicly available dataset for research on offline Yoruba handwritten character recognition. It contains 6954 characters in several categories, collected from 183 writers, which makes it the largest available dataset for Yoruba handwriting research. It can be used to design and evaluate handwritten character recognition systems for the Yoruba language, and to provide insights through writer identification. The dataset has been partitioned into training and test sets containing 70% and 30% of the samples, respectively.
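The 70%/30% partition can be illustrated with a minimal sketch. The abstract does not say how the split was drawn, so the per-class (stratified) shuffling below is an assumption, and the function name `stratified_split`, the seed, and the toy labels are hypothetical and not taken from the dataset.

```python
import random
from collections import defaultdict


def stratified_split(labels, train_fraction=0.7, seed=0):
    """Return (train_indices, test_indices), splitting each class separately.

    Assumption: the split is stratified by character class. The source only
    states the overall 70%/30% proportions.
    """
    by_class = defaultdict(list)
    for index, label in enumerate(labels):
        by_class[label].append(index)

    rng = random.Random(seed)  # fixed seed so the split is reproducible
    train, test = [], []
    for label in sorted(by_class):
        indices = by_class[label]
        rng.shuffle(indices)
        cut = round(len(indices) * train_fraction)
        train.extend(indices[:cut])
        test.extend(indices[cut:])
    return sorted(train), sorted(test)


# Toy labels, not the real dataset: 3 hypothetical classes, 10 samples each.
labels = [c for c in "abd" for _ in range(10)]
train_idx, test_idx = stratified_split(labels)
print(len(train_idx), len(test_idx))
```

Splitting within each class keeps rare characters represented in both sets. Because 70% of 6954 is not a whole number (4867.8), the actual set sizes depend on how the authors rounded, which the abstract does not state.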


Database, Character recognition, Yoruba



The generation of this database was done with help from Learnd Technologies, which assisted from the design phase through the scanning phase. The authors thank all members of the Learnd team for their collaboration in the creation of the dataset. The financial support of the Covenant University Centre for Research Innovation and Discovery (CUCRID) is also acknowledged.



Copyright information

© Springer Nature Singapore Pte Ltd. 2018

Authors and Affiliations

  1. Covenant University, Ota, Nigeria
