Skip to main content

A Novel Bangla Font Recognition Approach Using Deep Learning

  • Conference paper
  • First Online:
Emerging Technologies in Data Mining and Information Security

Abstract

Font detection is an essential pre-processing step for printed character recognition. In this era of computerization and automation, computer composed documents such as official documents, bank checks, loan applications, visiting cards, invitation cards, educational materials are used everywhere. Beyond just editing and processing documents, converting documents from one format to another, such as an invitation card, billboards is another major application area where a designer has to recognize the font details from the images. There is a lot of re-search on automatic font detection published for high-resource languages such as English. Still, not much has been reported for a low resource language such as Bangla. Bangla has a complex structure because of the use of diacritics, compound characters and graphemes. Furthermore, because of the popularity of digital, online publications, there has been a recent surge of fonts in Bangla. Font detection can also help analysts to detect changes in font choices based on sociopolitical divides: for example, consider that fonts common in Bangladesh may not be as popular among Bangla publications in India. In this paper, we present a convolutional neural network (CNN) approach for detecting Bangla fonts, using a space adjustment method dependent on a stacked convolutional auto-encoder (SCAE). As part of the work, we built a large corpus of printed documents consisting of 12,187 images in 7 different Bangla fonts, forming a total of 77,728 samples by augmentations to train and validate our model. Our proposed model achieves 98.73% average font recognition accuracy in the validation set.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. The World Factbook: www.cia.gov. Central Intelligence Agency. Archived from the original on 13 February 2008. Retrieved 21 Feb 2018

  2. Hasan, M.Z., Rahman, K.T., Riya, R.I., Hasan, K.Z., Zahan, N.: A CNN-based classification model for recognizing visual Bengali font. In: Proceedings of International Joint Conference on Computational Intelligence, pp. 471–482. Springer (2020)

    Google Scholar 

  3. Rakshit, A., Barshan, R.A., Islam, M.I.: Bangla font detection using 1-D discrete wavelet transform

    Google Scholar 

  4. Chanda, S., Pal, U., Franke, K.: Font identification—in context of an Indic script. In: Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012), pp. 1655–1658. IEEE (2012)

    Google Scholar 

  5. Yadav, R.K., Mazumdar, B.D.: Detection of Bold and Italic Character in Devanagari Script. Int. J. Comput. Appl. 39(2), 19–22 (2012)

    Google Scholar 

  6. Lehal, G.S., Saini, T.S., Buttar, S.P.K.: Automatic bilingual legacy-fonts identification and conversion system. Res. Comput. Sci. 86, 9–23 (2014)

    Article  Google Scholar 

  7. Ghosh, S., Roy, P., Bhattacharya, S., Pal, U.: Large-scale font identification from document images. In: Asian Conference on Pattern Recognition, pp. 594–600. Springer, Cham (2019)

    Google Scholar 

  8. Slimane, F., Kanoun, S., Hennebert, J., Alimi, A., Ingold, R.: A study on font-family and font-size recognition applied to Arabic word images at ultra-low resolution. Pattern Recogn. Lett. 34(2), 209–218 (2013)

    Article  Google Scholar 

  9. Tensmeyer, C., Saunders, D., Martinez, T.: Convolutional neural networks for font classification. In: 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR), pp. 985–990 (2017)

    Google Scholar 

  10. Wang, Z., Yang, J., Jin, H., Shechtman, E.: DeepFont: identify your font from an image. In: Proceedings of the 23rd ACM International Conference on Multimedia, pp. 451–459. ACM, Brisbane, Australia (2015)

    Google Scholar 

  11. Wang, Y., Lian, Z., Tang, Y., Xiao, J.: Font recognition in natural images via transfer learning. In: Tang, Y., Xiao, J. (eds.) International Conference on Multimedia Modeling 2018, LNCS, vol. 10704, pp 229–240. Springer (2018)

    Google Scholar 

  12. Ramanthan R, Thaneshwaran L, Viknesh V, Arunkumar T, Yuvaraj P, Soman DKP (2009) A novel technique for English font recognition using support vector machines. In: 2009 international conference on advances in recent technologies in communication and computing, pp 766–769

    Google Scholar 

  13. Khoubyari, S., Hull, J.: Font and function word identification in document recognition. Comput. Vis. Image Underst. 63, 66–74 (1996)

    Google Scholar 

  14. Ding, Xiaoqing, Chen, Li, Wu, Tao: Character independent font recognition on a single Chinese character. IEEE Trans. Pattern Anal. Mach. Intell. 29, 195–204 (2007)

    Article  Google Scholar 

Download references

Acknowledgements

The authors would like to acknowledge the encouragement and funding from the “Enhancement of Bangla Language in ICT through Research & Development (EBLICT)” project, under the Ministry of ICT, the Government of Bangladesh.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Md. Majedul Islam .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Islam, M.M., Rabby, A.K.M.S.A., Hasan, N., Nahar, J., Rahman, F. (2021). A Novel Bangla Font Recognition Approach Using Deep Learning. In: Hassanien, A.E., Bhattacharyya, S., Chakrabati, S., Bhattacharya, A., Dutta, S. (eds) Emerging Technologies in Data Mining and Information Security. Advances in Intelligent Systems and Computing, vol 1300. Springer, Singapore. https://doi.org/10.1007/978-981-33-4367-2_71

Download citation

Publish with us

Policies and ethics