HanFont: large-scale adaptive Hangul font recognizer using CNN and font clustering

Yang, Jinhyeok; Kim, Heebeom; Kwak, Hyobin; Kim, Injung

doi:10.1007/s10032-019-00337-w

Original Paper
Published: 31 July 2019

Volume 22, pages 407–416 (2019)
International Journal on Document Analysis and Recognition (IJDAR)

Jinhyeok Yang¹, Heebeom Kim¹, Hyobin Kwak¹ & Injung Kim¹ (ORCID: orcid.org/0000-0003-4439-6097)

Abstract

We propose a large-scale Hangul font recognizer capable of recognizing 3300 Hangul fonts. Large-scale Hangul font recognition is challenging: Hangul fonts are typically distinguished by small differences in detailed shapes, which recognizers often ignore. Practical applications raise additional issues, such as almost indistinguishable fonts and new fonts released after the recognizer has been trained. Only a few recently developed font recognizers are scalable enough to recognize thousands of fonts, and most of those focus on fonts for Western languages. The proposed recognizer, HanFont, is composed of a convolutional neural network (CNN) model designed to effectively distinguish detailed shapes and a font clustering algorithm that addresses the issues caused by indistinguishable fonts and untrained new fonts. In the experiments, HanFont exhibited a recognition rate of 94.11% for 3300 Hangul fonts, including numerous similar fonts, which is 2.49% higher than that of ResNet. The cluster-level recognition accuracy of HanFont was 99.47% when the 3300 fonts were grouped into 1000 clusters. In a test on 100 new fonts without retraining the CNN model, HanFont exhibited 57.87% accuracy. The average accuracy for the top 56 untrained fonts was 75.76%.
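
The abstract reports both a font-level recognition rate and a cluster-level accuracy but does not define the latter here. The sketch below shows one plausible reading, under the assumption that a prediction counts as correct at the cluster level when the predicted font and the true font belong to the same cluster. The function names, the `font_to_cluster` mapping, and the toy labels are hypothetical illustrations, not the authors' implementation or data.

```python
from typing import Dict, Sequence


def font_accuracy(true_labels: Sequence[int], predicted_labels: Sequence[int]) -> float:
    """Fraction of samples whose predicted label equals the true label."""
    if len(true_labels) != len(predicted_labels):
        raise ValueError("label sequences must have the same length")
    if not true_labels:
        raise ValueError("at least one sample is required")
    hits = sum(t == p for t, p in zip(true_labels, predicted_labels))
    return hits / len(true_labels)


def cluster_accuracy(
    true_fonts: Sequence[int],
    predicted_fonts: Sequence[int],
    font_to_cluster: Dict[int, int],
) -> float:
    """Fraction of samples whose predicted font shares a cluster with the true font.

    Assumption: every font ID appears as a key in font_to_cluster.
    """
    true_clusters = [font_to_cluster[f] for f in true_fonts]
    predicted_clusters = [font_to_cluster[f] for f in predicted_fonts]
    return font_accuracy(true_clusters, predicted_clusters)


# Hypothetical toy data: four fonts grouped into two clusters.
font_to_cluster = {0: 0, 1: 0, 2: 1, 3: 1}
true_fonts = [0, 1, 2, 3, 0]
predicted_fonts = [0, 0, 2, 1, 1]

font_level = font_accuracy(true_fonts, predicted_fonts)
cluster_level = cluster_accuracy(true_fonts, predicted_fonts, font_to_cluster)
print(f"font-level accuracy:    {font_level:.2f}")
print(f"cluster-level accuracy: {cluster_level:.2f}")
```

Under this reading, cluster-level accuracy can never be lower than font-level accuracy, because any exact font match is also a cluster match. That is consistent with the reported increase from 94.11% to 99.47% when similar fonts are grouped.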

Acknowledgements

This work was supported by the Cultural Technology R&D Program funded by the Ministry of Culture, Sports and Tourism and the Korea Creative Content Agency, and by the National Program for Excellence in Software funded by the Ministry of Science and ICT, Republic of Korea (2017000130).

Author information

Authors and Affiliations

School of CSEE, Handong Global University, 558 Handong-ro Buk-gu, Pohang, Gyeongbuk, 37554, Republic of Korea
Jinhyeok Yang, Heebeom Kim, Hyobin Kwak & Injung Kim

Corresponding author

Correspondence to Injung Kim.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

About this article

Cite this article

Yang, J., Kim, H., Kwak, H. et al. HanFont: large-scale adaptive Hangul font recognizer using CNN and font clustering. IJDAR 22, 407–416 (2019). https://doi.org/10.1007/s10032-019-00337-w

Received: 13 August 2018
Revised: 08 July 2019
Accepted: 16 July 2019
Published: 31 July 2019
Issue Date: December 2019
DOI: https://doi.org/10.1007/s10032-019-00337-w
