Skip to main content
Log in

Classifying fonts and calligraphy styles using complex wavelet transform

  • Original Paper
  • Published:
Signal, Image and Video Processing Aims and scope Submit manuscript

Abstract

Recognizing fonts has become an important task in document analysis, due to the increasing number of available digital documents in different fonts and emphases. A generic font recognition system independent of language, script and content is desirable for processing various types of documents. At the same time, categorizing calligraphy styles in handwritten manuscripts is important for paleographic analysis, but has not been studied sufficiently in the literature. We address the font recognition problem as analysis and categorization of textures. We extract features using complex wavelet transform and use support vector machines for classification. Extensive experimental evaluations on different datasets in four languages and comparisons with state-of-the-art studies show that our proposed method achieves higher recognition accuracy while being computationally simpler. Furthermore, on a new dataset generated from Ottoman manuscripts, we show that the proposed method can also be used for categorizing Ottoman calligraphy with high accuracy.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4

Similar content being viewed by others

References

  1. Abdelnour, A.F., Selesnick, I.W.: Nearly symmetric orthogonal wavelet bases. In: Proceedings IEEE International Conference on Acoustics, Speech, Signal Processing (ICASSP), vol. 6 (2001)

  2. Abuhaiba, I.S.I.: Arabic font recognition using decision trees built from common words. J. Comput. Inf. Technol. 13(3), 211–224 (2004)

    Google Scholar 

  3. Aiolli, F., Simi, M., Sona, D., Sperduti, A., Starita, A., Zaccagnini, G.: SPI: A system for paleographic inspections. In: AI IA Notizie, vol. 4, pp. 34–48 (1999)

  4. Amin, A.: Off-line arabic character recognition: the state of the art. Pattern Recognit. 31(5), 517–530 (1998)

    Article  MathSciNet  Google Scholar 

  5. Aviles-Cruz, C., Rangel-Kuoppa, R., Reyes-Ayala, M., Andrade-Gonzalez, A., Escarela-Perez, R.: High-order statistical texture analysis-font recognition applied. Pattern Recognit. Lett. 26(2), 135–145 (2005)

    Article  Google Scholar 

  6. Azmi, M.S., Omar, K., Nasrudin, M.F., Muda, A.K., Abdullah, A.: Arabic calligraphy classification using triangle model for digital jawi paleography analysis. In: 11th International Conference on Hybrid Intelligent Systems (HIS), pp. 704–708 (2011)

  7. Bataineh, B., Abdullah, S.N.H.S., Omar, K.: A novel statistical feature extraction method for textual images: optical font recognition. Expert Syst. Appl. 39(5), 5470–5477 (2012)

    Article  Google Scholar 

  8. Ben Moussa, S., Zahour, A., Benabdelhafid, A., Alimi, A.M.: New fractal-based system for arabic/latin, printed/handwritten script identification. In: 19th International Conference on Pattern Recognition (ICPR 2008), pp. 1–4 (2008)

  9. Ben Moussa, S., Zahour, A., Benabdelhafid, A., Alimi, A.M.: New features using fractal multi-dimensions for generalized arabic font recognition. Pattern Recognit. Lett. 31(5), 361–371 (2010)

    Article  Google Scholar 

  10. Borji, A., Hamidi, M.: Support vector machine for persian font recognition. Int. J. Intell. Syst. Technol. 184–197 (2007)

  11. Bozkurt, A., Suhre, A., Cetin, A.E.: Multi-scale directional-filtering-based method for follicular lymphoma grading. Signal Image Video Process. 8(1), 63–70 (2014)

    Article  Google Scholar 

  12. Cai, S., Li, K., Selesnick, I.: Matlab implementation of wavelet transforms. Polytechnic University (2010)

  13. Celik, T., Tjahjadi, T.: Multiscale texture classification using dual-tree complex wavelet transform. Pattern Recognit. Lett. 30(3), 331–339 (2009)

    Article  Google Scholar 

  14. Chang, C.C., Lin, C.J.: LIBSVM: a library for support vector machines. ACM Trans. Intell. Syst. Technol. (TIST) 2(3), 27 (2011)

    Google Scholar 

  15. Chaudhuri, B.B., Garain, U.: Automatic detection of italic, bold and all-capital words in document images. In: Proceedings of Fourteenth International Conference on Pattern Recognition, vol. 1, pp. 610–612. IEEE (1998)

  16. Cortes, C., Vapnik, V.: Support-vector networks. Mach. Learn. 20(3), 273–297 (1995)

    MATH  Google Scholar 

  17. Hatipoglu, S., Mitra, S.K., Kingsbury, N.: Texture classification using dual-tree complex wavelet transform. In: Seventh International Conference on Image Processing and its Applications (Conf. Publ. No. 465), vol. 1, pp. 344–347. IET (1999)

  18. Hill, P.R., Bull, D.R., Canagarajah, C.N.: Rotationally invariant texture features using the dual-tree complex wavelet transform. In: Proceedings of International Conference on Image Processing, vol. 3, pp. 901–904. IEEE (2000)

  19. Hsu, C.W., Chang, C.C., Lin, C.J.: A practical guide to support vector classification. National Taiwan University (2003)

  20. Khosravi, H., Kabir, E.: Farsi font recognition based on sobel-roberts features. Pattern Recognit. Lett. 31(1), 75–82 (2010)

    Article  Google Scholar 

  21. Kingsbury, N.: A dual-tree complex wavelet transform with improved orthogonality and symmetry properties. In: Proceedings of International Conference on Image Processing, vol. 2, pp. 375–378. IEEE (2000)

  22. Kingsbury, N.G.: The dual-tree complex wavelet transform: a new efficient tool for image restoration and enhancement. In: Proceedings of EUSIPCO, vol. 98, pp. 319–322 (1998)

  23. Ma, H., Doermann, D.: Gabor filter based multi-class classifier for scanned document images. In: 7th International Conference on Document Analysis and Recognition (ICDAR), pp. 968–972 (2003)

  24. Otsu, N.: A threshold selection method from gray-level histograms. IEEE Trans. Syst. Man Cybern. 9(1), 62–66 (1979)

    Article  MathSciNet  Google Scholar 

  25. Ozturk, A., Gunes, S., Ozbay, Y.: Multifont ottoman character recognition. In: The 7th IEEE International Conference on Electronics, Circuits and Systems (ICECS) (2000)

  26. Ozturk, S., Toygar Abak, A., Sankur, B.: Font clustering and cluster identification in document images. J. Electron. Imaging 10(2), 418–430 (2001)

    Article  Google Scholar 

  27. Petkov, N., Wieling, M.B.: Gabor filter for image processing and computer vision. University of Groningen (2008)

  28. Portilla, J., Simoncelli, E.P.: A parametric texture model based on joint statistics of complex wavelet coefficients. Int. J. Comput. Vis. 40(1), 49–70 (2000)

    Article  MATH  Google Scholar 

  29. Rado, S.: Turk Hattatlari:XV. yzyildan gnmze kadar gelmis nl hattatlarin hayatlari ve yazilarindan rnekler. Yayin Matbaacilik Ticaret (1983)

  30. Ramanathan, R., Soman, K.P., Thaneshwaran, L., Viknesh, V., Arunkumar, T., Yuvaraj, P.: A novel technique for english font recognition using support vector machines. In: International Conference on Advances in Recent Technologies in Communication and Computing (ARTCom ’09), pp. 766–769 (2009)

  31. Senobari, E.M., Khosravi, H.: Farsi font recognition based on combination of wavelet transform and sobel-robert operator features. In: 2nd International eConference on Computer and Knowledge Engineering (ICCKE), pp. 29–33. IEEE (2012)

  32. Slimane, F., Kanoun, S., Hennebert, J., Alimi, A.M., Ingold, R.: A study on font-family and font-size recognition applied to arabic word images at ultra-low resolution. Pattern Recognit. Lett. 34, 209–218 (2013)

    Article  Google Scholar 

  33. Villegas-Cortez, J., Aviles-Cruz, C.: Font recognition by invariant moments of global textures. In: Proceedings of International Workshop VLBV05 (very low bit-rate video-coding 2005), pp. 15–16 (2005)

  34. Yang, Z., Yang, L., Qi, D., Suen, C.Y.: An EMD-based recognition method for chinese fonts and styles. Pattern Recognit. Lett. 27(14), 1692–1701 (2006)

    Article  Google Scholar 

  35. Yosef, I.B., Beckman, I., Kedem, K., Dinstein, I.: Binarization, character extraction, and writer identification of historical hebrew calligraphy documents. Int. J. Doc. Anal. Recognit. 9(2–4), 89–99 (2007)

    Article  Google Scholar 

  36. Zahedi, M., Eslami, S.: Farsi/arabic optical font recognition using sift features. Procedia Comput. Sci. 3, 1055–1059 (2011)

    Article  Google Scholar 

  37. Zhu, Y., Tan, T., Wang, Y.: Font recognition based on global texture analysis. IEEE Trans. Pattern Anal. Mach. Intell. (10), 1192–1200 (2001)

  38. Zhuang, Y., Weiming, L., Jiangqin, W.: Latent style model: discovering writing styles for calligraphy works. J. Visual Commun. Image Represent. 20, 84–96 (2009)

  39. Zramdini, A., Ingold, R.: Optical font recognition using typographical features. IEEE Trans. Pattern Anal. Mach. Intell. 20(8), 877–882 (1998)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Alican Bozkurt.

Electronic supplementary material

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Bozkurt, A., Duygulu, P. & Cetin, A.E. Classifying fonts and calligraphy styles using complex wavelet transform. SIViP 9 (Suppl 1), 225–234 (2015). https://doi.org/10.1007/s11760-015-0795-z

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11760-015-0795-z

Keywords

Navigation