Abstract
Font recognition has become an important task in document analysis because of the growing number of digital documents set in different fonts and emphases. A generic font recognition system that is independent of language, script, and content is desirable for processing many types of documents. Categorizing calligraphy styles in handwritten manuscripts is likewise important for paleographic analysis, but it has not been studied sufficiently in the literature. We treat font recognition as a texture analysis and categorization problem: we extract features using the complex wavelet transform and classify them with support vector machines. Extensive experimental evaluations on datasets in four languages, together with comparisons against state-of-the-art studies, show that the proposed method achieves higher recognition accuracy while being computationally simpler. Furthermore, on a new dataset generated from Ottoman manuscripts, we show that the proposed method can also categorize Ottoman calligraphy with high accuracy.
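The texture-based view described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. As a stated assumption, it replaces the complex wavelet transform with a small bank of complex, oriented band-pass filters (Gabor-like kernels), and it summarizes each filter response by the mean and standard deviation of its magnitude. The function names, the three wavelengths, and the six orientations are hypothetical choices made for illustration. The classification step is omitted; the resulting vectors would be passed to a support vector machine.

```python
import numpy as np


def complex_oriented_kernel(wavelength, theta):
    """Complex band-pass kernel: Gaussian envelope times a complex exponential.

    Assumption: this Gabor-like kernel is only a stand-in for one oriented
    subband of a complex wavelet transform.
    """
    half = int(wavelength)  # support of about two standard deviations
    y, x = np.mgrid[-half:half + 1, -half:half + 1]
    u = x * np.cos(theta) + y * np.sin(theta)  # coordinate along the oscillation
    sigma = 0.5 * wavelength
    envelope = np.exp(-(x ** 2 + y ** 2) / (2.0 * sigma ** 2))
    kernel = envelope * np.exp(2j * np.pi * u / wavelength)
    return kernel - kernel.mean()  # zero response to constant regions


def texture_features(image, wavelengths=(4, 8, 16), n_orientations=6):
    """Return [mean, std] of the response magnitude for every scale and orientation.

    The image must be at least as large as the largest kernel
    (2 * max(wavelengths) + 1 pixels per side).
    """
    image = np.asarray(image, dtype=float)
    image_fft = np.fft.fft2(image - image.mean())
    features = []
    for wavelength in wavelengths:
        for k in range(n_orientations):
            theta = np.pi * k / n_orientations
            kernel = complex_oriented_kernel(wavelength, theta)
            # Circular convolution via the FFT; the kernel is zero-padded.
            kernel_fft = np.fft.fft2(kernel, s=image.shape)
            magnitude = np.abs(np.fft.ifft2(image_fft * kernel_fft))
            features.extend([magnitude.mean(), magnitude.std()])
    return np.array(features)


if __name__ == "__main__":
    # Synthetic stand-in for a block of text: vertical stripes of period 8.
    rows, cols = np.mgrid[0:64, 0:64]
    block = np.sin(2 * np.pi * cols / 8.0)
    print(texture_features(block).shape)
```

Each text block is reduced to one fixed-length vector regardless of its content, which is what lets a texture approach stay independent of language and script. In a full pipeline, vectors from labeled blocks would train the classifier and vectors from unseen blocks would be assigned a font or calligraphy style.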
Cite this article
Bozkurt, A., Duygulu, P. & Cetin, A.E. Classifying fonts and calligraphy styles using complex wavelet transform. SIViP 9 (Suppl 1), 225–234 (2015). https://doi.org/10.1007/s11760-015-0795-z
DOI: https://doi.org/10.1007/s11760-015-0795-z