Advertisement

Word-Level Handwritten Script Identification from Multi-script Documents

  • Mallikarjun Hangarge
  • K. C. Santosh
Conference paper
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 266)

Abstract

In this paper, we present the directional discrete cosine transform (DCT) based rotation invariant features for word-level handwritten script identification. Our aim in this paper is two folds: one is to validate the effectiveness of the directional DCT (D-DCT) in extracting edge information of the studied word image and another is to provide rotation invariant property since conventional DCT (C-DCT) does not offer both issues. For each extracted word image, we compute DCT, its coefficient matrix and decompose into different directions such as horizontal, vertical, left and right diagonals plus mean and standard deviations of the decomposed components. These statistical features are then evaluated with hundreds of word images from six different scripts by using linear discriminant analysis (LDA) and achieved an accuracy of 97.35 % in average.

Keywords

Discrete cosine transform Rotation invariant features Script identification Multi-script document 

References

  1. 1.
    Belaid, A., Santhosh K.C., D’ Andeey, V.P.: Handwritten and printed text separation in real document, In: Proceedings of Machine Vision Applications, pp. 218–221 (2013)Google Scholar
  2. 2.
    D Ghosh, T.D., Shivaprasad, A.P.: Script recognition a review. IEEE Trans. Pattern Anal. Mach. Intell. 32(12), 2142–2161 (2010)Google Scholar
  3. 3.
    Fu, J., Zeng, B.: Directional discrete cosine transforms: A theoretical analysis. In: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 1, pp. 1105–1108 (2007)Google Scholar
  4. 4.
    Hangarge, M., Dhandra, B.V.: Offline handwritten script identification in document images. Int. J Comput. Appl. 4(6), 1–5 (2008)Google Scholar
  5. 5.
    Hangarge, M.,Santhos, K.C., Rajmohan, P.: Directional Discrete Cosine Transform for Handwritten Script Identification. In: Proceedings of International Conference on Document Analysis and Recognition, pp. 344–348 (2013)Google Scholar
  6. 6.
    Hochberg, J., Bowers, K., Cannon, M., Kelly, P.: Script and language identification for handwritten document images. Int. J. Doc. Anal. Recogn. 2(2–3), 45–52 (1999)CrossRefGoogle Scholar
  7. 7.
    Roy, K., Banerjee, A., Pal, U.: Word-wise hand-written script separation for Indian postal automation. In: Proceedings of International Workshop on Frontiers in Handwriting Recognition, pp. 521–526, (2006)Google Scholar
  8. 8.
    Zhou, L., Lu, Y., Tan, C.L.: Bangla/english script identification based on analysis of connected component profiles. In: Proceedings of International Conference on Document Analysis and Recognition, pp. 243–254 (2006)Google Scholar
  9. 9.
    Moussa, S.B., Zahour, A., BenAbdelhafid, A., Alimi, A.M.: Fractal-based system for Arabic/Latin, printed/handwritten script identification. In: Proceedings of International Conference on Pattern Recognition, pp.1–4 (2008)Google Scholar
  10. 10.
    Namboodiri, A., Jain, A.: Online handwritten script recognition. IEEE Trans. Pattern Anal. Mach. Intell. 26(1), 124–130 (2004)CrossRefGoogle Scholar
  11. 11.
    Pati, P.B., Ramakrishnan, A.G.: Word level multi-script identification. Phys. Rev. Lett. 29(9), 122–1218 (2008)Google Scholar
  12. 12.
    Pati, P.B., Ramakrishnan, A.G.: Hvs inspired system for script identification in Indian multi-script documents. In: Proceedings of International Conference on Document Analysis and Recognition, pp. 380–389 (2006)Google Scholar
  13. 13.
    Rajput, G.G., Anitha, B.H.: Handwritten script recognition using DCT and wavelet features at block level. Int. J Comput. Appl. 158–163 (2010)Google Scholar
  14. 14.
    Sarkar, R., Das, N., Basu, S., Kundu, M., Nasipuri, M., Basu, D.K.: Word level script identification from Bangla and Devanagri handwritten texts mixed with Roman script. J Comput 2(2), 103–108 (2010)Google Scholar
  15. 15.
    Tan, T.: Rotation invariant texture features and their use in automatic script identification. IEEE Trans. Pattern Anal. Mach. Intell. 20(7), 751–758 (1998)CrossRefGoogle Scholar
  16. 16.
    Zeng, B., Fu, J.: Directional discrete cosine transforms: a new framework for image coding. IEEE Trans. Circuits Syst. Video Technol. 18(3), 305–313 (2008)CrossRefMathSciNetGoogle Scholar

Copyright information

© Springer India 2014

Authors and Affiliations

  1. 1.Karnatak Arts, Science and Commerce CollegeBidarIndia
  2. 2.Communications Engineering BranchUS National Library of Medicine National Institutes of HealthBethesdaUSA

Personalised recommendations