Script Identification from Indian Documents

  • Gopal Datt Joshi
  • Saurabh Garg
  • Jayanthi Sivaswamy
Part of the Lecture Notes in Computer Science book series (LNCS, volume 3872)

Abstract

Automatic identification of a script in a given document image facilitates many important applications such as automatic archiving of multilingual documents, searching online archives of document images and for the selection of script specific OCR in a multilingual environment. In this paper, we present a scheme to identify different Indian scripts from a document image. This scheme employs hierarchical classification which uses features consistent with human perception. Such features are extracted from the responses of a multi-channel log-Gabor filter bank, designed at an optimal scale and multiple orientations. In the first stage, the classifier groups the scripts into five major classes using global features. At the next stage, a sub-classification is performed based on script-specific features. All features are extracted globally from a given text block which does not require any complex and reliable segmentation of the document image into lines and characters. Thus the proposed scheme is efficient and can be used for many practical applications which require processing large volumes of data. The scheme has been tested on 10 Indian scripts and found to be robust to skew generated in the process of scanning and relatively insensitive to change in font size. This proposed system achieves an overall classification accuracy of 97.11% on a large testing data set. These results serve to establish the utility of global approach to classification of scripts.

Keywords

Document Image Text Line Optimal Scale Character Segmentation Text Block 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

  1. 1.
    Spitz, A.: Determination of the script and language content of document images. IEEE Trans. Pattern Anal. Mach. Intell 19(3), 235–245 (1997)CrossRefGoogle Scholar
  2. 2.
    Hochberg, J., Kerns, L., Kelly, P., Thomas, T.: Automatic script identification from images using cluster-based templates. IEEE Trans. Pattern Anal. Mach. Intell. 19(2), 176–181 (1997)CrossRefGoogle Scholar
  3. 3.
    Wood, S.L., Yao, X., Krishnamurthi, K., Dang, L.: Language identification for printed text independent of segmentation. In: Proceedings of International Conference on Image Processing, vol. 3, pp. 428–431 (1995)Google Scholar
  4. 4.
    Tan, T.N.: Rotation invariant texture features and their use in automatic script identification. IEEE Trans. Pattern Anal. Mach. Intell. 20(7), 751–756 (1998)CrossRefGoogle Scholar
  5. 5.
    Pal, U., Sinha, S., Chaudhuri, B.B.: Multi-script line identification from indian document. In: Seventh International Conference on Document Analysis and Recognition, vol. 2, pp. 880–884 (2003)Google Scholar
  6. 6.
    Chaudhury, S., Sheth, R.: Trainable script identification strategies for indian languages. In: Fifth International Conference on Document Analysis and Recognition, pp. 657–660 (1999)Google Scholar
  7. 7.
    Chan, W., Sivaswamy, J.: Local energy analysis for text script classification. In: Proceedings of Image and Vision Computing New Zealand (1999)Google Scholar
  8. 8.
    Chan, W., Coghill, G.G.: Text analysis using local energy. Pattern Recognition 34(12), 2523–2532 (2001)MATHCrossRefGoogle Scholar
  9. 9.
    Digital Library of India, http://dli.iiit.ac.in/
  10. 10.
    Morrone, M.C., Burr, D.C.: Feature detection in human vision: A phase-dependent energy model. Proceedings of the Royal Society, London Series B 235, 221–245 (1988)CrossRefGoogle Scholar
  11. 11.
    PRTools, A.: Matlab Toolbox for Pattern Recognition, http://www.prtools.org/
  12. 12.
    Jain, A.K., Zhong, Y.: Page segmentation using texture analysis. Pattern Recognition 29, 743–770 (1996)CrossRefGoogle Scholar
  13. 13.
    Duda, R., Hart, P., Stork, D.: Pattern Classification, 2nd edn. John Wiley and Sons, New York (2001)MATHGoogle Scholar
  14. 14.
  15. 15.
    Zhitao, X., Chengming, G., Ming, Y., Qiang, L.: Research on log Gabor wavelet and its application in image edge detection. In: Sixth International Conference on Signal Processing (2002)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

  • Gopal Datt Joshi
    • 1
  • Saurabh Garg
    • 1
  • Jayanthi Sivaswamy
    • 1
  1. 1.Centre for Visual Information TechnologyIIITHyderabadIndia

Personalised recommendations