Optimal Feature Extraction for Bilingual OCR

  • D. Dhanya
  • A. G. Ramakrishnan
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 2423)


Feature extraction in bilingual OCR is handicapped bythe increase in the number of classes or characters to be handled. This is evident in the case of Indian languages whose alphabet set is large. It is expected that the complexityof the feature extraction process increases with the number of classes. Though the determination of the best set of features that could be used cannot be ascertained through anyquan titative measures, the characteristics of the scripts can help decide on the feature extraction procedure. This paper describes a hierarchical feature extraction scheme for recognition of printed bilingual (Tamil and Roman) text. The scheme divides the combined alphabet set of both the scripts into subsets bythe extraction of certain spatial and structural features. Three features viz geometric moments, DCT based features and Wavelet transform based features are extracted from the grouped symbols and a linear transformation is performed on them for the purpose of efficient representation in the feature space. The transformation is obtained bythe maximization of certain criterion functions. Three techniques : Principal component analysis, maximization of Fisher’s ratio and maximization of divergence measure have been employed to estimate the transformation matrix. It has been observed that the proposed hierarchical scheme allows for easier handling of the alphabets and there is an appreciable rise in the recognition accuracyas a result of the transformations.


Feature Vector Feature Extraction Discrete Cosine Transform Discrete Wavelet Transform Recognition Accuracy 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


  1. 1.
    Dhanya, D., Ramakrishnan, A.G.: Script identification in printed bilingual documents. Published in the same (2002).Google Scholar
  2. 2.
    A. Sinha: Improved recognition module for the identification of handwritten digits. Master’s thesis, Department of Electrical Engineering and Computer Science, Massachuset (1999)Google Scholar
  3. 3.
    Teh, C.H., Chin, R.T.: On image analysis by method of moments. IEEE Transaction on Pattern Analysis and Machine Intelligence 10 (1993) 496–513CrossRefGoogle Scholar
  4. 4.
    Khotanzad, A., Hong, Y.H.: Rotation invariant image representation using features selected via a systematic method. Pattern Recognition 23 (1990) 1089–1101CrossRefGoogle Scholar
  5. 5.
    Bailey, R.R.: Orthogonal moment features for use with parametric and nonparametric classifiers. IEEE Transaction on Pattern Analysis and Machine Intelligence 18 (1996) 389–399CrossRefGoogle Scholar
  6. 6.
    P.C. Loizou, Spanias, A.: Improved speech recognition using a subspace projection approach. IEEE Transaction on Speech and Audio Processing 7 (1999)Google Scholar
  7. 7.
    Dhanya, D.: Bilingual ocr for tamil and roman scripts. Master’s thesis, Department of Electrical Engineering, Indian Institute of Science (2001)Google Scholar
  8. 8.
    Hamamoto, Y., et al.: A bootstrap technique for nearest neighbour classifier design. IEEE Transaction on Pattern Analysis and Machine Intelligence 19 (1993) 73–79CrossRefGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2002

Authors and Affiliations

  • D. Dhanya
    • 1
  • A. G. Ramakrishnan
    • 1
  1. 1.Department of Electrical EngineeringIndian Institute of ScienceBangaloreIndia

Personalised recommendations