A Novel GA Based OCR Enhancement and Segmentation Methodology for Marathi Language in Bimodal Framework

  • Amarjot Singh
  • Ketan Bacchuwar
  • Akash Choubey
Conference paper
Part of the Communications in Computer and Information Science book series (CCIS, volume 139)


Automated learning systems used to extract information from images play a major role in document analysis. Optical character recognition or OCR has been widely used to automatically segment and index the documents from a wide space. Most of the methods used for OCR recognition and extraction like HMM’s, Neural etc, mentioned in literature have errors which require human operators to be rectified and fail to extract images with blur as well as illumination variance. This paper explains proposes an enhancement supported threshold based pre-processing methodology for word spotting in Marathi printed bimodal images using image segmentation. The methodology makes use of an enhanced image obtained by histogram equalization followed by followed by age segmentation using a specific threshold. The threshold can be obtained using genetic algorithms. GA based segmentation technique is codified as an optimization problem used efficiently to search maxima and minima from the histogram of the image to obtain the threshold for segmentation. The system described is capable of extracting normal as well as blurred images and images for different lighting conditions. The same inputs are tested for a standard GA based methodology and the results are compared with the proposed method. The paper further elaborates the limitations of the method.


OCR Genetic Algorithm Bimodal Blur Illumination 


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Weszka, J.S., Nagel, R.N., Rosenfeld, A.: A Threshold selection technique. IEEE Trans. Computer C-23, 1322–1326 (1974)CrossRefzbMATHGoogle Scholar
  2. 2.
    Watanable, S., CYBEST Group.: An automated apparatus for cancer processing. Comp. Graph. Image processing 3, 350–358 (1974)CrossRefGoogle Scholar
  3. 3.
    Di Gesh, V.: A Clustering Approach to Texture Classification. In: Jain, A.K. (ed.) Real Time Object and Environment Measurement and Classification. NATO AS1 Series F, vol. 42, Springer, Heidelberg (1988)Google Scholar
  4. 4.
    Fu, K.S., Mui, J.K.: A Survey on Image segmentation. Pattern Recognition 13, 3–16 (1981)MathSciNetCrossRefGoogle Scholar
  5. 5.
    Pal, N.R., Pal, S.K.: A Review on Image Segmentation Techniques. Pattern Recognition 26(9), 1277–1294 (1993)CrossRefGoogle Scholar
  6. 6.
    Kalas, M.S.: An Artificial Neural Network for Detection of Biological Early Brain Cancer. International Journal of Computer Applications 1(6), 17–23 (2010)MathSciNetCrossRefGoogle Scholar
  7. 7.
    IP, H.H.-S., Chan, S.-L.: Hypertext-Assisted Video Indexing and Content- based Retrieval. ACM 0-89791-866-5, 232–233 (1997)Google Scholar
  8. 8.
    Storn, R., Price, K.: Differential Evolution – A Simple and Efficient Heuristic for Global Optimization over Continuous Spaces. Journal of Global Optimization, 341–359 (1997)Google Scholar
  9. 9.
    Kao, Y.-T., Zahara, E.: A hybrid genetic algorithm and particle swarm optimization for multimodal functions. Applied Soft Computing 8(2), 849–857 (2008)CrossRefGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2011

Authors and Affiliations

  • Amarjot Singh
    • 1
  • Ketan Bacchuwar
    • 1
  • Akash Choubey
    • 1
  1. 1.Dept. of Electrical EngineeringNIT WarangalIndia

Personalised recommendations