Advertisement

A new cost function for typewritten digits segmentation

  • C. Rodríguez
  • J. Muguerza
  • M. Navarro
  • A. Zárate
  • J. I. Martín
  • J. M. Pérez
Poster Papers
Part of the Lecture Notes in Computer Science book series (LNCS, volume 1451)

Abstract

This work presents a solution to the problem of the segmentation of digits in forms characterized by its low quality, as well as the existence of breaks and touching digits. We propose a new function of segmentation that adds to two traditional techniques (vertical projections and Tsujimoto metric) information of background of the digit. Unlike other techniques reported in the literature, ours obtains a near-optimum number of break points in fields containing broken, blurred and touching characters, leading to high accuracy in the global OCR system. The accuracy obtained in the segmentation of the forms fields is of 99,74% on a sample of 11,283 fields of 144 forms of low quality, which provides a final accuracy to the automatic recognition process of 99,42% of digits correctly classified.

References

  1. [1]
    R.G. Casey, E. Lecolinet: A Survey of Methods and Strategies in Character Segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 18, no. 7, pp. 690–706, 1996.CrossRefGoogle Scholar
  2. [2]
    A.C. Downton, R.W.S. Tregidgo, E. Kabir: Recognition and Verification of Handwritten and Hand-printed British Postal Addresses. Character & Handwriting Recognition, Ed: P.S.P. Wang, pp. 265–291, World Scientific series in Computer Science Vol. 30, 1991.Google Scholar
  3. [3]
    Y. Lu: Machine Printed Character Segmentation — An overview. Pattern Recognition, Vol. 28, No. 1, pp. 67–80, 1995.CrossRefGoogle Scholar
  4. [4]
    J. Muguerza: Una Solución al Reconocimiento Automático de Dígitos Imprecisos en Formularios. Doctoral Thesis, Basque Country University, Spain, January 1996.Google Scholar
  5. [5]
    C. Rodriguez, J. Muguerza, M. Navarro, A. Zárate, J.1. Martin, J.M. Pérez: A Two-Stage Classifier for Broken and Blurred Digits in Forms. Accepted for presentation in the 2nd International Workshop on Statistical Techniques in Pattern Recognition, Sydney, Australia, 1998.Google Scholar
  6. [6]
    S. Tsujimoto, H. Asada: Resolving Ambiguity in Segmenting Touching Characters. The First International Conference on Document Analysis and Recognition, pp. 701–709, 1991.Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 1998

Authors and Affiliations

  • C. Rodríguez
    • 1
  • J. Muguerza
    • 1
  • M. Navarro
    • 1
  • A. Zárate
    • 1
  • J. I. Martín
    • 1
  • J. M. Pérez
    • 1
  1. 1.Computer Architecture and Technology DepartmentThe Basque Country University (UPV/EHU)DonostiaSpain

Personalised recommendations