A new cost function for typewritten digits segmentation
This work presents a solution to the problem of the segmentation of digits in forms characterized by its low quality, as well as the existence of breaks and touching digits. We propose a new function of segmentation that adds to two traditional techniques (vertical projections and Tsujimoto metric) information of background of the digit. Unlike other techniques reported in the literature, ours obtains a near-optimum number of break points in fields containing broken, blurred and touching characters, leading to high accuracy in the global OCR system. The accuracy obtained in the segmentation of the forms fields is of 99,74% on a sample of 11,283 fields of 144 forms of low quality, which provides a final accuracy to the automatic recognition process of 99,42% of digits correctly classified.
Unable to display preview. Download preview PDF.
- A.C. Downton, R.W.S. Tregidgo, E. Kabir: Recognition and Verification of Handwritten and Hand-printed British Postal Addresses. Character & Handwriting Recognition, Ed: P.S.P. Wang, pp. 265–291, World Scientific series in Computer Science Vol. 30, 1991.Google Scholar
- J. Muguerza: Una Solución al Reconocimiento Automático de Dígitos Imprecisos en Formularios. Doctoral Thesis, Basque Country University, Spain, January 1996.Google Scholar
- C. Rodriguez, J. Muguerza, M. Navarro, A. Zárate, J.1. Martin, J.M. Pérez: A Two-Stage Classifier for Broken and Blurred Digits in Forms. Accepted for presentation in the 2nd International Workshop on Statistical Techniques in Pattern Recognition, Sydney, Australia, 1998.Google Scholar
- S. Tsujimoto, H. Asada: Resolving Ambiguity in Segmenting Touching Characters. The First International Conference on Document Analysis and Recognition, pp. 701–709, 1991.Google Scholar