Measuring the Robustness of Character Shape Coding
Earlier claims of great robustness for the character shape coding process have been largely unsupported.We provide quantitative measures of the sensitivity of the character shape coding process to the text input, production values and image quality and to the complexity of the destination character shape codes. Using this evaluation tool we can tune the character shape coding process in a systematic way and also develop new versions of the shape codes appropriately adapted to particular applications.
KeywordsText Image Document Image Optical Character Recognition High Frequency Word Character Shape
- 1.A. L. Spitz, “ Generalized line, word and character finding”, Progress in Image Analysis and Processing III. S. Impedovo (ed.), pp. 377–383, World Scientific, (1993).Google Scholar
- 2.A. L. Spitz, “ Text characterization by connected component transformation”, Proc. SPIE, San Jose, CA, pp. 97–105, (1994).Google Scholar
- 4.J. Esakov, D. P. Lopresti & J. S. Sandberg., “ Classification and distribution of optical character recognition errors”, Proc. SPIE, San Jose, CA, pp. 204–216, (1994).Google Scholar
- 5.H. S. Baird, “ Document image defect models”, H.S. Baird, H. Bunke, and K. Yamamoto, eds., Structured Document Analysis. New York, Springer-Verlag, pp.546–556, (1992).Google Scholar