Abstract
We describe a new approach to the visual recognition of cursive handwriting. An effort is made to attain human-like performance by using a method based on pictorial alignment and on a model of the process of handwriting. The alignment approach permits recognition of character instances that appear embedded in connected strings. A system embodying this approach has been implemented and tested on five different word sets. The performance was stable both across words and across writers. The system exhibited a substantial ability to interpret cursive connected strings without recourse to lexical knowledge.
Additional information
SU is partially supported by NSF grant IRI-8900267.
Cite this article
Edelman, S., Flash, T. & Ullman, S. Reading cursive handwriting by alignment of letter prototypes. Int J Comput Vision 5, 303–331 (1990). https://doi.org/10.1007/BF00126503
DOI: https://doi.org/10.1007/BF00126503