Advertisement

International Journal of Computer Vision

, Volume 5, Issue 3, pp 303–331 | Cite as

Reading cursive handwriting by alignment of letter prototypes

  • Shimon Edelman
  • Tamar Flash
  • Shimon Ullman
Article

Abstract

We describe a new approach to the visual recognition of cursive handwriting. An effort is made to attain human-like performance by using a method based on pictorial alignment and on a model of the process of handwriting. The alignment approach permits recognition of character instances that appear embedded in connected strings. A system embodying this approach has been implemented and tested on five different word sets. The performance was stable both across words and across writers. The system exhibited a substantial ability to interpret cursive connected strings without recourse to lexical knowledge.

Keywords

Image Processing Artificial Intelligence Computer Vision Computer Image Visual Recognition 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Aho, A.V., Hopcroft, J.E., and Ullman, J.D. 1974. Design and Analysis of Computer Algorithms. Addison-Wesley: Reading, MA.Google Scholar
  2. Ben-Israel, A., and Greville, T.N.E. 1974. Generalized Inverses: Theory and Applications. Wiley: New York.Google Scholar
  3. Biederman, I. 1985. Human image interpretation: recent experiments and a theory. Comput. Vision, Graphics, Image Process. 32: 1–47.Google Scholar
  4. Bookstein, F.L. 1978. The measurement of biological shape and shape change, Lecture Notes in Biomathematics, 24. S.Levin, ed., Springer-Verlag: Berlin.Google Scholar
  5. Božinović, R.M., and Srihari, S.N. 1985. ROCS: a system for reading off-line cursive script. SUNY at Buffalo TR-85-13, September 1985.Google Scholar
  6. Burr, D.J. 1981. Elastic matching of line drawings. IEEE Trans. Patt. Anal. Mach. Intell. PAMI 3: 708–713.Google Scholar
  7. Burr, D.J. 1983. Matching clastic templates, 260–270. In Physical and Biological Processing of Images, O.J.Braddiek and A.C.Sleigh (eds.), Springer-Verlag: Berlin.Google Scholar
  8. Chen, S., and Penna, M. 1986. Shape and motion of nonrigid bodies. Comput. Vision, Graphics, Image Process. 36: 175–207.Google Scholar
  9. deBoor, C., and Lynch, R.E. 1966. On splines and their minimum properties. J. Math. Mech. 15: 953–969.Google Scholar
  10. Duda, R.O., and Hart, P.E. 1973. Pattern Classification and Scene Analysis. Wiley: New York.Google Scholar
  11. Duvernoy, J., and Charraut, D. 1979. Stability and stationarity of cursive handwriting. Pattern Recognition 11: 145–154.Google Scholar
  12. Edeiman, S. 1988. Reading and writing cursive script: A computational study. Ph.D. thesis, Dept. of Applied Math., Weizmann Institute of Science.Google Scholar
  13. Edelman, S., and Flash, T. 1987. A model of handwriting, Biological Cyberneties 57: 25–36.Google Scholar
  14. Eden, M. 1961. On the formalization of handwriting, Proc. Symp. Appl. Math. 12: 83–88. Amer. Math. Soc., Providence, RI.Google Scholar
  15. Fiash, T. 1983. Organizing principles underlying the formation of arm trajectories. Ph.D. thesis, Harvard MIT Div. of Health Sciences and Technology.Google Scholar
  16. Flash, T., and Hogan, N. 1985. The coordination of arm movements: An experimentally confirmed mathematical model, Journal of Neuroscience 5 (7): 1688–1703.Google Scholar
  17. Foster, D.H. 1975. An approach to the analysis of the underlying structure of visual space using a generalized notion of visual pattern recognition. Biological Cybernetics 17: 77–79.Google Scholar
  18. Goad, C. 1986. Fast 3-D model-based vision, 371–391. In From Pixels to Predicates. A.P.Pentland (ed.), Ablex: Norwood, NJ.Google Scholar
  19. Grimson, W.E.L. 1981. From Images to Surfaces: A Computational Study of the Human Early Visual System. MIT Press: Cambridge, MA.Google Scholar
  20. Grimson, W.E.L., and Lozano-Perez, T. 1987. Locating overlapping parts by searching the interpretation tree. IEEE Trans. Patt. Anal. Mach. Intell. PAMI 9: 469–482.Google Scholar
  21. Halle, M., and Stevens, K. 1962. Speech recognition: a model and a program for research. Inst. Rad. Eng. Trans. IT 8: 155–159.Google Scholar
  22. Hanson, A.R., Riseman, E.M., and Fisher, E. 1976. Context in word recognition. Pattern Recognition 8: 35–45.Google Scholar
  23. Hayes, K.C.Jr. 1980. Reading handwritten words using hierarchical relaxation. Comput. Graphics, Image Process 14: 344–364.Google Scholar
  24. Hogan, N. 1982. Control and coordination of voluntary arm movements. In Proc. 1982 Amer. Control Conf., M.J. Rabins and Y. Bar-Shalom, (eds.), pp. 522–528.Google Scholar
  25. Hogan, N. 1984. An organizing principle for a class of voluntary movements, Journal of Neuroscience 4 (11): 2745–2754.Google Scholar
  26. Huttenlocher, D.P., and Ullman, S. 1987. Object recognition using alignment, Proc. 1st Intern. Conf. Comput. Vision, London, pp. 102–111.Google Scholar
  27. Kahan, S., Pavlidis, T., and Baird, H.S. 1987. On the recognition of printed characters of any font and size. IEEE Trans. Patt. Anal. Mach. Intell. PAMI 9: 274–287.Google Scholar
  28. Karlin, S. 1969. The fundamental theorem of algebra for monosplines satisfying certain boundary conditions and applictions to optimal quadrature formulas. In Approximations with Special Emphasis on Spline Functions. J.J.Bercuberg (ed.) Academic Press: New York, pp. 467–484.Google Scholar
  29. Paivio, A. 1978. The relationship between verbal and perceptual codes, Handbook of Perception E.C.Carterette and M.P.Friedman (eds.), Academic Press: New York, vol. 8, pp. 375–397.Google Scholar
  30. Pavlidis, T. 1977. Structural Pattern Recognition, Springer-Verlag, Berlin.Google Scholar
  31. Pearl, J. 1984. Heuristics. Addison-Wesley: Reading, MA.Google Scholar
  32. Persoon, E., and Fu, K.S. 1977. Shape discrimination using Fourier descriptors. IEEE Trans. Syst. Man Cybern. SMC 7: 170–179.Google Scholar
  33. Ritter, K. 1969. Generalized spline interpolation and nonlinear programming. In Approximations, with Special Emphasis on Spline Functions, I.J.Schoenberg (ed.), Academic Press: New York, pp. 75–117.Google Scholar
  34. Serra, J. 1982. Image Analysis and Mathematical Morphology. Academic Press; New York.Google Scholar
  35. Sha'ashua, A., and Ullman, S. 1988. Structural saliency: The detection of globally salient structures using a locally connected network. Proc. 2nd Intern. Conf. Comput. Vision, Tampa, FL, pp. 321–327.Google Scholar
  36. Srihari, S.N., and Božinović, R.M. 1987. A multi-level perception approach to reading cursive script. Artificial Intelligence 33: 217–255.Google Scholar
  37. Stentiford, F.W.M. 1985. Automatic feature design for optical character recognition using an evolutionary search procedure, IEEE Trans. Patt. Anal. Mach. Intell. PAMI 7: 349–355.Google Scholar
  38. Suen, C.Y. 1983. Handwriting generation, perception and recognition. Acta Psychologica 54: 295–312.Google Scholar
  39. Suen, C.Y. Berthod, M., and Mori, S. 1980. Automatic recognition of handprinted characters-The state of the art. Proc. IEEE 68: 469–487.Google Scholar
  40. Torre, V., and Poggio, T.A. 1986. On edge detection. IEEE Trans. Patt. Anal. Mach. Intell. PAMI 8 (2): 147–163.Google Scholar
  41. Travers, J.R., and Olivier, D.C. 1978. Pronounceability and statistical “Englishness” as determinants of letter identification. Amer. J. Psychol. 91: 523–538.Google Scholar
  42. Ullman, S. 1986. An approach to object recognition: aligning pictorial descriptions. MIT AI Memo 931, December 1986.Google Scholar
  43. Yoshida, M., and Eden, M. 1973. Handwritten Chinese character recognition by an analysis-by-synthesis method, Proc. 1st Conf. Patt. Recog. pp. 197–204.Google Scholar
  44. Zwikker, C. 1963. The advanced geometry of plane curves and their applications. Dover: New York.Google Scholar

Copyright information

© Kluwer Academic Publishers 1990

Authors and Affiliations

  • Shimon Edelman
    • 1
  • Tamar Flash
    • 1
  • Shimon Ullman
    • 2
    • 3
    • 4
  1. 1.Department of Applied Mathematics and Computer ScienceThe Weizmann Institute of ScienceRehovotIsrael
  2. 2.Department of Applied Mathematics and Computer ScienceThe Weizmann Institute of ScienceRehovotIsrael
  3. 3.Department of Brain and Cognitive SciencesMITCambridgeUSA
  4. 4.the Artificial Intelligence LaboratoryMITCambridgeUSA

Personalised recommendations