Skip to main content
Log in

A New Model of Computation for Learning Vision Modules from Examples

  • Published:
Journal of Mathematical Imaging and Vision Aims and scope Submit manuscript

Abstract

This paper addresses an important class of mimicry problems, where the goal is to construct a computer program which is functionally equivalent to an observed behaviour. Computer vision research can be considered such a challenge, where a researcher attempts to impart human visual abilities to a computer. Unfortunately this has proved a difficult task, not least because our vision processes occur mostly at a subconscious level. It is therefore useful to study the general mimicry problem in order to develop tools which may assist computer vision research.

This paper formalises a mimicry problem as one in which a computer learning system (L) constructs a solution from a given program structure (i.e. template or outline) by posing questions to an Oracle. The latter is an entity which, when given an input value, produces the corresponding output of the function which is to be mimicked.

In order to define a program's structure, particularly one which can be extracted from any computer program automatically, a new model of computation is developed. Based on this a fast algorithm which determines the best questions to pose to the Oracle is then described. Thus L relieves the human programmer of the difficulties faced in choosing the examples from which to learn. This is important because a human programmer might inadvertently choose biased, redundant or otherwise unhelpful examples. Results are shown which demonstrate the utility of a complete learning system (L) based on this work.

This paper represents background theory and initial algorithms which further work will extend into powerful automatic learning systems, examples of which are found in [36] and [38].

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. F. Arman and J. Aggarwal, “Model based object recognition in dense range images-A review,” ACM Computing Surveys, Vol. 25, No. 1, pp. 5–43, 1993.

    Google Scholar 

  2. D. Beymer and T. Poggio, “Learning to see,” IEEE Spectrum, Vol. 33, No. 5, pp. 60–69, 1997.

    Google Scholar 

  3. B. Bhanu and T. Poggio, “Editorial,” IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 16, No. 9, pp. 865–868, 1994.

    Google Scholar 

  4. A.K. Bhattacharjya and B. Roysam, “Joint solution of low, intermediate, and high-level vision tasks by evolutionary optimisation: Application to computer vision at low snr,” Neural Networks, Vol. 5, No. 1, pp. 83–95, 1994.

    Google Scholar 

  5. M. Brady, “Forms of knowledge in some machine vision systems,” Philosophical Transactions of the Royal Society: London Series B, Vol. 352, No. 1358, pp. 1241–1248, 1997.

    Google Scholar 

  6. M.L. Braunstein, “Contrasts between human and machine vision: Should technology recapitulate phylogeny,” in Human and Machine Vision, J. Beck, B. Hope, and A. Rosenfeld (Eds.), Academic Press, 1983, pp. 85–96.

  7. D.S. Bridges, Computability: A Mathematical Sketchbook, Springer-Verlag, 1994.

  8. N.H. Bshouty, R. Cleve, R. Gavalda, S. Kannan, and C. Taman, “Oracles and queries that are sufficient for exact learning,” Journal of Computer and System Sciences, Vol. 52, pp. 421–433, 1996.

    Google Scholar 

  9. J. Canny, “A computational approach to edge detection,” IEEE Transactions on Pattern Analysis, Vol. 8, pp. 769–798, 1986.

    Google Scholar 

  10. N. Cerneaz, “Model-based analysis of mammograms,” Ph.D. thesis, Department of Engineering Science, Oxford University, 1994.

  11. R. Chin and C. Dyer, “Model based recognition in robot vision,” ACM Computing Surveys, Vol. 18, No. 1, pp. 67–108, 1986.

    Google Scholar 

  12. K. Cho and P. Meer, “Image segmentation from consensus information,” CVGIP: Image Understanding, Vol. 68, No. 1, pp. 72–89, 1997.

    Google Scholar 

  13. D. Cliff and J. Noble, “Knowledge-based vision and simple visual machines,” Philosophical Transactions of the Royal Society: London Series B, Vol. 352, No. 1358, pp. 1165–1175, 1997.

    Google Scholar 

  14. D. Crevier and R. Lepage, “Knowledge-based image understanding systems,” CVGIP: Image Understanding, Vol. 67, No. 2, pp. 161–185, 1997.

    Google Scholar 

  15. N.J. Cutland, Computability, Cambridge University Press, 1989.

  16. J.C. Daugman, “Pattern and motion vision without laplacian zero crossings,” Journal of the Optical Society of America: A, Vol. 5, No. 7, pp. 1142–1148, 1988.

    Google Scholar 

  17. E.R. Davies, Machine Vision, Academic Press, 1997.

  18. S. Devries, A. Kappers, and J. Koenderink, “Shape from stereo-A systematic approach using quadratic surfaces,” Perception and Psychophysics, Vol. 53, No. 1, pp. 71–80, 1993.

    Google Scholar 

  19. B. Draper, A. Hanson, and E. Riseman, “Knowledge-directed vision: Control learning and integration,” Proceedings of the IEEE, Vol. 84, No. 11, pp. 1625–1637, 1996.

    Google Scholar 

  20. R. Erens, A. Kappers, and J. Koenderink, “Estimating local shape from shading in the presence of global shading,” Perception and Psychophysics, Vol. 54, No. 3, pp. 334–342, 1993.

    Google Scholar 

  21. M.R. Garey and D.S. Johnson, Computers and Intractability: A Guide to the Theory of NP-Completeness, W.H. Freeman&Co., 1979.

  22. G. Grandlund, H. Knutsson, C.J. Westelius, and J. Wiklund, “Issues in robot vision,” Image and Vision Computing, Vol. 12, No. 3, pp. 131–148, 1994.

    Google Scholar 

  23. W.E.L. Grimson, “The intelligent camera-images of computer vision,” Proceedings of the National Academy of Sciences of the USA, Vol. 90, No. 21, pp. 9791–9794, 1993.

    Google Scholar 

  24. B.K.P. Horn, Robot Vision, MIT Press, 1986.

  25. B.K.P. Horn and M.J. Brooks, Shape from Shading, MIT Press, 1989.

  26. G. Humphreys, M. Riddoch, and C. Price, “Top-down processes in object identification: Evidence from experimental psychology, neurophysiology and functional anatomy,” Philosophical Transactions of the Royal Society: London Series B, Vol. 352, No. 1358, pp. 1275–1282, 1997.

    Google Scholar 

  27. A. Jain and C. Dorai, “Practising vision: Integration, evaluation and applications,” Pattern Recognition, Vol. 30, No. 2, pp. 183–196, 1997.

    Google Scholar 

  28. R.C. Jain and T.O. Binford, “Ignorance, myopia, and naivet´e in computer vision systems,” CVGIP-Image Understanding, Vol. 53, No. 1, pp. 112–117, 1991.

    Google Scholar 

  29. E. Johnston, B. Cumming, and A. Parker, “Integration of depth modules-Stereopsis and texture,” Vision Research, Vol. 33, Nos. 5–6, pp. 813–826, 1993.

    Google Scholar 

  30. J.M. Jolion, “Computer vision methodologies,” CVGIP-Image Understanding, Vol. 59, No. 1, pp. 53–71, 1994.

    Google Scholar 

  31. M. Jones, P. Sinha, T. Vetter, and T. Poggio, “Top-down learning of low level vision tasks,” Current Biology, Vol. 7, No. 12, pp. 991–994, 1997.

    Google Scholar 

  32. D. Marr, Vision, Freeman & Co., 1982.

  33. J.D. McCafferty, Human and Machine Vision, Ellis Horwood: Chichester, West Sussex, England, 1990.

    Google Scholar 

  34. M. Mirmehdi, P. Palmer, and J. Kittler, “Genetic optimisation of the image feature extraction process,” Pattern Recognition Letters, Vol. 18, No. 4, pp. 355–365, 1997.

    Google Scholar 

  35. K. Nakayama, S. Shimojo, and G. Silverman, “Stereoscopic depth and occluded depth recognition,” Perception, Vol. 18, No. 1, pp. 55–68, 1989.

    Google Scholar 

  36. R.A. Newman, “Automatic learning in computer vision,” Ph.D. thesis, Oxford University, 1998.

  37. R.A. Newman, “Automatic learning in computer vision,” 1998, in progress.

  38. R.A. Newman, “Madura: A language for learning computer programs from examples,” Journal of Mathematical Imaging and Vision, Vol. 11, No. 1, pp. 65–90, 1999.

    Google Scholar 

  39. D.W. Patterson, Artificial Neural Networks: Theory and Applications, Prentice Hall, 1996.

  40. P. Robertson and M. Brady, “Adaptive image analysis for aerial surveillance,” IEEE Expert, 1998, to appear.

  41. A. Rosenfeld, “Eyes for computers: How hal could see,” in HAL' Legacy, D. Stork (Ed.), MIT Press, 1997.

  42. M. Tarr and M. Black, “A computational and evolutionary perspective on the role of representation in vision,” CVGIP: Image Understanding, Vol. 60, No. 1, pp. 65–73, 1994.

    Google Scholar 

  43. A. Toet and W.P. Hajema, “Genetic contour matching,” Pattern Recognition Letters, Vol. 16, pp. 849–856, 1995.

    Google Scholar 

  44. S. Ullman, “Tacit assumptions in the computational study of vision,” in Representations of Vision, A. Gorea (Ed.), Cambridge University Press, 1991, pp. 305–318.

  45. S. Ullman, “Sequence seeking and counter streams: A computational model for bidirectional information flow in the visual cortex,” Cerebral Cortex, Vol. 5, No. 1, pp. 1–11, 1995.

    Google Scholar 

  46. L.G. Valiant, “A theory of the learnable,” Communications of the ACM, Vol. 27, No. 11, pp. 1134–1142, 1984.

    Google Scholar 

  47. Y. Venkatesh, “Some aspects of information processing in biological vision,” Current Science, Vol. 68, No. 2, pp. 168–184, 1995.

    Google Scholar 

  48. R. Vogt, Automatic Generation of Morphological Set Recognition Algorithms, Springer-Verlag, 1989.

  49. S.W. Zucker, “Endowing AI with vision-A biological and computational perspective,” Current Science, Vol. 64, No. 6, pp. 407–418, 1993.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

About this article

Cite this article

Newman, R.A. A New Model of Computation for Learning Vision Modules from Examples. Journal of Mathematical Imaging and Vision 11, 45–63 (1999). https://doi.org/10.1023/A:1008369211484

Download citation

  • Issue Date:

  • DOI: https://doi.org/10.1023/A:1008369211484

Navigation