Skip to main content
Log in

Encoding of three-dimensional structure-from-motion by primate area MT neurons

  • Letter
  • Published:

From Nature

View current issue Submit your manuscript

Abstract

We see the world as three-dimensional, but because the retinal image is flat, we must derive the third dimension, depth, from two-dimensional cues. Image movement provides one of the most potent cues for depth1,2,3,4,5,6. For example, the shadow of a contorted wire appears flat when the wire is stationary, but rotating the wire causes motion in the shadow, which suddenly appears three-dimensional. The neural mechanism of this effect, known as ‘structure-from-motion’, has not been discovered. Here we study cortical area MT, a primate region that is involved in visual motion perception. Two rhesus monkeys were trained to fixate their gaze while viewing two-dimensional projections of transparent, revolving cylinders. These stimuli appear to be three-dimensional, but the surface order perceived (front as opposed to back) tends to reverse spontaneously. These reversals occur because the stimulus does not specify which surface is in front or at the back. Monkeys reported which surface order they perceived after viewing the stimulus. In many of the neurons tested, there was a reproducible change in activity that coincided with reversals of the perceived surface order, even though the stimulus remained identical. This suggests that area MT has a basic role in structure-from-motion perception.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Figure 1: The monkeys' task and average performance.
Figure 2: Data from an MT neuron.
Figure 3: Averaged data from ‘correlated’ MT cells (n = 27).
Figure 4: Proposed model to explain how suppression and facilitation in MT could give rise to the illusion of depth: a, a cylinder projection could activate four neuronal pools; b, because of excitatory and inhibitory interactions, activity migrates into opposite-direction, separate-depth channels.

Similar content being viewed by others

References

  1. Rogers, B. J. & Graham, M. E. Similarities between motion parallax and stereopsis in human depth perception. Vision Res. 22, 261–270 (1982).

    Article  CAS  Google Scholar 

  2. Miles, W. R. Movement interpretations of the silhouette of a revolving fan. Am. J. Psychol. 43, 392–405 (1931).

    Article  Google Scholar 

  3. Wallach, H. & O'Connell, D. N. The kinetic depth effect. J. Exp. Psychol. 45, 205–217 (1953).

    Article  CAS  Google Scholar 

  4. Ringach, D. L., Hawken, M. J. & Shapley, R. Binocular eye-movements caused by the perception of 3-dimensional structure-from-motion. Vision Res. 36, 1479–1492 (1996).

    Article  CAS  Google Scholar 

  5. Siegel, R. M. & Andersen, R. A. Perception of three-dimensional structure from motion in monkey and man. Nature 331, 259–261 (1988).

    Article  ADS  CAS  Google Scholar 

  6. Andersen, R. A. & Siegel, R. M. in Signal and Sense: Local and Global Order in Perceptual Maps (eds Edelman, G. M., Gall, W. E. & Cowan, W. M.) 163–184 (Wiley, New York, 1990).

    Google Scholar 

  7. Maunsell, J. H. & Van Essen, D. C. Functional properties of neurons in middle temporal visual area of the macaque monkey. II. Binocular interactions and sensitivity to binocular disparity. J. Neurophysiol. 49, 1148–1167 (1983).

    Article  CAS  Google Scholar 

  8. Bradley, D. C., Qian, N. & Andersen, R. A. Integration of motion and stereopsis in middle temporal cortical area of macaques. Nature 373, 609–611 (1995).

    Article  ADS  CAS  Google Scholar 

  9. Snowden, R. J., Treue, S., Erickson, R. G. & Andersen, R. A. The response of area MT and V1 neurons to transparent motion. J. Neurosci. 11, 2768–2785 (1991).

    Article  CAS  Google Scholar 

  10. Hildreth, E. C., Ando, H., Andersen, R. A. & Treue, S. Recovering three-dimensional structure from motion with surface reconstruciton. Vision Res. 35, 117–137 (1995).

    Article  CAS  Google Scholar 

  11. Nawrot, M. & Blake, R. Aneural network model of kinetic depth. Visual Neurosci. 6, 219–227 (1991).

    Article  CAS  Google Scholar 

  12. Treue, S. & Maunsell, J. H. R. Attentional modulation of visual-motion processing in cortical areas MT and MST. Nature 382, 539–541 (1996).

    Article  ADS  CAS  Google Scholar 

  13. Newsome, W. T., Britten, K. H. & Movshon, J. A. Neuronal correlates of a perceptual decision. Nature 341, 52–54 (1989).

    Article  ADS  CAS  Google Scholar 

  14. Logothetis, N. K. & Schall, J. D. Neuronal correlates of subjective visual perception. Science 245, 761–763 (1989).

    Article  ADS  CAS  Google Scholar 

  15. Braddick, O. Segmentation versus integration in visual-motion processing. Trends Neurosci. 16, 263–268 (1993).

    Article  CAS  Google Scholar 

Download references

Acknowledgements

We thank B. Gillikin and S. Gertmenian for technical assistance and F. Crick and C.Koch for comments on the manuscript. Supported by grants from the National Eye Institute, the Human Frontier Science Program, and the Sloan Foundation for Theoretical Neurobiology.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Richard A. Andersen.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Bradley, D., Chang, G. & Andersen, R. Encoding of three-dimensional structure-from-motion by primate area MT neurons. Nature 392, 714–717 (1998). https://doi.org/10.1038/33688

Download citation

  • Received:

  • Accepted:

  • Issue Date:

  • DOI: https://doi.org/10.1038/33688

  • Springer Nature Limited

This article is cited by

Navigation