Skip to main content

Perception and Action Without Veridical Metric Reconstruction: An Affine Approach

  • Chapter
Shape Perception in Human and Computer Vision

Part of the book series: Advances in Computer Vision and Pattern Recognition ((ACVPR))

Abstract

We present an alternate account of human 3D processing for perception and action. Traditional views postulate that visual mechanisms extract veridical metric properties of environmental objects. Instead, we argue that local affine information, which encodes non metric aspects of a 3D structure, like the depth-order of feature points, determines both our conscious perception of the world and our motor actions. Since only this information is accurately carried on from the earliest stages of 3D processing, our theory predicts large biases in perceptual and motor tasks that require veridical metric estimates. We describe empirical results that support this prediction and show the inadequacy of metric models as computational theories of 3D processing.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    We arbitrarily chose a zero disparity value for the flankers, as if the observers were fixating the plane identified by the two flanking rods.

  2. 2.

    In the specific case of disparity and velocity \(\mathit{SNR}_{c} = \sqrt{\mathit{SNR}_{d}^{2} + \mathit{SNR}_{v}^{2}}\).

  3. 3.

    Consider binocular disparities: The relative disparity d is proportional to the depth Δz of the distal object through a scaling factor k d , related to the egocentric distance of the object (d=k d Δz) and, therefore, to the Signal to Noise Ratio \((\mathit{SNR}_{d} = \frac {k_{d}}{\sigma_{d}}\Delta z)\). For a fixed value of k d , SNR d increases with Δz. Thus, “on average”, larger values of SNR correspond to larger values of Δz.

References

  1. Trommershauser J, Kording K, Landy M (2011) Sensory cue integration. Oxford University Press, New York

    Book  Google Scholar 

  2. Goodale MA (2011) Transforming vision into action. Vis Res 51:1567–1587

    Article  Google Scholar 

  3. Todd JT, Bressan P (1990) The perception of 3-dimensional affine structure from minimal apparent motion sequences. Percept Psychophys 48:419–430

    Article  Google Scholar 

  4. Todd JT, Norman JF (2003) The visual perception of 3-d shape from multiple cues: are observers capable of perceiving metric structure? Percept Psychophys 65:31–47

    Article  Google Scholar 

  5. Todd JT, Oomes AHJ, Koenderink JJ, Kappers AML (2001) On the affine structure of perceptual space. Psychol Sci 12:191–196

    Article  Google Scholar 

  6. Norman JF, Lappin JS, Norman HF (2000) The perception of length on curved and flat surfaces. Percept Psychophys 62:1133–1145

    Article  Google Scholar 

  7. Di Luca M, Domini F, Caudek C (2010) Inconsistency of perceived 3D shape. Vis Res 50:1519–1531

    Article  Google Scholar 

  8. Norman JF, Crabtree CE, Clayton AM, Norman HF (2005) The perception of distances and spatial relationships in natural outdoor environments. Perception 34:1315–1324

    Article  Google Scholar 

  9. Domini F, Shah R, Caudek C (2011) Do we perceive a flattened world on the monitor screen? Acta Psychol 138:359–366

    Article  Google Scholar 

  10. Johnston EB (1991) Systematic distortions of shape from stereopsis. Vis Res 31:1351–1360

    Article  Google Scholar 

  11. Domini F, Braunstein ML (1998) Recovery of 3-d structure from motion is neither Euclidean nor affine. J Exp Psychol Hum Percept Perform 24:1273–1295

    Article  Google Scholar 

  12. Domini F, Caudek C (1999) Perceiving surface slant from deformation of optic flow. J Exp Psychol Hum Percept Perform 25:426–444

    Article  Google Scholar 

  13. Domini F, Caudek C (2003) 3D structure perceived from dynamic information: a new theory. Trends Cogn Sci 7:444–449

    Article  Google Scholar 

  14. Domini F, Caudek C (2003) Recovering slant and angular velocity from a linear velocity field: modeling and psychophysics. Vis Res 43:1753–1764

    Article  Google Scholar 

  15. Fantoni C, Caudek C, Domini F (2010) Systematic distortions of perceived planar surface motion in active vision. J Vis 10:1–20. doi:10.1167/10.5.12

    Article  Google Scholar 

  16. Caudek C, Fantoni C, Domini F (2011) Bayesian modeling of perceived surface slant from actively-generated and passively-observed optic flow. PLoS ONE. doi:10.1371/journal.pone.0018731

    Google Scholar 

  17. Fantoni C, Caudek C, Domini F (2012) Perceived surface slant is systematically biased in the actively-generated optic flow. PLoS ONE. doi:10.1371/journal.pone.0033911

    Google Scholar 

  18. Domini F, Caudek C (2010) Matching perceived depth from disparity and velocity: modeling and psychophysics. Acta Psychol 133:81–89

    Article  Google Scholar 

  19. Domini F, Caudek C, Tassinari H (2006) Stereo and motion information are not independently processed by the visual system. Vis Res 46:1707–1723

    Article  Google Scholar 

  20. Richards W (1985) Structure from stereo and motion. J Opt Soc Am 2:343–349

    Article  Google Scholar 

  21. Foster R, Fantoni C, Caudek C, Domini F (2011) Integration of disparity and velocity information for haptic and perceptual judgments of object depth. Acta Psychol 136:300–310

    Article  Google Scholar 

  22. Campagnoli C, Volcic R, Domini F (2012) The same object and at least three different grip apertures. J Vis. doi:10.1167/12.9.429

    Google Scholar 

  23. Saunders JA, Knill DC (2003) Humans use continuous visual feedback from the hand to control fast reaching movements. Exp Brain Res 152:341–352

    Article  Google Scholar 

  24. Bingham G, Coats R, Mon-Williams M (2007) Natural prehension in trials without haptic feedback but only when calibration is allowed. Neuropsychologia 45:288–294

    Article  Google Scholar 

  25. Chen M, Chen K (1982) A transformational analysis of form recognition under plane isometries. J Math Psychol 26:237–251

    Article  MATH  Google Scholar 

  26. Van Gool M, Moons T, Pauwels E, Wagemans J Invariance from the Euclidean geometer’s perspective. Perception 23:547–561

    Google Scholar 

  27. Wagemans J, Van Gool L, Lamote C, Foster DH (2000) Minimal information to determine affine shape equivalence. J Exp Psychol Hum Percept Perform 26:443–468

    Article  Google Scholar 

  28. Domini F, Caudek C (2011) Combining image signals before 3D reconstruction: the intrinsic constraint model of cue integration. In: Trommershauser J, Kording K, Landy M (eds) Sensory cue integration. Oxford University Press, New York, pp 120–143

    Chapter  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Fulvio Domini .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag London

About this chapter

Cite this chapter

Domini, F., Caudek, C. (2013). Perception and Action Without Veridical Metric Reconstruction: An Affine Approach. In: Dickinson, S., Pizlo, Z. (eds) Shape Perception in Human and Computer Vision. Advances in Computer Vision and Pattern Recognition. Springer, London. https://doi.org/10.1007/978-1-4471-5195-1_20

Download citation

  • DOI: https://doi.org/10.1007/978-1-4471-5195-1_20

  • Publisher Name: Springer, London

  • Print ISBN: 978-1-4471-5194-4

  • Online ISBN: 978-1-4471-5195-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics