Abstract
A primary goal of visual encoding is to determine the inherently three-dimensional (3D) shape and motion structure of the visual scene to provide a functional representation of the scene layout and trajectories of the objects within it. A surface representation is a natural means of encoding the information from the sparse visual cues available, and psychophysical data suggest that surface interpolation is only possible through 3D interpretation of the depth structure provided by local cues; luminance-based interpolation fails in the absence of a 3D interpretation (whether from 2D or 3D cues). Functional imaging studies suggest that this 3D interpretation is located at the dorsal extreme of the lateral occipital complex. The neural 3D interpretation is limited by smoothness constraints derived from stereoscopic studies, although piecewise discontinuities are also permitted. The dimensions of its representational space, however, are not easily conceptualized in the 3D space that the shapes inhabit, but require the much larger configurational space of all recognizable 3D shapes. In this context, ‘shape’ is a conceptual abstraction of those surface configurations specifiable within our relatively limited cognitive window.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Chum O, Philbin J, Sivic J, Isard M, Zisserman A (2007) Total recall: automatic query expansion with a generative feature model for object retrieval. In: Proc ICCV 07, Rio de Janeiro, Brazil, pp 1–8
Gerrits HJ, Vendrik AJ (1970) Simultaneous contrast, filling-in process and information processing in man’s visual system. Exp Brain Res 11:411–430
Gregory RL (1963) Distortion of visual space as inappropriate constancy scaling. Nature 199:678–680
Grossberg S, Kuhlmann L, Mingolla E (2007) A neural model of 3D shape-from-texture: multiple-scale filtering, boundary grouping, and surface filling-in. Vis Res 47:634–672
Grossberg S, Yazdanbakhsh A (2005) Laminar cortical dynamics of 3D surface perception: stratification, transparency, and neon color spreading. Vis Res 45:1725–1743
Hess RF, Holliday IE (1992) The coding of spatial position by the human visual system: effects of spatial scale and contrast. Vis Res 32:1085–1097
Hibbard PB (2005) The orientation bandwidth of cyclopean channels. Vis Res 45:2780–2785
Holway AE, Boring EG (1941) Determinants of apparent visual size with distance variant. Am Psychol 51:21–37
Johnston EB (1991) Systematic distortions of shape from stereopsis. Vis Res 31:1351–1360
Julesz B (1971) Foundations of cyclopean perception. University of Chicago Press, Chicago
Kontsevich LK, Tyler CW (1998) How much of the visual object is used in estimating its position? Vis Res 38:3025–3029
Levi DM, Klein SA, Wang H (1994) Discrimination of position and contrast in amblyopic and peripheral vision. Vis Res 34:3293–3313
Likova LT, Tyler CW (2003) Peak localization of sparsely sampled luminance patterns is based on interpolated 3D surface representation. Vis Res 43:2649–2657
Marr D (1982) Vision: a computational investigation into the human representation and processing of visual information. Freeman, New York
Morgan MJ, Watt RJ (1982) Mechanisms of interpolation in human spatial vision. Vis Res 25:1661–1674
Ovsjanikov M, Bronstein AM, Bronstein MM, Guibas LJ (2009) ShapeGoogle: a computer vision approach for invariant shape retrieval. In: Proc workshop on nonrigid shape analysis and deformable image alignment NORDIA, Kyoto, Japan, pp 320–327
Papert S (1964) Stereoscopic synthesis as a technique for locating visual mechanisms. MIT Q Pro Rep 73:239–243
Paradiso MA, Nakayama K (1991) Brightness perception and filling-in. Vis Res 31:1221–1236
Regan D, Hamstra SJ (1994) Shape discrimination for rectangles defined by disparity alone, by disparity plus luminance and by disparity plus motion. Vis Res 34:2277–2291
Scarfe P, Hibbard PB (2006) Disparity-defined objects moving in depth do not elicit three-dimensional shape constancy. Vis Res 46:1599–1610
Schumer RD, Ganz L (1979) Independent stereoscopic channels for different extents of spatial pooling. Vis Res 19:1303–1314
Tyler CW (1974) Depth perception in disparity gratings. Nature 251:140–142
Tyler CW (1975) Stereoscopic tilt and size aftereffects. Perception 4:187–192
Tyler CW (1983) Sensory processing of binocular disparity. In: Schor CM, Ciuffreda KJ (eds) Basic and clinical aspects of binocular vergence eye movements. Butterworth, Stoneham, pp 199–295
Tyler CW (2006) Spatial form as inherently three-dimensional. In: Jenkin MRM, Harris LR (eds) Seeing spatial form. Oxford University Press, Oxford, pp 67–88
Tyler CW (2011) Paradoxical perception of surfaces in the Shepard tabletop illusion. i-Perception 2:137–141
Tyler CW, Kontsevich LL (1995) Mechanisms of stereoscopic processing: stereoattention and surface perception in depth reconstruction. Perception 24:127–153
Tyler CW, Kontsevich LL (2001) Stereoprocessing of cyclopean depth images: horizontally elongated summation fields. Vis Res 41:2235–2243
Tyler CW, Likova LT, Kontsevich LL, Wade AR (2006) The specificity of cortical area KO to depth structure. NeuroImage 30:228–238
Van Oostende S, Sunaert S, Van Hecke P, Marchal G, Orban GA (1997) The kinetic occipital (KO) region in man: an fMRI study. Cereb Cortex 7:690–701
Acknowledgement
Supported by FA9550-09-1-0678.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag London
About this chapter
Cite this chapter
Tyler, C.W. (2013). Shape Processing as Inherently Three-Dimensional. In: Dickinson, S., Pizlo, Z. (eds) Shape Perception in Human and Computer Vision. Advances in Computer Vision and Pattern Recognition. Springer, London. https://doi.org/10.1007/978-1-4471-5195-1_24
Download citation
DOI: https://doi.org/10.1007/978-1-4471-5195-1_24
Publisher Name: Springer, London
Print ISBN: 978-1-4471-5194-4
Online ISBN: 978-1-4471-5195-1
eBook Packages: Computer ScienceComputer Science (R0)