Human primary visual cortex shows larger population receptive fields for binocular disparity-defined stimuli

Alvarez, Ivan; Hurley, Samuel A.; Parker, Andrew J.; Bridge, Holly

doi:10.1007/s00429-021-02351-3

Human primary visual cortex shows larger population receptive fields for binocular disparity-defined stimuli

Original Article
Open access
Published: 04 August 2021

Volume 226, pages 2819–2838, (2021)
Cite this article

Download PDF

You have full access to this open access article

Brain Structure and Function Aims and scope Submit manuscript

Human primary visual cortex shows larger population receptive fields for binocular disparity-defined stimuli

Download PDF

Ivan Alvarez¹,
Samuel A. Hurley^1,2,
Andrew J. Parker^3,4 &
…
Holly Bridge ORCID: orcid.org/0000-0002-8089-6198¹

2360 Accesses
4 Citations
7 Altmetric
1 Mention
Explore all metrics

Abstract

The visual perception of 3D depth is underpinned by the brain’s ability to combine signals from the left and right eyes to produce a neural representation of binocular disparity for perception and behaviour. Electrophysiological studies of binocular disparity over the past 2 decades have investigated the computational role of neurons in area V1 for binocular combination, while more recent neuroimaging investigations have focused on identifying specific roles for different extrastriate visual areas in depth perception. Here we investigate the population receptive field properties of neural responses to binocular information in striate and extrastriate cortical visual areas using ultra-high field fMRI. We measured BOLD fMRI responses while participants viewed retinotopic mapping stimuli defined by different visual properties: contrast, luminance, motion, correlated and anti-correlated stereoscopic disparity. By fitting each condition with a population receptive field model, we compared quantitatively the size of the population receptive field for disparity-specific stimulation. We found larger population receptive fields for disparity compared with contrast and luminance in area V1, the first stage of binocular combination, which likely reflects the binocular integration zone, an interpretation supported by modelling of the binocular energy model. A similar pattern was found in region LOC, where it may reflect the role of disparity as a cue for 3D shape. These findings provide insight into the binocular receptive field properties underlying processing for human stereoscopic vision.

Stereoscopic processing of crossed and uncrossed disparities in the human visual cortex

Article Open access 21 December 2017

Decoding disparity categories in 3-dimensional images from fMRI data using functional connectivity patterns

Article 09 October 2019

Distributions of Visual Receptive Fields from Retinotopic to Craniotopic Coordinates in the Lateral Intraparietal Area and Frontal Eye Fields of the Macaque

Article Open access 13 August 2023

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Binocular stereopsis underlies our perceptual experience of stereoscopic depth and visual three-dimensional structure. Stereopsis is supported by a set of neural mechanisms for disparity selectivity and binocular integration that are distributed across multiple cortical regions in the human visual cortex (Backus et al. 2001; Bridge and Parker 2007; Preston et al. 2008; Ip et al. 2014; Goncalves et al. 2015; Li et al. 2019) and characterised by selective responses to specific stimulus features, such as absolute and relative disparity, surface curvature, slant, or separation in depth (Parker 2007).

To fully understand stereopsis, it is necessary to establish the relevant region of visual space over which binocular disparity is computed. We define this as the binocular integration zone, comprising the coincident retinal spaces of left and right eyes over which the two monocular inputs are pooled to form a unified binocular representation of disparity in cortical processing (Parker et al. 2016). The site of binocular combination can be localised to the primary visual cortex (V1) in the macaque monkey (Cumming and Parker 1999, 2000; Parker and Cumming 2001), with extrastriate areas performing computations relevant to binocular perception on an integrated representation of the binocular signal. Properties of the binocular integration zone could potentially be similar to the receptive field properties of responses driven by luminance contrast. Alternatively the binocular integration zone could differ in its spatial or temporal properties, revealing limits specific to disparity processing (Prince et al. 2002a; Nienborg et al. 2004, 2005; Anzai et al. 2011).

Neighbouring binocular neurons display similar disparity selectivity, leading to clusters of cells encoding near or far disparities (Chen et al. 2008, 2017). This is compounded by the retinotopic organisation of visual cortex, leading to regions preferentially responding to a particular binocular disparity, at a particular retinal location. This population-level organisation makes disparity selectivity amenable to study with fMRI, a technique that samples cortical responses with a spatial resolution in the range of 1–2 mm. Neuroimaging studies of binocular disparity have characterised the spatial selectivity for binocular information across human visual cortex (Backus et al. 2001; Neri et al. 2004; Preston et al. 2008; Minini et al. 2010; Cottereau et al. 2011; Ip et al. 2014; Ban and Welchman 2015), within cortical areas (Nasr et al. 2016; Tootell and Nasr 2017), as well as the role they play in perceptual judgements of disparity (Backus et al. 2001; Goncalves et al. 2015; Bridge 2016). A previous study by Barendregt et al. (2015) found that a dichoptic bar stimulus presented in spatially offset positions in the two eyes led to larger population receptive fields in V1 compared to extrastriate regions. What remains unclear is how the characteristics of the binocular integration zone in V1 and extrastriate regions perform spatial integration when stimulated with pure binocular disparity, without any monocular cues, compared to receptive fields driven by luminance and contrast.

We analysed quantitatively the receptive field properties of binocularly driven receptive fields with population receptive field (pRF) methods. Using dynamic random dot stereograms in which sequential retinotopic positions are stimulated by changes in binocular disparity, we derived a pRF spatial model of fMRI signals that are specific to processing of a particular binocular disparity. The output of the pRF model summarises the spatial extent of retinal locations over which a disparity signal increases cortical responses. This, in turn, is dictated by the properties of the neurons falling within the stimulated population, specifically the disparity selectivity and spatial selectivity of the receptive fields. As the maximum extent of the disparity-defined pRF is limited by the population-level binocular integration zone of binocular neurons in the sampled cortical space, we take the estimated pRF size as a valid estimate of the binocular integration zone. Sampling across multiple cortical visual areas, we show a pattern of larger pRFs for disparity-defined compared to chequerboard and luminance-defined stimuli in the primary visual area V1 and also in the lateral occipital complex (LOC), supporting a distinct role for this extrastriate region in disparity processing. Model simulations reveal that the increase in estimated pRF size to disparity-defined stimuli in primary visual cortex is predicted by a standard binocular energy model (Ohzawa et al. 1990; Cumming and Parker 1997; Anzai et al. 1999).

Materials and methods

Participants

Eight healthy participants with normal or corrected-to-normal vision took part in the study (mean age 27.6 year, age range 19–42, 6 female). They were screened for normal visual acuity (Snellen chart at 6 m, < 20/20 corrected) and stereoscopic vision (TNO test, < 60 arcsec correct detection). This study received ethical approval from the University of Oxford Central University Research Ethics Committee (MS-IDREC-C1-2015-040) and was conducted in accordance with the Declaration of Helsinki (2013 revision). One participant was unable to successfully fuse the stereoscopic images in the scanner, so they were not included in any of the analyses, leaving seven participants included in the results. To ensure that participants perceived the experimental stimuli as expected, prior to entering the scanner they were shown a short example of the stimuli. When explicitly asked, all participants confirmed that they could see depth in the correlated disparity condition but not in the anti-correlated disparity condition.

Stimulus presentation

Visual stimuli were generated in MATLAB (v8.0, Mathworks Inc., Natick, MA, USA) using Psychtoolbox (v3.0, http://psychtoolbox.org) and displayed through a LCD projector (LC-XL 100, Eiki Industrial Company, Japan) via a back-projection screen situated inside the bore of the MRI scanner (peak luminance = 552 cd/m²). All stimuli were viewed through red and green anaglyph filters (Wratten 2 Optical Filters #29 and #61, Eastman Kodak, Rochester, NY, USA), both to provide stereoscopic display in the case of disparity-containing stimuli, and to ensure equal luminance attenuation across conditions. Luminance crosstalk, defined as the percentage of unintended signal to intended signal, was measured for the red and green filters at 0.16% and 0.82%, respectively. The red filter was always placed over the left eye.

Stimuli were arranged across five conditions (Fig. 1): chequerboard, correlated disparity, motion, luminance, and anti-correlated disparity conditions. All stimuli were presented within the confines of a ‘wedge’ or ‘ring’ aperture, similar to that used in a standard retinotopic mapping design (Engel et al. 1994; Sereno et al. 1995). Four configurations were used: two types of wedge, rotating either clockwise or counter-clockwise, and two types of ring, either expanding or contracting. In the following section, we refer to the stimulus content within the aperture as the foreground and stimulus content outside the aperture as the background.

The chequerboard condition (Fig. 1A) consisted of a foreground of radial contrast-reversing (2 Hz) chequerboard (contrast = 100%), while the background was set to 50% luminance, matched to the mean of the foreground stimulus. The angular segments of the chequerboard were 5° and the visible wedge subtended 20° (four segments). The diameter of the radial segments was log scaled with the smallest ring subtending 0.016° and the largest subtending 0.49°.

The correlated disparity stimulus (Fig. 1B), consisted of a dynamically changing array of randomly placed dots, half of them white and half black on a grey background. Foreground dots were fully correlated in position between left and right eyes and modulated in binocular disparity. Foreground dots were presented at either + 0.2° or − 0.2° disparity, corresponding to near and far positions relative to the fixation plane, and swapping every 1.45 s. Background dots were randomly placed in left and right eyes and were, therefore, uncorrelated binocularly. Both foreground and background contained black and white dots (50% each), and dots refreshed at a frequency of 60 Hz.

The motion stimulus (Fig. 1C) consisted of a dynamic random dot array, with dot positions fully correlated binocularly. Background dots were static, while foreground dots moved in either clockwise or counter-clockwise motion (50% of dots in each direction) at 7°/s. To ensure dot motion was visible, dots were refreshed at a rate of 0.33 Hz, slower than the 60 Hz refresh rate for the correlated disparity stimulus. Both foreground and background contained black and white dots (50% each).

The luminance stimulus (Fig. 1D) consisted of a dynamic random dot array, with dot positions fully correlated binocularly, refreshing at a rate of 60 Hz, and containing both black and white dots (50% each). Foreground dots were either 100% black or 100% white, with the luminance of foreground dots reversing every 1.45 s.

The anti-correlated disparity stimulus (Fig. 1E) consisted of a dynamic random dot array, which was identical in layout to the correlated disparity stimulus (Fig. 1B), except for the arrangement of foreground dot colors. Dots falling inside the aperture always had opposite contrasts in left and right eyes, so that they were presented in matching positions but displayed as either white in the left eye and black in the right eye, or the opposite. Dots falling outside the aperture were binocularly uncorrelated as for the correlated disparity stimulus, described above. This use of anti-correlated disparity to define the aperture negates the sensory percept of depth, while delivering some binocular disparity information that is registered by V1 neurons (Cumming and Parker 1997).

For all dot stimulus conditions (motion, luminance, correlated and anti-correlated disparity), the dynamic random dot arrays were presented at a dot density of 40%, and a dot radius of 0.12° for 6 participants and 0.15° for 1 participant. The two different dot radii were used due to an unforeseen change in the presentation programme after the first participant was run, although dot size was always consistent across conditions for each participant.

For each condition, periods of no modulation (24/168 volumes per run) with a blank grey screen were used to estimate the baseline response.

In addition to the main experimental stimuli, a single full-field radial chequerboard stimulus alternating with a grey background (2.5 s ON, 30 s OFF) was used to estimate the haemodynamic response function (HRF) of visual cortex individually for each participant.

To control participants’ attention, a fixation cross was present throughout stimulation, and participants were required to detect a change of colour of this cross from black to red. The fixation cross was presented in a radial 0.5° cut-out for all stimuli, and therefore any reconstructed pRFs with eccentricities < 0.5° were discarded, due to overlap with the fixation cross. The colour change was brief (200 ms) and occurred pseudo-randomly 80–100 times during a single run. Participants responded to this vigilance task via an MRI-compatible button box, and responses were monitored to ensure participant alertness. The average percentage of events detected was 88% ± 5% SEM.

MRI acquisition

MR images were acquired with an ultra-high field 7 T MRI system (Siemens Healthcare, Germany) using a 32-channel head coil (Nova Medical, USA). Functional imaging during visual stimulation was conducted with a gradient echo echo-planar imaging sequence (TR = 2488 ms, TE = 27.8 ms, 64 slices, resolution = 1.2 mm isotropic) with in-plane acceleration using parallel imaging (GRAPPA factor = 2) (Griswold et al. 2002) and through-slice acceleration using multiband imaging (MB factor = 2) (Moeller et al. 2010). Four runs were acquired for each stimulus condition, totaling 672 volumes per condition. The order of conditions and aperture order was randomised within and across sessions. For HRF estimation, three runs were acquired per participant, total 234 volumes. B₀ field maps were acquired in-plane in each run to correct distortions due to field inhomogeneity (TR = 620 ms, TE_1/2 = 4.08/5.1 ms, resolution = 2 mm isotropic). A T1-weighted (T1w), whole-brain, anatomical image was acquired to reconstruct the cortical surface and anatomically localize functional data (MP-RAGE, TR = 2200 ms, TE = 2.82 ms, TI = 1050 ms, flip angle = 7°, slices = 176, resolution = 1 mm isotropic).

MRI pre-processing

Functional images for each participant were pre-processed with FSL (FMRIB Software Library v5.0.8; http://www.fmrib.ox.ac.uk/fsl). EPI images were corrected for distortions caused by magnetic field inhomogeneities using FUGUE (Jenkinson et al. 2012), image portions showing brain tissue were isolated, and corrected for participant motion by linear realignment to the middle time point of each run. Low-frequency fluctuations were removed using a high-pass filter with cut-off at 0.02 Hz. Each run was then registered to the subject-specific T1w structural image using boundary-based registration (Greve and Fischl 2009).

Cortical surfaces were reconstructed from T1w structural images with FreeSurfer (v5.3.0, http://www.freesurfer.net). Volumes underwent automated segmentation to generate grey and white matter boundaries, and the grey matter surface reconstructed to create a two-dimensional representation of the cortical surface.

pRF analysis

fMRI data were analysed using a Gaussian population receptive field (pRF) model (Dumoulin and Wandell 2008; Wandell and Winawer 2015). The analysis software was implemented in MATLAB and is described detail in Alvarez et al. (2015). In brief, the participant-specific haemodynamic response function (HRF) was estimated by averaging 18 trials over the occipital lobes during full-field chequerboard stimulation and fitting a double gamma function (Friston et al. 1995). Model predictions were constructed by combining the a priori position of the stimulus aperture at each MRI volume acquired and a radially symmetric two-dimensional Gaussian pRF. Predictions were then convolved with the participant-specific HRF and compared to the observed signal in a two-stage procedure. First, the spatially smoothed (full width half maximum = 5 mm, on spherical mesh) BOLD time courses were correlated with signal predictions generated by an exhaustive grid of combinations of the three pRF parameters (X coordinate, Y coordinate, σ size of pRF). The parameters resulting in the highest correlation at each vertex formed the starting point for the second stage, in which the original unsmoothed BOLD time courses were fitted using the Nelder–Mead algorithm for unconstrained nonlinear minimisation (Lagarias et al. 1998) to identify parameter combinations for each vertex that maximise the variance explained by the model. Best-fitting model predictions yielded estimates of retinotopic location (X and Y coordinates) and pRF size (σ) for each vertex. Each condition (chequerboard, luminance, motion, correlated disparity, anti-correlated disparity) was fitted independently. Regions of interest were delineated for each participant based on polar angle and eccentricity estimates obtained in the chequerboard condition (see Fig. 2).

Model performance was assessed with the normalised correlation coefficient metric (CCnorm) (Schoppe et al. 2016). Each stimulus run was divided into two, with each half split considered an independent stimulus presentation. Signal reliability was used to normalise the correlation between pRF model prediction and empirically observed BOLD signals. Vertices were thresholded at CCnorm > 0.5, approximately equivalent to 50% of explainable variance explained by the pRF model.

Regions of interest

Regions of interest V1, V2, V3, V3A/B, V5/MT+ , V7, V4, LOC and VOC were identified for each participant in each hemisphere tested. Since precise retinotopic boundaries could not be observed for all participants in some portions of visual cortex, a merged-region definition was adopted for areas LOC, VOC and V5/MT+ . Specifically, the lateral occipital complex (LOC) encompassed retinotopic definitions of areas LO-1 and LO-2, the ventral occipital complex (VOC) encompassed areas VO-1 and VO-2 (Larsson and Heeger 2006; Wandell et al. 2007; Winawer and Witthoft 2015), and the region V5/MT + encompassed the temporal-occipital areas TO-1 and TO-2 (Amano et al. 2009). Further, the region V5/MT+ was compared with an atlas definition of human occipital area 5 (hOc5), a cytoarchitectonic correlate of area V5/MT+ , for anatomical agreement (Malikovic et al. 2006). This comparison showed a minimum of 50% overlap between vertices in retinotopically-defined V5/MT+ and the atlas-based cytoarchitectonic definition of hOc5 in all hemispheres tested (mean overlap = 74%, SD = 12%, N = 14 hemispheres). The variability in alignment between structural and functional markers of V5/MT + in human cortex has been noted before (Large et al. 2016).

Experimental design and statistical analysis

Differences in model performance between stimulation conditions were assessed in two ways. First, the distributions of CCnorm values were pairwise-compared between conditions with independent Kolmogorov–Smirnov tests. Specifically in the case of KS statistics (Vermeesch 2013) the effect sizes rather than the p values are better estimates of distributional similarity. Therefore, only effect sizes are reported here. Second, differences in mean CCnorm between conditions were assessed with a repeated measures ANOVA, introducing stimulation condition and region of interest as within-subject variables and participant identity as an independent variable. Estimates of pRF size were also assessed for each visual area using a mixed-effects model implemented in Prism (GraphPad Software, San Diego, CA) with stimulus condition and eccentricity bin as within-subject variables and participant identity as a random factor. The anti-correlated disparity condition was not included in this analysis as there were too few vertices for which the pRF model could be successfully fit. This mixed-effects model was used, rather than a repeated measures ANOVA, to account for vertices where no pRF model could be fit, hereafter labelled as missing values. Across all stimulus conditions (excepting anti-correlated disparity), visual areas, participants and eccentricities, 2.1% of values were missing. The disparity condition had the greatest number of missing values at 4.9%, but no individual visual area was missing more than 10% of values. Geisser–Greenhouse correction was applied where necessary and where random effects were zero, the term was removed and a simpler model fit used. Where there was a significant effect of condition in the main analysis, post hoc mixed-effect models were conducted to assess the effect of specific condition pairs at each region of interest, with eccentricity bin introduced as a nuisance variable. All other statistical tests were implemented in MATLAB or SPSS (v24, IBM Corp., Armonk, NY, USA).

Binocular energy model

The response of binocular neurons in V1 to disparity information has been previously characterised as a set of canonical computations, formalised in the binocular energy model (Ohzawa et al. 1990; Cumming and Parker 1997; Anzai et al. 1999).

In this model, the monocular receptive field is defined as a Gabor filter, that is, the product of a sinusoidal grating and a Gaussian envelope given by

$$M_{{\left( {x,y} \right)}} = \sin \left[ {2\pi fx + \theta } \right]\, \times \,\frac{1}{{2\pi \sigma^{2} }}e^{{ - \frac{{x^{2} + y^{2} }}{{2\sigma^{2} }} }} ,$$

where x and y are point locations in 2D space, f is the spatial frequency of the sinusoidal grating, θ is the grating phase, and σ is the standard deviation of the Gaussian envelope. In the presence of stimulus image I, the simple cell response is,

$$Sx = \iint {M_{{\left( {x,y} \right)}} I_{{\left( {x,y} \right)}} }{\text{dxdy}}{.}$$

As the monocular receptive fields for the left and right eyes are independent, a position offset is introduced to generate sensitivity to binocular disparity. Summing and squaring the products of the monocular cells, create a linear-nonlinear ‘LN’ element,

$${\text{LN}} = \left( {Sx_{L} + Sx_{R} } \right)^{2} ,$$

encoding disparity information at a particular phase and spatial frequency. A binocular complex cell is constructed by simply summing the LN elements of four pairs of monocular cells, set in phase-offset in quadrature,

$$Cx = \sum {\text{LN}}_{0} + {\text{LN}}_{0.5\pi } + {\text{LN}}_{\pi } + {\text{LN}}_{1.5\pi } ,$$

where two pairs of LN elements (θ = 0, π and θ = 0.5π, 1.5π) are anti-phase with each other. The response of the complex binocular cell can then be examined for the transient effect of stimulation; as matching retinal images overlap (or not) with the receptive fields, the response of the complex cell is modulated. In analogy with the fMRI task described above, this is equivalent to the stimulus aperture transiting across the model receptive field. Let us designate the aperture position A, for an arbitrary number of positions. In the case of a luminance-defined stimulus, the complex cell response is given by,

$$Sx_{L} = \iint {M_{{L\left( {x,y} \right)}} I_{{\left( {x,y} \right)}}^{A} }{\text{dxdy,}}$$

$$Sx_{R} = \iint {M_{{R\left( {x,y} \right)}} I_{{\left( {x,y} \right)}}^{A} }{\text{dxdy,}}$$

$$Cx_{A} = \mathop \sum \limits_{\theta = 1}^{4} \left( {Sx_{L\left( \theta \right)} + Sx_{R\left( \theta \right)} } \right)^{2} ,$$

where the output of the complex cell is dependent on the overlap of the stimulus aperture I^A and the monocular receptive fields, M_L and M_R.

A population of 1000 binocular complex neurons was simulated, with receptive fields positioned in the centre of the visual field. A small, normally distributed, position offset (SD = 0.1°) was introduced to eliminate, by averaging, the spatial response of the population to detailed positions of the dots forming the randomly generated RDS patterns. The horizontal size of the Gabor profile, orthogonal to the Gabor carrier grating orientation, was manipulated to simulate receptive field size increase with eccentricity. Spatial frequency and disparity tuning were similarly manipulated to simulate the experimentally determined range of V1 receptive field properties found in recordings from macaque visual cortex (see below). The vertical size of the filter, parallel to grating orientation, was set to 1.5 times the horizontal size across eccentricity, also based on V1 recordings in the macaque monkey (Ringach et al. 2003). All filters were vertically oriented.

The stimuli delivered to the model receptive field consisted of (1) binocularly presented, contrast-reversing chequerboards (2) binocularly correlated dots in the aperture with binocularly uncorrelated dots in the background and (3) opposite polarity zero-disparity dots in the aperture and same polarity, zero-disparity dots in the background, just like the chequerboard, correlated disparity and luminance stimuli, respectively, viewed by participants. Stimuli were presented through a sweeping bar aperture in 100 steps, to create a timeseries of responses to the transient presence of contrast or disparity information. Random dots were binocularly correlated within the aperture at the programmed binocular disparity, but were uncorrelated in the background, while in the luminance condition dots were opposite polarity within the aperture and matched polarity in the background. For these random dot stimuli, 1000 unique RDS frames were generated at each aperture step, and responses averaged together. Resulting responses were fitted with the Gaussian pRF model illustrated in Fig. 2. As estimation of the receptive field location is not of concern here, the location parameters were fixed a priori and only the receptive field size was estimated (Fig. 3).

Three manipulations of model receptive field properties were conducted, to observe the effects on pRF model fits. First, the horizontal filter size was set to 15 different values between SD = 0.2° and SD = 3° to simulate RF size increase with eccentricity. The spatial frequency of the sinusoidal component of the Gabor was fixed to × 0.5 the horizontal size, and all cells were set to be tuned to the stimulus disparity (chequerboard and luminance = 0°, disparity stimulus = 0.2°). Second, the same filter size points were sampled, while allowing spatial frequency of the Gabor filter to vary between × 0.5 and × 3.5 horizontal size, reflecting the variability in spatial tuning of V1 cells, while constrained by the size-disparity correlations observed in macaque V1 (Prince et al. 2002b). Third, both spatial frequency and disparity tuning were allowed to vary, with the latter allowing horizontal position of filters for left and right eyes to vary by SD ± 0.25°. This final manipulation most closely resembles the distribution of receptive field properties reported for V1 cells in electrophysiological studies in macaque visual cortex. These simulations were designed to directly compare model responses generated by disparity-defined stimuli with chequerboard and luminance for the specific case when all model units are tuned to the stimulus disparity. A more comprehensive model would include units tuned to many different disparities, but this is beyond the scope of the current implementation.

Results

Disparity responses are widespread across visual cortex

All visual cortical areas and regions of interest gave significant responses to binocular disparity stimulation as well as contrast stimulation. When considering the distributions of CCnorm, negligible effect sizes were detected when comparing the correlated disparity condition with the chequerboard (Kolmogorov–Smirnov distance, KS = 0.12, D = 10^–4), motion (KS = 0.07, D = 10^–4), luminance (KS = 0.07, D = 10^–4) or anti-correlated conditions (KS = 0.15, D = 10^–4). Variability in mean CCnorm was assessed with a repeated measures ANOVA, revealing a significant effect of condition [F(5, 1) = 64.56, p = 10^–3] and visual area [F(5, 1) = 57.05, p = 10^–3]. Post hoc t tests showed all conditions outperformed the anti-correlated disparity condition (all comparisons p < 0.05), with no significant differences between the remaining conditions.

Figure 4 shows the pRF size averaged across all participants for each type of visual stimulation. The spatial distribution of pRF size estimates follows the expected pattern of small pRFs in areas representing the central visual field and larger in the periphery.

pRF size for disparity varies systematically across the visual hierarchy

Estimates of pRF size obtained under the disparity condition may capture the binocular integration zone (Parker et al. 2016) over which monocular signals are combined, and consequently reflect the role that cortical areas play in the integration mechanism of binocular stereopsis. Examining the spatial distribution of pRF size estimates across the visual cortex reveals systematic variation: in particular, there are locations where pRF size estimates differ between the disparity and other conditions. We observed larger pRFs for correlated disparity in the calcarine sulcus, close to representation of the horizontal meridian, when compared to all other control conditions (Fig. 4).

Binned estimates of pRF size for disparity across eccentricity are shown in Fig. 5. Differences in binned values of pRF size between the correlated disparity condition and other conditions were assessed with a full factorial ANOVA model, introducing eccentricity and region of interest as independent variables. Anti-correlated responses were omitted from this comparison, owing to the low number of vertices successfully fitted by the pRF model under that condition. There was a significant interaction between condition and region of interest (F = 43.45, df = 14, 98, p = 0.001). To assess the effect of stimulus condition on a region-by-region basis, we conducted a series of linear mixed models, which are presented in the following section.

pRF size for disparity differs from non-disparity pRFs in V1

Early visual regions: V1, V2 and V3

Figure 6 shows a summary of the pRF sizes at each eccentricity and for the different stimulus conditions across the early visual areas. In V1, a mixed-effects model showed a significant effect of condition [F(2.1, 119.3) = 5; p = 0.007], although there was no effect of eccentricity [F(9, 60) = 1.9; p = 0.06] nor a significant interaction [F(27, 174) = 0.9; p = 0.62]. Post hoc paired comparisons showed that mean pRF size across all eccentricities for disparity (2.4°) were significantly greater than chequerboard [1.8°; F(1, 116) = 15; p = 0.0002] and luminance [2.0°; F(1, 56) = 5.3; p = 0.02]. Disparity pRFs did not differ in size from those defined by motion [2.1°; F(1, 114) = 1.3; p = 0.26]. To further investigate the relationship between pRF size and mapping stimulus, the regression lines for each were compared. In V1, the three conditions showed significantly different slopes [F(3, 266) = 3.5; p = 0.02]. Pairwise conditions indicated that the slope of the disparity condition did not differ from the chequerboard [F(1, 132) = 1.5; p = 0.23] or motion [F(1, 132) = 1.3; p = 0.26], although there was a marginal difference with luminance [F(1, 132) = 3.4; p = 0.06]. In contrast, there was a highly significant difference between the intercepts of the fit for disparity and chequerboard [F(1, 132) = 16.5; p < 0.0001], to a lesser extent with luminance [F(1, 132) = 4.4; p < 0.04] and no difference from motion [F(1, 132) = 1.3; p = 0.25]. These findings are broadly consistent with the mixed-effects model.

In comparison, in V2, there was a significant effect of eccentricity on pRF size [F(9, 60) = 3.8; p = 0.0008], but no difference according to stimulus condition [F(2.6, 147.9) = 2.2; p = 0.11] or interaction [F(27, 174) = 0.75; p = 0.81]. Finally, in V3 there was a significant effect of eccentricity on pRF size [F(9, 60) = 26.9; p < 0.0001], and stimulus condition [F(2.3, 136.6) = 5.5; p = 0.003] but no significant interaction [F(27, 177) = 1.5; p = 0.06]. However, while pRFs with the disparity stimulus (2.1°) were significantly smaller than with the chequerboard [2.5°; F(1, 117) = 16.2; p = 0.0001], they did not differ from either luminance [2.2°; F(1, 117) = 0.8; p = 0.37] or motion [2.3°; F(1, 57) = 3.1; p = 0.08].

Thus, it appears that only in V1 are pRF sizes greater for disparity than both chequerboard and luminance-defined stimuli. It is also the case in V1, and to a lesser extent in V2 that the chequerboard stimulus resulted in smaller pRF sizes at low eccentricities compared to the stimuli defined by dots. This effect may have been driven by spatial integration effects since the aperture boundaries formed by dot-defined stimuli require spatial integration over a larger region of the visual field compared to the stimuli with clear contrast-defined borders, such as the chequerboard stimulus.

Dorsal visual regions: V3A/B, V5/MT + and V7

The pRFs measured with the chequerboard stimuli were larger than those with the dot-defined stimuli across all dorsal visual areas (Fig. 7). As evident from the graphs, there was a highly significant effect of eccentricity in all dorsal areas [V3A/B: F(9, 60) = 32.3; p < 0.0001; V5/hMT + : F(9,235) = 19.6; p < 0.0001; V7: F(9, 236) = 38.6; p < 0.0001]. Similarly, all areas showed a significant effect of stimulus type [V3A/B: F(2.3, 136.3) = 11.1; p < 0.0001; V5/hMT + : F(2.0,158.0) = 4.4; p = 0.01; V7: F(2.7,214.5) = 16.3; p < 0.0001]. However, the disparity-defined pRF size only differed from the pRF sizes defined using the chequerboard in V3A/B [F(1, 119) = 12.6; p = 0.0006] and V7 [F(1, 117) = 13.9; p = 0.0003]. This suggests that any difference in these areas was more likely related to the use of dot-defined stimuli rather than disparity per se. Importantly, the differences for dorsal areas differ from early visual areas in that the dot-defined stimuli have smaller pRF sizes than chequerboard-defined pRFs. This suggests that limits on pRF size at least in the dorsal regions are not related to the clarity of the boundaries as suggested earlier.

Ventral visual regions: V4, LOC and VOC

Figure 8 shows that in the ventral visual areas, like the dorsal regions, pRF size increased with eccentricity [V4: F(9, 230) = 16.6; p < 0.0001; VOC: F(9, 60) = 5; p < 0.0001; LOC: F(9, 60) = 50.3; p < 0.0001]. There was a significant effect of stimulus type in both V4 [F(2.5, 193.0) = 10.8; p < 0.0001] and LOC [F(2.4, 137.4) = 5.4; p < 0.003] and V4 also showed a significant interaction between eccentricity and stimulus type [F(27, 230) = 2.4; p < 0.0002].

In V4, while there was a significant difference in pRF size when defined by disparity (2.6°) and chequerboard [3.1°; F(1, 112) = 12.5; p = 0.0006], disparity did not differ from either luminance [2.7°; F(1, 113) = 1.9] or motion [2.4°; F(1, 111) = 1.8]. In contrast, pRF size in LOC was greater when defined by disparity (2.8°) compared to both luminance [2.5°; F(1, 44) = 10.3; p = 0.002] and motion [2.5°; F(1, 54) = 9.7; p = 0.003]. When compared to chequerboard (2.7°), there was no difference in mean pRF size [F(1, 115) = 0.7], but there was a significant interaction [F(9, 115) = 2.4; p = 0.01] reflecting the larger pRF sizes for disparity at lower eccentricities, and smaller at the highest eccentricities.

Binocular energy model predictions of V1 integration zone

The disparity-defined stimulus used in the fMRI experiment contained a single magnitude of disparity (modulating from + 0.2° to − 0.2°), operating under the assumption that the estimated binocular integration zone would reflect the sub-population of binocular neurons that are tuned to these disparities, irrespective of the cortical territory being examined. However, electrophysiological studies have shown that disparity tuning co-varies with receptive field size, and by extension, with eccentricity (Prince et al. 2002a). This size-disparity correlation means the size of the binocular integration zone is dependent on eccentricity, as well as sensitivity to disparity-defined stimuli.

The binocular energy model provides a parsimonious account of the responses of binocular cells in area V1 in the presence of binocular disparity information, building disparity sensitivity from the linear combination of monocular receptive fields (Ohzawa et al. 1990; Cumming and Parker 1997; Anzai et al. 1999). In the case of a non-disparity-defined stimulus, the binocular receptive field reflects the simple sum of the monocular overlaps between the receptive fields. However, in the presence of stimulus disparity, a disparity-tuned complex cell pools information over an extended region of space, creating an expanded receptive field (Fig. 9). Thus, the relationship between the monocular receptive field width and the complex cell response to a transient stimulus is a function of disparity tuning.

Implementing the binocular energy model, we examined the effect of varying monocular receptive field sizes of a population of synthetic V1 neurons, and fitted the model responses with the pRF procedure, with results shown in Fig. 10. A linear relationship between the model-defined receptive field size and the fitted binocular pRF size was observed, with responses to disparity-defined stimuli exhibiting larger pRFs compared to contrast-defined chequerboard and luminance-defined random dot stimuli. Using populations of model neurons with differing (i) receptive field size, (ii) spatial frequency, and (iii) disparity tuning produced a similar pattern of results with similar discrepancies between stimulus types. These discrepancies qualitatively matched the pattern observed in the empirically estimated pRF sizes from BOLD data in area V1, which also displayed larger pRF sizes for disparity-defined stimuli when compared with contrast- and luminance-defined stimuli. Therefore, while size-disparity correlation limits the size of the binocular integration zone, these results support the view that the receptive field size, constrained by eccentricity, is the principal limiting factor on the size of the binocular integration zone.

One potential reason for discrepancies between the modelling and fMRI data is the different stimulus configuration used for the modelling. We, therefore, additionally modelled the chequerboard, correlated and anti-correlated disparity stimuli using a wedge configuration. Figure 11A-C shows the model responses to each of these conditions for simulated V1 cell populations that vary in (A) RF size, (B) RF size and spatial frequency (SF) or (C) RF size, SF and disparity tuning. In addition, to determine whether model responses can be driven by correlation alone, a further condition in which a correlated zero-disparity wedge and an uncorrelated background were presented to the model. With this stimulus configuration there was no change in pRF size with increasing monocular RF size, suggesting correlation alone is not sufficient to generate the fMRI responses.

Discussion

This study provides estimates of pRFs for binocular disparity across human cortical visual areas and compares them to estimates of pRF size for non-disparity-defined stimuli. In particular, the derived pRFs obtained under correlated disparity stimulation are proposed to reflect the binocular integration zones of a given cortical site at the population level. Stimuli not defined by disparity, such as the luminance edges of the chequerboard, elicit responses across a wide variety of classical receptive fields, including both monocular and binocular RFs. By comparison, the stereoscopic RDS stimuli that define the wedge or ring aperture used to map the pRFs here only deliver aperture-related information encoded in binocular disparity. Therefore, where pRF estimates diverge between the disparity condition and pRFs estimated under luminance or contrast edges, differences should reflect the role of disparity-specific processing.

Our findings are consistent with previous fMRI evidence, which report widespread binocular disparity processing across visual cortex (Backus et al. 2001; Bridge and Parker 2007; Preston et al. 2008; Minini et al. 2010; Ip et al. 2014; Goncalves et al. 2015; Ban and Welchman 2015), and a specific role for area V1, as the site of binocular integration (Barendregt et al. 2015). An important point of interpretation for our study is that, unlike studies such as Barendregt et al. (2015), who compared binocular with monocular stimulation, the current study used binocular viewing in all tested conditions. Therefore, the stereoscopic stimuli used here probe the neuronal mechanisms that are responsible for the extraction of depth from binocular disparity. Second, our study makes direct comparisons of pRF size estimated in stereoscopic viewing conditions for both disparity- and non-disparity-defined stimuli, whereas the paper by Barendregt et al. (2015) compared overall quality of fits of the pRF model to the monocular and binocular stimulation conditions.

Estimates of the binocular integration zone in area V1

Our results demonstrate a discrepancy between pRFs estimated from disparity and non-disparity information in area V1, with larger receptive fields for disparity in agreement with the electrophysiological literature (Nienborg et al. 2004). This is consistent with the proposal that the binocular combination in disparity-specific neurons of V1 is a fundamental limiting stage in determining the size of the pRF (Cumming and Parker 1999, 2000; Parker and Cumming 2001). The lack of discrepancy of pRF size in areas V2 and V3 suggests little further combination of the retinal inputs in early extrastriate cortex, at least at levels detectable by population-level methods. In this regard, our findings are similar to those of Barendregt et al. (2015).

The relationship between the sizes of the non-disparity receptive field and the binocular integration zone in V1 is described by the general form of the binocular energy model (Banks et al. 2004; Nienborg et al. 2004). In this model, the ability of disparity-tuned V1 cells to detect changes in binocular disparity is limited by the width of the correlation window over which monocular signals are compared. If the window is too large, binocular matches become ambiguous; if the window is too small, the binocular image will not contain enough information to compute disparity (Banks et al. 2004; Nienborg et al. 2004). Notably, this constraint is independent of depth variation within the window, or limits imposed by optical effects, retinal sampling or stimulus construction (Tyler 1974; Schlesinger and Yeshurun 1998; Banks et al. 2004). As the correlation window is defined by the size and location of the paired monocular receptive fields, the latter impose the minimum area over which disparity information may be integrated. Indeed, the disparity energy model predicts a binocular integration zone whose effective receptive field is the half-squared product of the monocular receptive fields over which binocular cross-correlation takes place (Banks et al. 2004; Nienborg et al. 2004, 2005). This prediction is borne out in electrophysiological studies; for example, Nienborg et al. (2004) showed that for disparity-tuned V1 neurons, the relationship between monocular receptive field size and the width of the correlation window corresponds to a half-squaring output nonlinearity and is approximately linear across eccentricities. Extrapolating this idea to neuronal population level, the binocular integration zone is predicted to display a half-square nonlinearity in relation to other conditions, equivalent to a positive slope in the pRF size ratios between disparity and control conditions in Fig. 6B.

One potential reason for a difference in pRF size between conditions could be the relative magnitude of the BOLD signal, as increased amplitude leads to greater spread across the cortex. We do not believe this to be an issue since the pRF mapping technique uses a ‘winner takes all’ model so only the highest amplitude signal is considered for a voxel. The size of the BOLD response does, however, increase the signal: noise ratio leading to a better fitting, a larger number of informative voxels and more reliable maps.

Comparison of binocular energy model prediction and the empirical binocular integration zone in V1

An explicit, but restricted implementation of the binocular energy model allowed us to assess the effects of monocular receptive field parameters on the conjugate signal of a model V1 cell population. By manipulating the model monocular receptive field size and fitting the mean population signal with a pRF model, we confirmed that pRF size for disparity is a linear function of monocular receptive field size. We also confirmed that disparity-defined stimuli resulted in larger pRFs compared to both contrast- and luminance-defined stimuli, reflecting the wider binocular integration zone necessary for integrating horizontal discrepancies in the monocular inputs, absent in the case of contrast information. Deriving the pRF from a population of model cells with different receptive field sizes, SF and disparity tuning produced a relationship between responses to disparity-defined and contrast-defined stimuli that was broadly comparable to the empirically estimated pRF sizes from BOLD data in area V1.

Nonetheless, there were several notable discrepancies between the empirical data and modelling results. Firstly, when comparing the pRF size for correlated disparity with the chequerboard, in the model the difference between the two conditions appears to increase with receptive field size, whereas the empirical data appears to converge. Secondly, the model pRF sizes for contrast and luminance stimuli show almost exactly the same pattern. In comparison, the empirical data for these two conditions varied considerably, with smaller pRFs at low eccentricities and larger pRFs at high eccentricities for the chequerboard.

The use of dot stimuli rather than the chequerboards may contribute to both of these differences. Even at the lowest eccentricity value the pRF sizes for the dot-defined stimuli are around 2°, compared to 1° for the chequerboard. Since dot diameter was 0.3°, determining the location of the stimulus boundary between the changing wedge or ring region and the background is likely to require greater spatial averaging compared to the sharp boundaries of the chequerboard. Indeed, for both the luminance and motion stimuli, the difference in pRF size compared to correlated disparity increases with eccentricity as predicted by the model. To determine whether this is the case it would be necessary to scale the size of the dots with eccentricity. In general, the responses to the dot stimuli appear to diverge more from the modelling than the chequerboard; modelling predicts that chequerboard and luminance stimuli should provide similar pRF sizes across eccentricity. However, this is not the case for the fMRI data. Very few previous studies have used dot-defined stimuli for pRF mapping, but a previous study showed larger pRF for a global motion defined stimulus compared to a chequerboard in V1 (Hughes et al. 2019), although the data were not broken down by eccentricity.

The third major difference between the fMRI data and modelling relates to the difference in response to correlated and anti-correlated disparities, which is discussed in detail below.

Considering how the modelling could better reflect fMRI data, first, it would be worth including additional nonlinearities that can account for the known reduction in neuronal response to anti-correlated disparity. Second, as stated earlier, the current implementation of the energy model uses populations of units, all having the same disparity tuning, which imposes limits the size of the pRF for the disparity-defined stimulus. If more units tuned to different disparities were incorporated then the estimated pRF size will tend to increase but this requires considerably more modelling beyond the scope of the current study.

Overall, as would be predicted, the pRF size is much more consistent for the high contrast, compelling chequerboard stimulus compared to the dot-defined luminance and disparity stimuli. Changing the salience of the stimuli may additionally change this pattern.

pRF size is comparable for disparity and non-disparity input defined by random dots in dorsal visual areas

Dorsal regions V3A/B, V5/MT+ and V7 showed no significant difference in pRF size for disparity when compared to the dot-defined luminance and motion conditions. There was, however, a reduction in pRF size compared to the chequerboard stimulus. This is consistent with previous work indicating that pRF mapping with isolated dot-defined bar stimuli resulted in larger pRF sizes compared to stimuli presented with a contrasting surround, either opposing motion or motion noise (Hughes et al. 2019). Thus, given the lack of difference between disparity-defined stimulus and other dot-defined stimuli, our finding is consistent with the conclusion from that paper that the pRF size in dorsal regions may depend on stimulus salience. While this result is also consistent with significant involvement of dorsal visual areas in disparity processing, most notably V3A/B (Poggio et al. 1988; Adams and Zeki 2001; Neri et al. 2004; Minini et al. 2010; Ban and Welchman 2015), it does not indicate a special role for integration of disparity information across space.

Specialised processing for binocular disparity in lateral occipital cortex

In a similar fashion to the results observed in V1, we detected a pattern of larger pRFs for disparity compared to other conditions in area LOC, typically considered a later ‘upstream’ stage in visual cortical hierarchy processing (Grill-Spector et al. 2001). LOC is involved in the processing of 3D shape (Kourtzi and Kanwisher 2001; Kourtzi et al. 2003; Weigelt et al. 2007; Vernon et al. 2016), motion (Moutoussis et al. 2005; Krekelberg et al. 2005) and binocular depth (Chandrasekaran et al. 2006; Preston et al. 2008; Ban et al. 2012). While responsive to binocular disparity stimulation in isolation (Ip et al. 2014), LOC has been particularly associated with view-invariant representations of 3D shape which incorporate information about binocular depth (Welchman et al. 2005; Preston et al. 2009). The discrepancy between pRF sizes for disparity and other conditions may reflect the computational role for disparity information in LOC, not as the input to a binocular integration zone to generate a fused cyclopean representation, but instead as one component drawn upon to form view-invariant object representations. Preston et al. (2008) suggest that LOC represents depth position in a categorical manner, that is, as a coarse indicator of near vs. far position. As larger binocular disparities require larger receptive fields to capture the relevant retinal matches, it follows that coarseness in disparity tuning in LOC may be matched with a coarse spatial tuning in its pRFs. While the relationship between disparity tuning and receptive field size remains largely unknown in the human, in the macaque, electrophysiological studies have reported a multiplicative relationship between receptive field size and preferred disparity for V1 neurons (Prince et al. 2002b; Nienborg et al. 2004). Therefore, a coarse representation of both spatial and disparity tuning in LOC would be consistent with the tuning properties of disparity-selective cells.

An additional consideration is the source of disparity modulation. The dynamic random dot disparity stimulus presented here contains two sources of disparity information; absolute disparity within the aperture field, and relative disparity at the edge between the aperture and the zero-disparity background. Unlike area V1, which is exclusively selective to absolute disparity (Cumming and Parker 1999), either component may drive responses in LOC. While LOC responses can be attributed to relative disparity (Welchman et al. 2005; Chandrasekaran et al. 2006; Preston et al. 2008; Read et al. 2010; Bridge et al. 2013), a direct coding of absolute disparity is possible and consistent with the similarity in tuning properties with area V1.

The role of interocular correlation in perception

The disparity-defined stimulus configuration used for the current study was a limited aperture containing either correlated or anti-correlated dots, set against a background of uncorrelated dots. A background composed of uncorrelated dots has the advantage of providing no coherent or structured binocular signal, comparable to a mid-grey background as used in the chequerboard condition, or static dots in the motion condition. In our previous work, we have performed retinotopic mapping using a zero-disparity background, which produced comparable retinotopic maps (Bridge and Parker 2007).

A wedge or ring defined by correlated disparity, therefore, differs from the background of uncorrelated dots in two ways; first, because there is a change in perceived depth over the stimulus period, and second, by the presence of interocular correlation. The latter point is important because recent evidence suggests that the human visual system can detect interocular correlation at low frequencies (Reynaud and Hess 2018). Modelling of the interocular correlation using a zero-disparity wedge in which the depth stimulus was unchanged did not show an increase in pRF size, suggesting that correlation could not account for this pattern. Nonetheless, it remains to be determined whether the visual system would respond in the same way.

Regarding the stimulus defined by foreground anti-correlated dots, participants were initially asked whether they could perceive depth while viewing this stimulus outside of the scanner, and all reported the absence of perceived depth. However, while not producing the clear percept of depth generated by correlated disparity, anti-correlated random dot stereograms can lead to a weak percept of depth, which is some cases is perceived as ‘reversed’ (Read and Eagle 2000). However, the likelihood of perceiving reversed depth is increased with a correlated surround (Aoki et al. 2017) or when viewed peripherally (Zhaoping and Ackermann 2018), neither of which were relevant in the current study.

The response to the anti-correlated stimulus was most discrepant between the modelling and fMRI BOLD responses. Figure 10 shows that the model response to correlated and anti-correlated stimuli is almost identical, as expected with the classical energy model (Ohzawa et al. 1990). More sophisticated versions of the energy model can reduce responses to anti-correlated stimulation through additional nonlinearities (Read et al. 2002). Interestingly, previous fMRI studies using different approaches have shown comparable responses to anti-correlated and correlated stimuli in early visual areas (Bridge and Parker 2007; Preston et al. 2008). Bridge and Parker used a zero-disparity background (which provides a border between foreground and background that is not present in the current study) and found that while the BOLD change was broadly similar, the reliability of the stimulus evoked response was lower for anti-correlated stimuli. Such a reduction in response reliability in the current study could explain why the model fits were poorer in the anti-correlated condition.

The discrepancy between the modelling and fMRI signals is likely to reflect both the simplicity of the energy model employed and the absence of a depth percept (Ress and Heeger 2003).

Relating disparity pRFs to perception

The aim of the current study was to determine how pRFs defined by changing disparity-defined depth compared to those defined with other stimulus types. With the stimulus configurations used here, all pRFs were defined in the two-dimensional x–y plane. The neural signal was averaged over both near and far disparities, analogous to averaging across different directions of motion for moving dots. Thus, the pRFs estimated did not contain any information about stimulus depth, and could not be used to investigate tuning for disparity. An immediate extension of this work which would allow the estimation of pRF tunings for disparity-defined depth, would be to change the position in depth of the stimulus along the z-dimension over time, to model the preferred depth tuning properties.

fMRI estimates of binocular pRFs are in agreement with electrophysiological priors

This study presents the novel estimation of binocular receptive fields characteristics across human visual cortical areas, highlighting the discrepancies between disparity and non-disparity driven estimates of population-level receptive fields. While the estimates of pRF size for non-disparity modulated stimuli presented here are in broad agreement with previous fMRI studies (Wandell and Winawer 2015), no such baseline is available for disparity-defined pRFs. Furthermore, although direct comparisons to the electrophysiological literature may be informative, it is important to note the abstraction of these metrics from the behaviour of single disparity-tuned cells. First, BOLD fMRI signals are measured from imaging voxels that contain many cells, both tuned and not tuned to disparity, which contribute to the observed signal. Second, imaged voxels encompass a large number of disparity-sensitive neurons that contain a variable distribution of spatial and depth preferences that are aggregated and averaged in the observed signal. Therefore, the BOLD signal reflects a population preference, which nevertheless reveals systematic variation in pRF size for disparity both within and across cortical visual regions.

Relating these findings to electrophysiology, we highlight two points. First, pRF size for disparity increased with eccentricity in all visual areas tested. Second, the scaling of pRF size with eccentricity under disparity stimulation is consistent with the view of a binocular integration zone that obeys local physiological constraints imposed by its component receptive fields, imposing a limit on resolvable disparity (Banks et al. 2004; Nienborg et al. 2004). Together, these observations reinforce the hypothesis that fMRI estimates of binocular receptive fields reflect the same mechanisms as those described in electrophysiological studies of disparity processing in animal models and provide characterisation of the binocular integration zone in humans.

Data availability

The datasets generated during and/or analysed during the current study are available from the corresponding author on reasonable request.

Code availability

The population receptive field modelling toolbox is available from https://github.com/samsrf/samsrf. The binocular energy modelling toolbox is available from https://github.com/IvanAlvarez/BinocularEnergyModel.

References

Adams DL, Zeki S (2001) Functional organization of macaque V3 for stereoscopic depth. J Neurophysiol 86:2195–2203. https://doi.org/10.1152/jn.2001.86.5.2195
Article CAS PubMed Google Scholar
Alvarez I, de Haas B, Clark CA et al (2015) Comparing different stimulus configurations for population receptive field mapping in human fMRI. Front Hum Neurosci 9:96. https://doi.org/10.3389/fnhum.2015.00096/abstract
Article PubMed PubMed Central Google Scholar
Amano K, Wandell BA, Dumoulin SO (2009) Visual field maps, population receptive field sizes, and visual field coverage in the human MT+ complex. J Neurophysiol 102:2704–2718. https://doi.org/10.1152/jn.00102.2009
Article PubMed PubMed Central Google Scholar
Anzai A, Ohzawa I, Freeman RD (1999) Neural mechanisms for processing binocular information II. Complex Cells J Neurophysiol 82:909–924. https://doi.org/10.1152/jn.1999.82.2.909
Article CAS PubMed Google Scholar
Anzai A, Chowdhury SA, DeAngelis GC (2011) Coding of stereoscopic depth information in visual areas V3 and V3A. J Neurosci 31:10270–10282. https://doi.org/10.1523/JNEUROSCI.5956-10.2011
Article CAS PubMed PubMed Central Google Scholar
Aoki SC, Shiozaki HM, Fujita I (2017) A relative frame of reference underlies reversed depth perception in anticorrelated random-dot stereograms. J vis 17(12):17. https://doi.org/10.1167/17.12.17
Article PubMed Google Scholar
Backus BT, Fleet DJ, Parker AJ, Heeger DJ (2001) Human cortical activity correlates with stereoscopic depth perception. J Neurophysiol 86:2054–2068. https://doi.org/10.1152/jn.2001.86.4.2054
Article CAS PubMed Google Scholar
Ban H, Preston TJ, Meeson A, Welchman AE (2012) The integration of motion and disparity cues to depth in dorsal visual cortex. Nat Neurosci 15:636–643. https://doi.org/10.1038/nn.3046
Article CAS PubMed PubMed Central Google Scholar
Ban H, Welchman AE (2015) fMRI analysis-by-synthesis reveals a dorsal hierarchy that extracts surface slant. J Neurosci 35:9823–9835. https://doi.org/10.1523/JNEUROSCI.1255-15.2015
Article CAS PubMed PubMed Central Google Scholar
Banks MS, Gepshtein S, Landy MS (2004) Why is spatial stereoresolution so low? J Neurosci 24:2077–2089. https://doi.org/10.1523/JNEUROSCI.3852-02.2004
Article CAS PubMed PubMed Central Google Scholar
Barendregt M, Harvey BM, Rokers B, Dumoulin SO (2015) Transformation from a retinal to a cyclopean representation in human visual cortex. Curr Bio 25:1–7. https://doi.org/10.1016/j.cub.2015.06.003
Article CAS Google Scholar
Bridge H (2016) Effects of cortical damage on binocular depth perception. Philos Trans R Soc B 371:20150254–20150259. https://doi.org/10.1098/rstb.2015.0254
Article Google Scholar
Bridge H, Parker AJ (2007) Topographical representation of binocular depth in the human visual cortex using fMRI. J vis 7:1–14. https://doi.org/10.1167/7.14.15
Article PubMed Google Scholar
Bridge H, Thomas OM, Minini L et al (2013) Structural and functional changes across the visual cortex of a patient with visual form agnosia. J Neurosci 33:12779–12791. https://doi.org/10.1523/JNEUROSCI.4853-12.2013
Article CAS PubMed PubMed Central Google Scholar
Chandrasekaran C, Canon V, Dahmen JC et al (2006) Neural correlates of disparity-defined shape discrimination in the human brain. J Neurophysiol 97:1553–1565. https://doi.org/10.1152/jn.01074.2006
Article PubMed Google Scholar
Chen G, Lu HD, Roe AW (2008) A map for horizontal disparity in monkey V2. Neuron 58:442–450. https://doi.org/10.1016/j.neuron.2008.02.032
Article CAS PubMed PubMed Central Google Scholar
Chen G, Lu HD, Tanigawa H, Roe AW (2017) Solving visual correspondence between the two eyes via domain-based population encoding in nonhuman primates. Proc Natl Acad Sci 114:13024–13029. https://doi.org/10.1073/pnas.1614452114
Article CAS PubMed PubMed Central Google Scholar
Cottereau BR, McKee SP, Ales JM, Norcia AM (2011) Disparity-tuned population responses from human visual cortex. J Neurosci 31:954–965. https://doi.org/10.1523/JNEUROSCI.3795-10.2011
Article CAS PubMed PubMed Central Google Scholar
Cumming BG, Parker AJ (1999) Binocular neurons in V1 of awake monkeys are selective for absolute, not relative, disparity. J Neurosci 19:5602–5618. https://doi.org/10.1523/JNEUROSCI.19-13-05602.1999
Article CAS PubMed PubMed Central Google Scholar
Cumming BG, Parker AJ (2000) Local disparity not perceived depth is signaled by binocular neurons in cortical area V1 of the macaque. J Neurosci 20:4758–4767. https://doi.org/10.1523/JNEUROSCI.20-12-04758.2000
Article CAS PubMed PubMed Central Google Scholar
Cumming BG, Parker AJ (1997) Responses of primary visual cortical neurons to binocular disparity without depth perception. Nature 389:280–283. https://doi.org/10.1038/38487
Article CAS PubMed Google Scholar
Dumoulin SO, Wandell BA (2008) Population receptive field estimates in human visual cortex. Neuroimage 39:647–660. https://doi.org/10.1016/j.neuroimage.2007.09.034
Article PubMed Google Scholar
Engel SA, Rumelhart DE, Wandell BA et al (1994) fMRI of human visual cortex. Nature 369:525. https://doi.org/10.1038/369525a0
Article CAS PubMed Google Scholar
Friston KJ, Frith CD, Turner R, Frackowiak RS (1995) Characterizing evoked hemodynamics with fMRI. Neuroimage 2:157–165. https://doi.org/10.1006/nimg.1995.1018
Article CAS PubMed Google Scholar
Goncalves NR, Ban H, Sanchez-Panchuelo RM et al (2015) 7 Tesla fMRI reveals systematic functional organization for binocular disparity in dorsal visual cortex. J Neurosci 35:3056–3072. https://doi.org/10.1523/JNEUROSCI.3047-14.2015
Article CAS PubMed PubMed Central Google Scholar
Greve DN, Fischl B (2009) Accurate and robust brain image alignment using boundary-based registration. Neuroimage 48:63–72. https://doi.org/10.1016/j.neuroimage.2009.06.060
Article PubMed Google Scholar
Grill-Spector K, Kourtzi Z, Kanwisher N (2001) The lateral occipital complex and its role in object recognition. Vis Res 41:1409–1422. https://doi.org/10.1016/S0042-6989(01)00073-6
Article CAS PubMed Google Scholar
Griswold MA, Jakob PM, Heidemann RM et al (2002) Generalized autocalibrating partially parallel acquisitions (GRAPPA). Magn Reson Med 47:1202–1210. https://doi.org/10.1002/mrm.10171
Article PubMed Google Scholar
Hughes AE, Greenwood JA, Finlayson NJ, Schwarzkopf DS (2019) Population receptive field estimates for motion-defined stimuli. Neuroimage 199:245–260. https://doi.org/10.1016/j.neuroimage.2019.05.068
Article PubMed Google Scholar
Ip IB, Minini L, Dow J et al (2014) Responses to interocular disparity correlation in the human cerebral cortex. Ophthalmic Physiol Opt 34:186–198. https://doi.org/10.1111/opo.12121
Article PubMed PubMed Central Google Scholar
Jenkinson M, Beckmann CF, Behrens TEJ et al (2012) FSL. Neuroimage 62:782–790. https://doi.org/10.1016/j.neuroimage.2011.09.015
Article PubMed Google Scholar
Kourtzi Z, Kanwisher N (2001) Representation of perceived object shape by the human lateral occipital complex. Science 293:1506–1509. https://doi.org/10.1126/science.1061133
Article CAS PubMed Google Scholar
Kourtzi Z, Tolias AS, Altmann CF et al (2003) Integration of local features into global shapes: monkey and human fMRI studies. Neuron 37:333–346. https://doi.org/10.1016/S0896-6273(02)01174-1
Article CAS PubMed Google Scholar
Krekelberg B, Vatakis A, Kourtzi Z (2005) Implied motion from form in the human visual cortex. J Neurophysiol 94:4373–4386. https://doi.org/10.1152/jn.00690.2005
Article PubMed Google Scholar
Lagarias JC, Reeds JA, Wright MH, Wright PE (1998) Convergence properties of the Nelder-Mead simplex method in low dimensions. SIAM J Optimiz 9:112–147. https://doi.org/10.1137/S1052623496303470
Article Google Scholar
Large I, Bridge H, Ahmed B et al (2016) Individual differences in the alignment of structural and functional markers of the V5/MT complex in primates. Cereb Cortex 26:3928–3944. https://doi.org/10.1093/cercor/bhw180
Article CAS PubMed PubMed Central Google Scholar
Larsson J, Heeger DJ (2006) Two retinotopic visual areas in lateral occipital cortex. J Neurosci 26:13128–13142. https://doi.org/10.1523/JNEUROSCI.1657-06.2006
Article CAS PubMed PubMed Central Google Scholar
Li Y, Hou C, Yao L et al (2019) Disparity level identification using the voxel-wise Gabor model of fMRI data. Hum Brain Mapp 40:2596–2610. https://doi.org/10.1002/hbm.24547
Article PubMed PubMed Central Google Scholar
Malikovic A, Amunts K, Schleicher A et al (2006) Cytoarchitectonic analysis of the human extrastriate cortex in the region of V5/MT+: a probabilistic, stereotaxic map of area hOc5. Cereb Cortex 17:562–574. https://doi.org/10.1093/cercor/bhj181
Article PubMed Google Scholar
Minini L, Parker AJ, Bridge H (2010) Neural modulation by binocular disparity greatest in human dorsal visual stream. J Neurophysiol 104:169–178. https://doi.org/10.1152/jn.00790.2009
Article PubMed PubMed Central Google Scholar
Moeller S, Yacoub E, Olman CA et al (2010) Multiband multislice GE-EPI at 7 Tesla, with 16-fold acceleration using partial parallel imaging with application to high spatial and temporal whole-brain fMRI. Magn Reson Med 63:1144–1153. https://doi.org/10.1002/mrm.22361
Article PubMed PubMed Central Google Scholar
Moutoussis K, Keliris G, Kourtzi Z, Logothetis N (2005) A binocular rivalry study of motion perception in the human brain. Vis Res 45:2231–2243. https://doi.org/10.1016/j.visres.2005.02.007
Article CAS PubMed Google Scholar
Nasr S, Polimeni JR, Tootell RBH (2016) Interdigitated color- and disparity-selective columns within human visual cortical areas V2 and V3. J Neurosci 36:1841–1857. https://doi.org/10.1523/JNEUROSCI.3518-15.2016
Article CAS PubMed PubMed Central Google Scholar
Neri P, Bridge H, Heeger DJ (2004) Stereoscopic processing of absolute and relative disparity in human visual cortex. J Neurophysiol 92:1880–1891. https://doi.org/10.1152/jn.01042.2003
Article PubMed Google Scholar
Nienborg H, Bridge H, Parker AJ, Cumming BG (2004) Receptive field size in V1 neurons limits acuity for perceiving disparity modulation. J Neurosci 24:2065–2076. https://doi.org/10.1523/JNEUROSCI.3887-03.2004
Article CAS PubMed PubMed Central Google Scholar
Nienborg H, Bridge H, Parker AJ, Cumming BG (2005) Neuronal computation of disparity in V1 limits temporal resolution for detecting disparity modulation. J Neurosci 25:10207–10219. https://doi.org/10.1523/JNEUROSCI.2342-05.2005
Article CAS PubMed PubMed Central Google Scholar
Ohzawa I, DeAngelis GC, Freeman RD (1990) Stereoscopic depth discrimination in the visual cortex: neurons ideally suited as disparity detectors. Science 249:1037–1041. https://doi.org/10.1126/science.2396096
Article CAS PubMed Google Scholar
Parker AJ (2007) Binocular depth perception and the cerebral cortex. Nat Rev Neurosci 8:379–391. https://doi.org/10.1038/nrn2131
Article CAS PubMed Google Scholar
Parker AJ, Cumming BG (2001) Cortical mechanisms of binocular stereoscopic vision. Prog Brain Res 134:205–216. https://doi.org/10.1016/s0079-6123(01)34015-3
Article CAS PubMed Google Scholar
Parker AJ, Smith JET, Krug K (2016) Neural architectures for stereo vision. Philos Trans R Soc B 371:20150261–20150314. https://doi.org/10.1098/rstb.2015.0261
Article Google Scholar
Poggio GF, Gonzalez F, Krause F (1988) Stereoscopic mechanisms in monkey visual cortex: binocular correlation and disparity selectivity. J Neurosci 8:4531–4550. https://doi.org/10.1523/JNEUROSCI.08-12-04531.1988
Article CAS PubMed PubMed Central Google Scholar
Preston TJ, Li S, Kourtzi Z, Welchman AE (2008) Multivoxel pattern selectivity for perceptually relevant binocular disparities in the human brain. J Neurosci 28:11315–11327. https://doi.org/10.1523/JNEUROSCI.2728-08.2008
Article CAS PubMed PubMed Central Google Scholar
Preston TJ, Kourtzi Z, Welchman AE (2009) Adaptive estimation of three-dimensional structure in the human brain. J Neurosci 29:1688–1698. https://doi.org/10.1523/JNEUROSCI.5021-08.2009
Article CAS PubMed PubMed Central Google Scholar
Prince SJD, Cumming BG, Parker AJ (2002a) Range and mechanism of encoding of horizontal disparity in macaque V1. J Neurophysiol 87:209–221. https://doi.org/10.1152/jn.00466.2000
Article CAS PubMed Google Scholar
Prince SJD, Pointon AD, Cumming BG, Parker AJ (2002b) Quantitative analysis of the responses of V1 neurons to horizontal disparity in dynamic random-dot stereograms. J Neurophysiol 87:191–208. https://doi.org/10.1152/jn.00465.2000
Article CAS PubMed Google Scholar
Read JCA, Phillipson GP, Serrano-Pedraza I et al (2010) Stereoscopic vision in the absence of the lateral occipital cortex. PLoS ONE 5:e12608. https://doi.org/10.1371/journal.pone.0012608
Article CAS PubMed PubMed Central Google Scholar
Read JC, Eagle RA (2000) Reversed stereo depth and motion direction with anti-correlated stimuli. Vision Res 40(24):3345–3358. https://doi.org/10.1016/s0042-6989(00)00182-6
Article CAS PubMed Google Scholar
Read JC, Parker AJ, Cumming BG (2002) A simple model accounts for the response of disparity-tuned V1 neurons to anticorrelated images. Vis Neurosci 19(6):735–753
Article Google Scholar
Ress D, Heeger DJ (2003) Neuronal correlates of perception in early visual cortex. Nat Neurosci 6(4):414–420. https://doi.org/10.1038/nn1024
Article CAS PubMed PubMed Central Google Scholar
Reynaud A, Hess RF (2018) Interocular correlation sensitivity and its relationship with stereopsis. J vis 18(1):11. https://doi.org/10.1167/18.1.11
Article PubMed PubMed Central Google Scholar
Ringach DL, Hawken MJ, Shapley R (2003) Dynamics of orientation tuning in macaque V1: the role of global and tuned suppression. J Neurophysiol 90:342–352. https://doi.org/10.1152/jn.01018.2002
Article PubMed Google Scholar
Schlesinger BY, Yeshurun Y (1998) Spatial size limits in stereoscopic vision. Spat vis 11:279–293. https://doi.org/10.1163/156856898x00031
Article CAS PubMed Google Scholar
Schoppe O, Harper NS, Willmore BDB et al (2016) Measuring the performance of neural models. Front Comput Neurosci. https://doi.org/10.3389/fncom.2016.00010
Article PubMed PubMed Central Google Scholar
Sereno MI, Dale AM, Reppas JB et al (1995) Borders of multiple visual areas in humans revealed by functional magnetic resonance imaging. Science 268:889–893. https://doi.org/10.1126/science.7754376
Article CAS PubMed Google Scholar
Tootell RBH, Nasr S (2017) Columnar segregation of magnocellular and parvocellular streams in human extrastriate cortex. J Neurosci 37:8014–8032. https://doi.org/10.1523/JNEUROSCI.0690-17.2017
Article CAS PubMed PubMed Central Google Scholar
Tyler CW (1974) Depth perception in disparity gratings. Nature 251:140–142. https://doi.org/10.1038/251140a0
Article CAS PubMed Google Scholar
Vermeesch P (2013) Multi-sample comparison of detrital age distributions. Chem Geol 341:140–146. https://doi.org/10.1016/j.chemgeo.2013.01.010
Article CAS Google Scholar
Vernon RJW, Gouws AD, Lawrence SJD et al (2016) Multivariate patterns in the human object-processing pathway reveal a shift from retinotopic to shape curvature representations in lateral occipital areas, LO-1 and LO-2. J Neurosci 36:5763–5774. https://doi.org/10.1523/JNEUROSCI.3603-15.2016
Article CAS PubMed PubMed Central Google Scholar
Wandell BA, Dumoulin SO, Brewer AA (2007) Visual field maps in human cortex. Neuron 56:366–383. https://doi.org/10.1016/j.neuron.2007.10.012
Article CAS PubMed Google Scholar
Wandell BA, Winawer J (2015) Computational neuroimaging and population receptive fields. Trends Cogn Sci 19:349–357. https://doi.org/10.1016/j.tics.2015.03.009
Article PubMed PubMed Central Google Scholar
Weigelt S, Kourtzi Z, Kohler A et al (2007) The cortical representation of objects rotating in depth. J Neurosci 27:3864–3874. https://doi.org/10.1523/JNEUROSCI.0340-07.2007
Article CAS PubMed PubMed Central Google Scholar
Welchman AE, Deubelius A, Conrad V et al (2005) 3D shape perception from combined depth cues in human visual cortex. Nat Neurosci 8:820–827. https://doi.org/10.1038/nn1461
Article CAS PubMed Google Scholar
Winawer J, Witthoft N (2015) Human V4 and ventral occipital retinotopic maps. Vis Neurosci 32:E020. https://doi.org/10.1017/S0952523815000176
Article PubMed PubMed Central Google Scholar
Zhaoping L, Ackermann J (2018) Reversed depth in anticorrelated random-dot stereograms and the central-peripheral difference in visual inference. Perception. https://doi.org/10.1177/0301006618758571
Article PubMed Google Scholar

Download references

Funding

This work was supported by the Medical Research Council (MR/K014382/1), and The Royal Society (University Research Fellowship to HB). The Wellcome Centre for Integrative Neuroimaging is supported by core funding from the Wellcome Trust (203139/Z/16/Z).

Author information

Authors and Affiliations

Wellcome Centre for Integrative Neuroimaging, Nuffield Department of Clinical Neurosciences, John Radcliffe Hospital, University of Oxford, Oxford, OX3 9DU, UK
Ivan Alvarez, Samuel A. Hurley & Holly Bridge
Department of Radiology, University of Wisconsin, Madison, WI, 53705, USA
Samuel A. Hurley
Department of Physiology, Anatomy and Genetics, University of Oxford, Oxford, OX1 3PT, UK
Andrew J. Parker
Institut für Biologie, Otto-von-Guericke Universität, 39120, Magdeburg, Germany
Andrew J. Parker

Authors

Ivan Alvarez
View author publications
You can also search for this author in PubMed Google Scholar
Samuel A. Hurley
View author publications
You can also search for this author in PubMed Google Scholar
Andrew J. Parker
View author publications
You can also search for this author in PubMed Google Scholar
Holly Bridge
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Conceptualization: IA, SAH, AJP, and HB; methodology: IA, SAH, AJP, and HB; formal analysis and investigation: IA; writing—original draft preparation: IA; writing—review and editing: IA, SAH, AJP, and HB; funding acquisition: AJP and HB.

Corresponding author

Correspondence to Holly Bridge.

Ethics declarations

Conflict of interest

The authors declare no competing financial interests.

Ethical approval

This study received ethical approval from the University of Oxford Central University Research Ethics Committee (MS-IDREC-C1-2015-040) and was conducted in accordance with the Declaration of Helsinki.

Consent to participate

Informed consent was obtained from all individual participants included in the study.

Consent for publication

The authors affirm that human research participants provided informed consent for publication of results based on data collected in this study.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Alvarez, I., Hurley, S.A., Parker, A.J. et al. Human primary visual cortex shows larger population receptive fields for binocular disparity-defined stimuli. Brain Struct Funct 226, 2819–2838 (2021). https://doi.org/10.1007/s00429-021-02351-3

Download citation

Received: 08 February 2021
Accepted: 22 July 2021
Published: 04 August 2021
Issue Date: December 2021
DOI: https://doi.org/10.1007/s00429-021-02351-3

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Human primary visual cortex shows larger population receptive fields for binocular disparity-defined stimuli

Abstract

Similar content being viewed by others

Stereoscopic processing of crossed and uncrossed disparities in the human visual cortex

Decoding disparity categories in 3-dimensional images from fMRI data using functional connectivity patterns

Distributions of Visual Receptive Fields from Retinotopic to Craniotopic Coordinates in the Lateral Intraparietal Area and Frontal Eye Fields of the Macaque

Introduction

Materials and methods

Participants

Stimulus presentation

MRI acquisition

MRI pre-processing

pRF analysis

Regions of interest

Experimental design and statistical analysis

Binocular energy model

Results

Disparity responses are widespread across visual cortex

pRF size for disparity varies systematically across the visual hierarchy

pRF size for disparity differs from non-disparity pRFs in V1

Early visual regions: V1, V2 and V3

Dorsal visual regions: V3A/B, V5/MT + and V7

Ventral visual regions: V4, LOC and VOC

Binocular energy model predictions of V1 integration zone

Discussion

Estimates of the binocular integration zone in area V1

Comparison of binocular energy model prediction and the empirical binocular integration zone in V1

pRF size is comparable for disparity and non-disparity input defined by random dots in dorsal visual areas

Specialised processing for binocular disparity in lateral occipital cortex

The role of interocular correlation in perception

Relating disparity pRFs to perception

fMRI estimates of binocular pRFs are in agreement with electrophysiological priors

Data availability

Code availability

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Ethical approval

Consent to participate

Consent for publication

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation