Abstract
The precision of stereopsis and vergence are ultimately limited by internal binocular disparity noise. Here we propose an equivalent noise model with both global and local internal disparity noises to provide a unified explanation of both absolute and relative disparity thresholds. To test this model, we developed a psychophysical procedure to measure the equivalent internal disparity noise by adding external disparity noise to random-Gabor-patch stereograms. We used the method of constant stimuli to measure the minimum and maximum disparity thresholds (Dmin and Dmax) for both absolute and relative disparity. Consistent with previous studies, we found that Dmin thresholds are substantially worse for absolute disparity than for relative disparity. We tested three relative disparity mechanisms: (1) the difference between the monocular separations of targets projecting to the two eyes; (2) the direct measurement of relative disparity; and (3) the difference of absolute disparities of targets. Computing the difference of absolute disparities when detecting relative disparity, Mechanism 3 cancels global noise, resulting in a much lower relative Dmin threshold, and provides a reasonable fit to the experimental data. We also found that the presence of as much as 2400 arcsec of external disparity noise does not appear to affect the Dmax threshold. This observation suggests that Dmax is implicated in a mechanism that disregards the disparity variance of individual items, relying instead on the average disparity across all items, supporting the depth model proposed in our previous study (Ding & Levi, 2021), which posits distinct mechanisms governing Dmin and Dmax thresholds.
Similar content being viewed by others
Introduction
Stereopsis, the perception of depth through the brain's processing of binocular disparity, has been extensively studied1,2,3,4,5,6. The absolute disparity of a point in space is defined as the difference between the angle subtended by the target at the two entrance pupils of the eyes and the angle of convergence, i.e., the difference in the angular locations of the retinal images of the target in the two eyes, referenced to corresponding retinal points (the disparity related to the fixation point) (Fig. 1A). When a second point is located in a different depth plane, this introduces a relative disparity with the first point. However, there are three ways to calculate the relative disparity between two points (Fig. 1B): (1) the difference of the monocular separations of the two points projecting to the two eyes (Mechanism 1), i.e., the prior uniocular processing hypothesis7; (2) the direct measurement by a relative disparity mechanism (Mechanism 2); and (3) the difference of absolute disparities of the two points (Mechanism 3).
There are neurons in V1 that are tuned for absolute disparity and are insensitive to relative disparity. Their disparity tuning curves do not shift when the disparity of the surrounding area varies8. On the other hand there are also neurons in the brain areas downstream from V1 that are sensitive to relative disparity. Their disparity tuning curves shift depending on the surrounding disparity9,10. However, in a psychophysical study, it is difficult, or even impossible, to exclude the possibility that relative disparity information may be used for detecting absolute disparity if a fixation point or other background marks are visible4. Because of the absolute disparity anomaly11, the absence of conscious readout of absolute disparity, it is hard to perform the task of “pure” absolute disparity detection without a visible fixation point or other background references.
Here we propose three unified equivalent noise models, each combining one absolute disparity mechanism and one of three relative disparity mechanisms, to provide a unified explanation of two data sets of absolute and relative disparity minimum thresholds. To test the model we used stimuli with a fixation point for detecting absolute disparity to make the task less challenging, and we used stimuli without a fixation point for detecting relative disparity to test the possibility of relative mechanisms without prior absolute disparity processing. In general, we assume that both absolute and relative disparity mechanisms may be involved in performing either task. Our data and modeling support Mechanism 3, the difference of absolute disparities as the mechanism of measuring relative disparity. This is not surprising, given that spatial stereoresolution (relative disparity) is constrained by the calculation of absolute disparity through a binocular matching process12. Furthermore, hypercyclopean channels play a crucial role in the perception of disparity surfaces13,14,15,16.
Absolute disparity
As shown in Fig. 1A, let \({\theta }_{i0}^{{\text{L}}}\) and \({\theta }_{i0}^{{\text{R}}}\) be the visual angles of point Pi referenced to the fixation point P0 in the two eyes respectively, i.e., monocular angular separations of point Pi from the fixation P0 in the two retinas. Pi’s absolute disparity is given by:
In the physiological literature, there is clear evidence that both phase and position disparity sensitive neurons are present in cortical area V12,17,18. Most likely, they are directly sensitive to absolute disparity without prior uniocular measurements of monocular angular separations \({\theta }_{i0}^{{\text{L}}}\) and \({\theta }_{i0}^{{\text{R}}}\) 7,19. Their absolute disparity tuning curves are not affected by the disparity in the surrounding area, i.e., they are insensitive to relative disparity8. These neurons may have different preferred disparities when detecting targets with the same stimulus disparity, i.e., there may be sampling errors when detecting absolute disparities (Fig. 1C). This internal disparity variance leads to inconsistent depth perception. Figure 1C provides a schematic representation of the disparity variance in absolute disparity detectors. In this figure blue circles represent targets with identical stimulus disparity, while red squares represent disparity detectors with varied preferred absolute disparities. We assume that the disparities of these detectors follow a Gaussian distribution with variance of \({\sigma }_{{\text{Loc}}}^{2}\) (local internal disparity noise). Their mean disparity (dashed red line), often offset from the stimulus disparity (dashed blue line), fluctuates from trial to trial because of resampling for a different trial by a different group of neurons, also following a Gaussian distribution with variance of \({\sigma }_{{\text{Glob}}}^{2}\) (global internal disparity noise). We further assume that the disparity response is proportional to the disparity with a random fluctuation that follows a Gaussian distribution. Therefore, the disparity response is given by:
where A is detection efficiency and \(N\left({\sigma }_{{\text{Int}}}^{2}\right)\) is the internal disparity noise. If the disparity variance of one disparity detector is \({\sigma }_{{\text{Int}}}^{2}\), the variance of mean disparity of M detectors equals \({\sigma }_{{\text{Int}}}^{2}/M\). If the efficiency A = 1 for one disparity detector, the efficiency for M detectors would be \(A=\sqrt{M}\). In other words, \(M={A}^{2}\) disparity detectors are involved in disparity detection if the efficiency A completely reflects sampling efficiency.
We assume that the internal noise \(N\left({\sigma }_{{\text{Int}}}^{2}\right)\) has both global and local components, i.e.,
Global noise \(N\left({\sigma }_{{\text{Glob}}}^{2}\right)\) refers to random fluctuations in the visual system's response to binocular disparities across the entire visual field (e.g., offset of red and blue dashed lines in Fig. 1C), while local noise \(N\left({\sigma }_{{\text{Loc}}}^{2}\right)\) pertains to the noise specific to individual disparity detectors (e.g., offsets of red squares from the red dashed line in Fig. 1C). Both global and local noises affect the absolute disparity threshold.
To enhance clarity, we have included a table in “Appendix C” summarizing symbols used in the text.
Relative disparity mechanism 1
As shown in Fig. 1B, let \({\theta }_{ij}^{{\text{L}}}\) and \({\theta }_{ij}^{{\text{R}}}\) be the visual angles subtended by point Pi referenced to point Pj in the two eyes respectively, i.e., the monocular angular separations of point Pi from point Pj in the two retinas. Pi’s relative disparity referenced to point Pj is given by:
We assume that the visual system first measures the monocular separations of the two targets in the two retinas, \({\theta }_{ij}^{{\text{L}}}\) and \({\theta }_{ij}^{{\text{R}}}\), and then performs the subtraction operation. If the responses to monocular separations are given by,
and the internal position noise is independent in the two eyes, the relative disparity response is given by:
We note that relative disparity mechanism 1, i.e., the prior uniocular processing hypothesis, was rejected experimentally by Berry19 and Westheimer and McKee7. In the present study, we performed modeling to see if Mechanism 1 is also rejected statistically.
Relative disparity mechanism 2
If the visual system measures the relative disparity directly, which does not need prior measures of monocular angular separations \({\theta }_{ij}^{{\text{L}}}\) and \({\theta }_{ij}^{{\text{R}}}\), or prior measures of absolute disparities either, the disparity response is given by:
Here, we assume that the internal variance \({\sigma }_{{\text{Int}}}^{2}\) is equal when measuring monocular separation in Eq. 5 and binocular disparity in Eqs. (2) and (7). Due to the direct measurement of relative disparity in Mechanism 2 (Eq. 7), its internal variance is only half that of Mechanism 1 (Eq. 6), where relative disparity is calculated as the difference of the two monocular separations.
Relative disparity mechanism 3
Using the fixation point P0 as the common reference, the visual system first measures the absolute disparities of Pi and Pj, and then performs the subtraction operation on the two absolute disparities. The relative disparity of two points Pi and Pj is given by:
The two absolute disparities share a common reference (the fixation plane), allowing the cancellation of global noise—a random fluctuation across the entire visual field—in the difference calculation of Eq. (8). However, global noise cannot be canceled in relative disparity Mechanisms 1 and 2 due to the absence of a common reference point in these mechanisms. From Eq. (2), the disparity response is given by:
Figure 1D provides a schematic representation of relative disparity Mechanism 3. For detecting the relative disparity dij between targets i and j located in two depth planes, the global noise (offset of red and blue dashed lines) is canceled by calculating the difference in their absolute disparities.
Although neurons tuned to relative disparities have been isolated9,10 and could be an independent system for relative disparity detection (Mechanism 2), they can also take their inputs from absolute disparity neurons (Mechanism 3) or from monocular neurons (Mechanism 1) to calculate relative disparities. In the present paper, we compare the three mechanisms statistically through modeling.
Although internal disparity noise plays a crucial role in limiting depth thresholds, directly measuring it can be challenging or even impossible. To estimate the equivalent internal noise, researchers often used an equivalent noise procedure20,21, e.g., introducing external disparity noise to the stimuli to estimate internal disparity noise22,23,24. Because of the absolute disparity anomaly11, the mechanism of absolute disparity is seldom directly addressed in the literature4. Previous research has primarily focused on understanding the thresholds for detecting relative disparity, often utilizing an equivalent noise model to describe the underlying mechanisms22,23. To avoid the absolute disparity anomaly, in the present study, we provided a visible fixation point for detecting absolute disparity, i.e., relative disparity information was also used for performing the task. However, using a unified model of absolute and relative disparities, we were able to reveal both the absolute and relative disparity mechanisms. Our data and modeling show that global internal noise is involved in detecting absolute disparity but is canceled in detecting relative disparity.
A portion of the present study has been published as an abstract25.
Methods
Stimuli. Random-Gabor-Patch (RGP) stereograms (Fig. 2) were used as stimuli in this study. The stereograms consisted of vertical Gabor patches with random positions and phases, but a fixed spatial frequency. RGP stereograms provide stereoscopic depth signals within a narrow spatial frequency-and-orientation channel, eliminating monocular depth cues. Each eye was presented with an identical array of patches, except for paired patches that could undergo random shifts in opposite directions in the two eyes, following a Gaussian distribution both horizontally and vertically. This introduced a random binocular disparity to each patch pair, effectively acting as external disparity noise. The mean vertical disparity remained consistently at zero, while the mean horizontal disparity assumed a non-zero value. Figure 2A depicts an array with either crossed or uncrossed mean horizontal disparity spanning the entire display, thus serving as the stimulus for detecting absolute disparity. To minimize the impact of the absolute disparity anomaly, a fixation point was included. In Fig. 2B, the top and bottom halves of the array exhibit crossed and uncrossed mean horizontal disparity, respectively, serving as the stimulus for detecting relative disparity without a fixation point. It is important to note that the stimulus remained static during a single presentation of the stereogram. The jth Gabor patch pair is given by:
We tested five spatial frequencies [0.375, 0.75, 1.5, 3.0, and 6.0 cycles per degree (cpd)], with the number of patches being 4, 16, 36, 100, and 400 in a square measuring 14.1 × 14.1 degrees and the patch size in radius being 4.0, 2.0, 1.0, 0.5 and 0.25 degrees, respectively. The luminance contrast (mL, mR) of Gabor patches was always 100% in the two eyes. To create an RGP stereogram with 3.0 cpd Gabor patches, a large square measuring 14.1 × 14.1 degrees was divided into 10 × 10 small grids. Each grid contained a Gabor patch with a spatial frequency (ω) of 3 cpd and a standard deviation (σ) of 0.167 degrees, randomly distributed within the area (gridwidth – σ) x (gridwidth – σ) with equal distribution. The luminance profiles of a Gabor patch pair are given by Eqs. (10) and (11) when the patch radius is less than 4σ and were set to be zero otherwise. The visible size is approximately 3σ in radius, e.g., 0.5 degree for a patch with a spatial frequency of 3.0 cpd. Along the grid border, two patches could partially overlap. The local disparity of each paired patch was generated by shifting the patches by equal amounts (equal to half the local disparity) but in opposite directions. A circular shift was performed to maintain a constant stereogram size. The local disparity across the stereogram followed a Gaussian distribution. The mean vertical disparity was always zero, while the mean horizontal disparity was either a positive (uncrossed) or negative (crossed) value, representing the stimulus disparity (d). The standard deviation of local disparities represents the external noise. RGP stereograms with differently scaled Gabor patches were constructed using similar methods.
Absolute disparity thresholds were measured by presenting two intervals, one with crossed and the other with uncrossed stimulus disparity. Observers judged which interval appeared nearer. Each interval lasted for 1 s, with a 0.5-s inter-interval duration. Relative disparity thresholds were assessed using a single interval lasting for 1 s. The stimulus was divided into top and bottom halves, with either crossed or uncrossed mean horizontal disparity, and observers determined which half appeared closer.
Although a 2IFC (Two-Interval Forced Choice) task for measuring absolute disparity thresholds and a 2AFC (Two-Alternative Forced Choice) task for measuring relative disparity thresholds are two distinct experimental paradigms, they can both be analyzed using Signal Detection Theory (see the following Models section) to extract system internal noise and efficiency. Despite this common analytical approach, the two tasks differ in their memory demands, and a 2IFC task may exhibit asymmetry between the two intervals. To verify the comparability of results obtained from these two tasks, we conducted two control experiments using a 2IFC paradigm to measure relative disparity thresholds: (1) with a signal (relative disparity) plus noise in one interval and noise only in the other (2IFC-1), and (2) with a signal plus noise in both intervals, while altering the direction of relative disparity presented in the two intervals (2IFC-2). In 2IFC-1, the observer’s task was to identify which interval exhibited a relative disparity, whereas in 2IFC-2, the task was to identify which interval exhibited a relative disparity with the bottom half nearer. Our data and modeling indicate that the internal noise remains consistent in both the 2AFC and 2IFC tasks for relative disparity detection, with variations in their performance attributed to distinct efficiencies (Appendix B).
The method of constant stimuli was employed to measure the minimum and maximum disparity thresholds (Dmin and Dmax). Data for one spatial frequency channel was collected as a block of 1200 trials with 6 levels of external noise, 10 disparities, and 20 repeats. To effectively model the data depicting the probability of a correct response in relation to binocular disparity, we utilized a psychometric function based on the sum of two cumulative Gaussian distribution functions. One of these functions exhibited an increasing trend corresponding to Dmin, while the other displayed a declining trend for Dmax as the values of disparity increased. The specific values of Dmin and Dmax thresholds were determined as the disparities that produced a 75% correct response rate.
The stimuli were presented on a 22-inch NEC MultiSync CRT monitor with a spatial pixel resolution of 1920 × 1440 and a vertical refresh rate of 75 Hz. The experimental setup utilized a Linux System76 Mini running Matlab (MathWorks, Inc.) with the Psychophysics Toolbox extensions26,27. A specialized circuit28 was employed to achieve 14-bit gray-scale levels. Gamma correction was applied and verified by measuring 10 luminance levels using a Minolta LS-110 photometer. The minimum luminance of the monitor, with all pixels set to their lowest value, measured 0.2 cd/m2, while the maximum luminance, with all pixels set to their highest value, measured 74.2 cd/m2. The displays were viewed in a custom built 4 mirror stereoscope and positioned optically at 68 cm from the observer.
Models
The following models are proposed to predict disparity minimum thresholds (Dmin) and not disparity maximum thresholds (Dmax).
Absolute disparity
Because the fixation point is fixed without external noise, when adding external noise \(N\left({\sigma }_{{\text{Ext}}}^{2}\right)\) to the RGP stimuli, the system total noise equals \(N\left({\sigma }_{{\text{Int}}}^{2}\right)+N\left({\sigma }_{{\text{Ext}}}^{2}\right)\). From Eq. (2), the response to the uncrossed disparity d is given by:
And the response to the crossed disparity -d is given by:
A trial response is given based on
Assuming that internal and external noises are independent, and that global and local internal noise are also independent, i.e., \({\sigma }_{{\text{Int}}}^{2}={\sigma }_{{\text{Loc}}}^{2}+{\sigma }_{{\text{Glob}}}^{2}\), the absolute minimum disparity threshold is given by:
In the present study, absolute disparity thresholds were assessed using the 2IFC task, where the uncrossed disparity d was randomly assigned to either the first or second interval with an equal 50% probability. Any potential asymmetry between the two intervals can be mitigated in Eq. (14). This is achieved by formulating the equation based on \({R}_{1}- {R}_{2}\) for half of the trials where uncrossed disparity is in the first interval and \({R}_{2}- {R}_{1}\) for the remaining half of the trials where uncrossed disparity is in the second interval.
Relative disparity mechanism 1
Relative disparity thresholds were assessed using a single interval method of constant stimuli. The stimulus was divided into top and bottom halves, with either crossed or uncrossed mean horizontal disparity. Relative disparity between the top and bottom halves (\({d}_{\pm }=2d\)) is determined by the difference in monocular angular separations of a top point Pi from a bottom point Pj in the two retinas (Eq. 4)7. From Eq. (6), a trial response is given based on:
The relative minimum disparity threshold based on Mechanism 1 (Eq. 16) is given by:
Please note that the global internal disparity noise cannot be canceled in Mechanism 1 (Eq. 16).
Relative disparity mechanism 2
Relative disparity between the top and bottom halves (\({d}_{\pm }=2d\)) is computed directly, without prior measurement of their monocular separations (Eq. 4) or prior measurement of their absolute disparities (Eq. 8). The global internal disparity noise cannot be canceled. From Eq. (7), a trial response is given based on:
Because targets in both the top and bottom halves have independent external disparity noise \(N\left({\sigma }_{{\text{Ext}}}^{2}\right)\), unlike the case of detecting absolute disparity in Eqs. (12) or (13), there is a total of \(N\left(2{\sigma }_{{\text{Ext}}}^{2}\right)\) external noise when computing the relative disparity in Eq. (18). The relative minimum disparity threshold based on Mechanism 2 (Eq. 18) is given by:
Relative disparity mechanism 3
Relative disparity between the top and bottom halves is computed as the difference of their absolute disparities. Again, the responses to crossed and uncrossed disparities are given by Eqs. (12) and (13) respectively. Because the global noise is canceled in the difference of absolute disparities within a single interval, a trial response is given based on:
The relative minimum disparity threshold is given by:
In combination with absolute disparity mechanism Eq. (15), we tested three relative disparity mechanisms (Eqs. 17, 19 and 21) by fitting the unified model to predict both absolute and relative minimum thresholds. Our data and modeling show that the unified model with relative disparity Mechanism 3 (Eqs. 15 and 21) provide the best fit to both data sets.
Mechanism 3 for relative disparity in control experiments utilizing the 2IFC task. In the 2IFC-1 task, one interval comprises both the signal (relative disparity) and noise, while the other contains only noise. The task is to indicate which interval contains the relative disparity signal. The response to the interval with both signal and noise is given by Eq. (20), i.e., \({R}_{\pm }=2Ad+N\left(2{\sigma }_{{\text{Loc}}}^{2}\right)+N\left(2{\sigma }_{{\text{Ext}}}^{2}\right)\) and the response to the interval only with noise is given by \({R}_{{\text{N}}}=N\left(2{\sigma }_{{\text{Loc}}}^{2}\right)+N\left(2{\sigma }_{{\text{Ext}}}^{2}\right)\). Therefore, a trial response is given based on:
The relative Dmin threshold is given by:
In the 2IFC-2 task, both intervals contain a signal plus noise, with the direction of relative disparity varying between the two intervals from trial to trial. The task is to indicate which interval exhibits a relative disparity with the bottom half nearer. Again, referring to Eq. (20), the response to one interval with signal plus noise is given by \({R}_{\pm }=2Ad+N\left(2{\sigma }_{{\text{Loc}}}^{2}\right)+N\left(2{\sigma }_{{\text{Ext}}}^{2}\right)\) and the response to the other interval with a reversed relative disparity plus noise is given by \({R}_{\mp }=-2Ad+N\left(2{\sigma }_{{\text{Loc}}}^{2}\right)+N\left(2{\sigma }_{{\text{Ext}}}^{2}\right)\). Therefore, a trial response is given based on:
The relative Dmin threshold is given by:
Observers
Three observers with normal or corrected-to-normal vision signed an informed consent form and participated in the experiment. One observer is a coauthor, and the others are naïve observers. All observers were screened for stereoacuity better than 20 arcseconds using the clinical stereo circle test (Randot Stereotest, Stereo Optical Co., Inc.). The experimental protocol was approved by the internal board of the ethics committee (IRB) of University of California, Berkeley, according to the guidelines and regulations for human subject research. All experimental protocols were performed in accordance with the guidelines provided by the committee approving the experiments. The data were averaged across the three observers.
Results
Figure 3 shows Dmin and Dmax thresholds for detecting relative (Red) and absolute (Blue) disparities as a function of external disparity noise standard deviation. Dmax threshold remains constant (the average is indicated by horizontal lines in Fig. 3, not predictions from a model) independent of external noise for both relative and absolute disparities, indicating that the disparity variance may have no effect on the upper disparity limit under conditions of the present study (maximum external noise standard deviation ≈ 960 arc sec for relative or 2400 arc sec for absolute disparity detection). On the other hand, for both tasks, Dmin threshold remains constant at small external noise levels but increases proportionally with external noise at large noise standard deviations, with the turning point (indicated by a color vertical bar on x-axis in Fig. 3) estimated as the standard deviation of equivalent internal disparity noise. The smooth colored curves for Dmin thresholds are the best fits of Model 3 (Eqs. 15 and 21), a unified equivalent noise model with both global and local internal disparity noise. Dmin threshold is lower in relative than in absolute disparity detection at all external noise levels. Model 3 provides a unified account of both absolute and relative Dmin thresholds.
Our modeling shows that detecting relative disparity is more efficient and has lower internal noise than detecting absolute disparity. Model 3 (Eqs. 15 and 21) has eight parameters, one global noise standard deviation (\({\sigma }_{{\text{Glob}}}\)) for absolute disparity detection (but canceled in relative disparity detection), two equivalent efficiencies (\({A}_{{\text{Abs}}}\) and \({A}_{{\text{Rel}}}\)) for detecting absolute and relative disparities respectively and five local noise standard deviations (\({\sigma }_{{\text{Loc}}}\)) corresponding to five different scales for both absolute and relative disparity detection. The equivalent efficiency, \({A}_{{\text{Abs}}}=\) 1.01 ± 0.06 (~ 1 detector) for detecting absolute disparity and \({A}_{{\text{Rel}}}=\) 1.54 ± 0.10 (~ 2 detectors) for detecting relative disparity, is independent of spatial frequency. This is consistent with Wardle, Bex et al.23, who reported that the efficiency for depth discrimination (relative disparity) was consistently very low (1–4 detectors) across the visual field. However, as far as we know, the efficiency for detecting absolute disparity had not been measured before. We speculate that the higher efficiency for detecting relative (compared to absolute) disparity in the present study may be due to having more disparity detectors involved in detecting relative disparity. Lower internal noise in detecting relative (compared to absolute) disparity may be because the global internal disparity noise, which is involved in detecting absolute disparity, can be canceled out in the detection of relative disparity. As shown in Fig. 4A, the local internal disparity standard deviation \({\sigma }_{{\text{Loc}}}\), which may be partially caused by sampling errors, decreases as spatial frequency increases (slope ≈ − 0.51 in log–log coordinates). In contrast, the global internal disparity \({\sigma }_{{\text{Glob}}}\) (= 79.9 ± 10.6) appears to be independent of spatial frequency, resulting in a slower decrease of the combined global and local internal disparity noise (total internal noise for detecting absolute disparity: \(\sqrt{{\sigma }_{{\text{Loc}}}^{2}+{\sigma }_{{\text{Glob}}}^{2}}\)) as spatial frequency increases (slope ≈ − 0.16 in a log–log plot). This explains why the Dmin threshold decreases more rapidly for detecting relative than absolute disparities when spatial frequency increases (Fig. 3).
We initially performed a preliminary experiment using a spatial frequency of 1.5 cpd to estimate an appropriate range of test noise levels. The estimation was based on the assumption that the internal disparity noise decreases with a log–log slope of − 1 as spatial frequency increases, aiming to maintain constancy in phase disparity space. However, our findings indicate that the reduction in internal disparity noise is much slower than initially anticipated with increasing spatial frequency, as illustrated in Fig. 4A. Consequently, we performed a limited number of tests with lower noise levels at lower spatial frequencies, resulting in suboptimal fits of the horizontal lines in Fig. 3. These fits are particularly influenced by the threshold in the absence of external noise.
Both relative and absolute Dmax thresholds decrease as spatial frequency increases (Fig. 4B). However, the relative disparity Dmax decreases more rapidly than the absolute disparity Dmax (with a slope of approximately − 0.83 compared to − 0.57 in log–log coordinates).
Modeling
We used the Akaike Information Criterion (AIC), a measure of the relative goodness of fit of a statistical model developed by Akaike29, to compare different models (See Appendix A).
Table 1 shows chi square values and AICc scores for model fitting and statistical comparisons of three unified models, each combining one absolute (Eq. 15) and one of three relative disparity mechanisms (Eqs. 17, 19 and 21). The ‘best’ model is the one with the lowest AICc score. The Akaike weight (Aw), the relative likelihood of a model being the ‘best’ one in the set of models being considered, is given in the last column. Model 3 is the best, with a 93.08% Akaike weight. Model 1 (Eqs. 15 and 17) has 7 model parameters, two efficiencies (\({A}_{{\text{Abs}}}\) and \({A}_{{\text{Rel}}}\)) for detecting absolute and relative disparities respectively and five deviations (\({\sigma }_{{\text{Int}}}\)) of internal noise corresponding to five different scales for both absolute and relative disparity detection. The threshold difference for detecting absolute and relative disparities is accounted for only by their different detection efficiencies. However, greater efficiency for detecting relative disparity is not sufficient to explain all the data sets. It’s likelihood for the ‘best’ model (Aw) is very low, less than 0.01%. Model 2 (Eqs. 15 and 19) is almost identical to Model 1, except that the internal variance for detecting relative disparity is only a half of that of Model 1, because the relative disparity is measured directly in Model 2 while it is measured in two stages in Model 1. The lower internal variance for detecting relative disparity in Model 2 is helpful in accounting for the lower relative disparity threshold, increasing the likelihood to 6.91%. However, the constant 50% decrease in internal variance for detecting relative disparity fails to account for the observed fact that the decrease of internal variance for detecting relative disparity is scale dependent. As shown in Fig. 3, when spatial frequency increases, the internal noise decreases more for relative disparity than for absolute disparity detection. Adding a new parameter of Global noise standard deviation (\({\sigma }_{{\text{Glob}}}\)), Model 3 (Eqs. 15 and 21) has eight parameters. Because the global noise is canceled in detecting relative disparity in the difference of absolute disparities in Model 3, the threshold is lower for relative than for absolute disparity. Because the global noise is independent of scale (dashed black line in Fig. 4A), Model 3 successfully predicts that the internal noise is decreased more quickly in relative than in absolute disparities as spatial frequency increases (blue and red lines in Fig. 4A). Its likelihood to be the best model is 93.08%.
Discussion
Although previous studies have tried to study the absolute disparity mechanism based on “pure” absolute disparity detection tasks4,11 performed without any visible marks as a reference in the background, such “pure” absolute disparity tasks may not exist in natural viewing conditions. In our daily life, the depth of a target is always perceived with visible surrounding references. Even in a lab environment, where any visible marks may be removed from the background, an invisible reference from memory may still be needed to accurately perceive depth4,11. In fact, any depth perception needs both absolute and relative disparity mechanisms, whether detecting absolute or relative disparities. This makes it difficult, or even impossible, to isolate each mechanism with psychophysical experiments. Although neurons tuned to absolute8 and relative9,10 disparities have been isolated, the debate over the relative disparity mechanism has never been completely settled4,11,30. The present study is an effort to study the two mechanisms in combination, using a new model of the two mechanisms, which provides a unified explanation of both absolute and relative detection tasks in a comprehensive framework.
Our data and modeling show that detecting relative disparity requires prior absolute disparity processing, even when no fixation point is present when performing a relative disparity task, which aligns with findings from previous studies7,11,12,13,14,15,16,31,32,33. Tyler13 reported stereoscopic tilt and size aftereffects and suggested that visual processing at the hypercyclopean level involves feature-selective channels tuned for both size and orientation of stimulus elements. Subsequently, these hypercyclopean channels were employed to explain anisotropies in stereoacuity for disparity corrugations15,31,32,33. The present study suggests that the global internal disparity noise is effectively canceled through the relative disparity calculation within these hypercyclopean channels.
As a comparison to the present results, we performed a control experiment for detecting relative disparity with a fixation point (Red x’s in Fig. 5) and found no difference whether a visible fixation point was present (Red x’s in Fig. 5) or not (Red circles in Fig. 5). On the other hand, relative disparity information may also be necessary to reveal the absolute disparity mechanism. Providing a visible fixation point as a reference is helpful in detecting absolute disparity, minimizing the impact of the absolute disparity anomaly11. Our participants showed much worse performance for detecting absolute disparity without a fixation point.
Figure 5 shows the comparison of two absolute disparity Dmin data sets—one with a fixation point (Blue circles) and one without (Green circles), both collected from the 2IFC task. It also presents four relative Dmin datasets under different conditions: (1) with a fixation point, collected from the 2AFC task (Red x’s); (2) without a fixation point, collected from the 2AFC task (Red circles); (3) without a fixation point, collected from the 2IFC-1 task with a signal (relative disparity) plus noise in one interval and noise only in the other (Black squares); and (4) without a fixation point, collected from the 2IFC-2 task featuring a signal plus noise in both intervals but with a reversal in the direction of relative disparity presented in the two intervals (Black triangles). The data were collected from one of the three participants in Fig. 3. The smooth curves are the best fits to the six data sets of a unified equivalent noise model (Model 3) with seven parameters as shown in Table 2: global and local noise standard deviations and five detection efficiencies. The standard deviations of equivalent internal disparity noise are indicated by colored vertical bars on the x-axis in Fig. 5. Please note that a single curve (Red) fits both relative Dmin data sets collected from the 2AFC task with (Red x’s) and without (Red circles) a fixation point.
Our modeling indicates that incorporating a fixation point is more likely (73.4%) to enhance the efficiency of absolute disparity detection, rather than reducing internal noise (22.5%) (see Appendix B). Assuming that global noise cancellation occurs only within the same spatial-frequency band, the internal noise may remain unchanged in absolute disparity detection, regardless of the presence of a fixation point (as depicted in Fig. 5). This is because the disparity targets are confined to a narrow-banded channel, while the fixation point spans a broader band. The addition of a fixation point reduces the absolute disparity anomaly11, thereby increasing the efficiency of absolute disparity detection. However, given the limited conditions examined in our control experiment, we cannot conclusively rule out the possibility that a fixation point might effectively reduce global internal noise, potentially mitigating the absolute disparity anomaly. We leave further exploration of this topic to future studies. Conversely, the addition of a fixation point has no impact on relative disparity detection. The internal noise in relative disparity detection remains consistent in both the 2AFC and 2IFC tasks, with their differing performance attributed to distinct efficiencies (see Appendix B). Importantly, the global internal noise can be effectively canceled in relative disparity detection using either the 2AFC or 2IFC task.
On the other hand, by canceling global disparity noise, observers are highly sensitive to relative disparity even during eye/head movements and are essentially blind to large changes in absolute disparity34,35,36. Indeed, Erkelens and Collewijn35,36 reported that relative depth perception is independent of vergence errors. Steinman, Collewijn and co-workers37,38 concluded that relative horizontal disparity alone determines stereo thresholds and that a shift in vergence posture, which alters the absolute retinal disparities across the entire visual field, does not degrade stereopsis.
However, Ukwade, Bedell and Harwerth39 found that the stereo threshold is elevated if the vergence error exceeds a critical value (around 90 arcseconds) regardless of whether the vergence error was induced by forced vergence or was simulated by disconjugate retinal image motion. They explained this increase in stereo threshold by comparing the induced vergence error to a disparity pedestal. Indeed, numerous studies have reported that stereoacuity thresholds rise exponentially as the pedestal disparity increases40,41,42,43,44,45. We speculate that under natural viewing conditions, vergence errors and possibly other forms of internal global disparity noise have evolved to be well-suited for stereo vision, minimizing their impact on stereoacuity.
Considering the induced vergence error in Ukwade, Bedell and Harwerth39 as external global disparity noise, when it exceeds a critical value (around 90 arcseconds), i.e., the internal global disparity noise, the stereo threshold is elevated. The internal global disparity noise measured in the present study is approximately 80 arcseconds, which aligns closely with their findings of around 90 arcseconds39. While vergence noise may constitute a significant portion of global disparity noise, uncorrelated motor noise in the two eyes during eye/head movements may also contribute to this global noise. Furthermore, in addition to these sources of global noise, neuron resampling, which results in varying mean disparities among different neuron groups, may also contribute to the overall global noise. All these forms of global noise are relevant to the normal constraints on human vision.
In the present study the acuity for relative disparity was, on average, approximately 3.5 times better than that for absolute disparity with a fixation point presented. This result is consistent with Chopin et al.'s 2016 study11, which reported an average improvement of ≈ 4 times for relative disparity compared with “pure” absolute disparity thresholds measured by attempting to exclude any visible reference marks. However, without a visible fixation point, our participants showed much worse performance for detecting absolute disparity than Chopin’s. Compared with relative disparity detection, one of our participants showed about 10 times worse performance (Fig. 5) and another participant was unable to provide consistent data. We speculate that Chopin’s participants may have been well trained to use a reference from memory when detecting absolute disparity without a visible reference.
Our findings indicate that the Dmax threshold is influenced by spatial frequency (Fig. 4B), aligning with the results of a previous study46. However, it's important to note that in our experiments, spatial frequency and size were correlated, which raises the possibility that our results may reflect a dependency on stimulus size47. Under the conditions of the present study, adding external disparity noise (at least up to a standard deviation of ≈ 2400 arc sec for absolute or 960 arc sec for relative disparity) does not appear to affect the Dmax threshold (Fig. 3), which suggests that Dmax may involve a mechanism that ignores local disparity variance but relies on the mean disparity of all items. This is consistent with a previous study showing that the Dmax threshold was determined by a non-linear coarse stereopsis mechanism, such as a second-order process48 or envelope extraction47, which disregards the local differences of individual targets49,50. Conversely, Dmin threshold depends on the local details of individual targets. Just 1 or 2 items may be sampled for disparity detection at the Dmin threshold level during a trial if the equivalent efficiency (1.01 for absolute and 1.54 for relative disparity detections) mainly reflects sampling efficiency, i.e., most of the disparity samples are neglected in the performance of the task24. This is consistent with previous studies23,24, which suggest that disparity signals are not globally integrated to extract the mean depth, and supports the model proposed by Ding and Levi48, which suggests that Dmin and Dmax are regulated by two distinct mechanisms.
In the present study, we proposed a novel unified equivalent noise model that incorporates both global and local internal disparity noise to provide a comprehensive framework for both absolute and relative disparities. Our model offers a crucial insight into the detection of absolute and relative disparity. Both global and local internal disparity noise affect detecting absolute disparity. However, for detecting relative disparity, which involves comparing differences between absolute disparities, our modeling shows that the influence of global noise on absolute disparity cancels out, leading to a lower threshold for detecting relative disparity compared to absolute disparity. Indeed, the dynamics51 and developmental trajectories52,53 of absolute and relative disparity are different. Moreover our conclusion that distinct mechanisms govern Dmin and Dmax thresholds is consistent with the finding that coarse stereopsis develops earlier54 and is less susceptible to the effects of abnormal visual experience55, than fine stereopsis.
The establishment of this unified model contributes to a deeper understanding of the underlying mechanisms of absolute and relative disparities involved in stereovision. Furthermore, the findings from our study have the potential to address depth deficits in abnormal binocular vision, offering valuable insights that can aid in the diagnosis and treatment of such conditions.
Data availability
The datasets generated during and/or analyzed during the current study are available from the corresponding author, Jian Ding jian.ding@berkeley.edu, on reasonable request.
References
Poggio, G. & Fischer, B. Binocular interaction and depth sensitivity in striate and prestriate cortex of behaving rhesus monkey. J. Neurophysiol. 40, 1392–1405 (1977).
DeAngelis, G. C., Ohzawa, I. & Freeman, R. D. Depth is encoded in the visual cortex by a specialized receptive field structure. Nature 352, 156 (1991).
Prince, S., Cumming, B. & Parker, A. Range and mechanism of encoding of horizontal disparity in macaque V1. J. Neurophysiol. 87, 209–221 (2002).
Parker, A. J. Binocular depth perception and the cerebral cortex. Nat. Rev. Neurosci. 8, 379 (2007).
Schor, C. M. In The New Visual Neurosciences (eds Chalupa, L. M., & Werner, J. S.) 809–823 (MIT Press, 2013).
McKee, S. & Verghese, P. Oxford Research Encyclopedia of Psychology (Oxford University Press, 2023).
Westheimer, G. & McKee, S. P. What prior uniocular processing is necessary for stereopsis?. Invest. Ophthalmol. Vis. Sci. 18, 614–621 (1979).
Cumming, B. G. & Parker, A. J. Binocular neurons in V1 of awake monkeys are selective for absolute, not relative, disparity. J. Neurosci. 19, 5602–5618. https://doi.org/10.1523/jneurosci.19-13-05602.1999 (1999).
Thomas, O. M., Cumming, B. G. & Parker, A. J. A specialization for relative disparity in V2. Nat. Neurosci. 5, 472. https://doi.org/10.1038/nn837 (2002).
Umeda, K., Tanabe, S. & Fujita, I. Representation of stereoscopic depth based on relative disparity in macaque area V4. J. Neurophysiol. 98, 241–252. https://doi.org/10.1152/jn.01336.2006 (2007).
Chopin, A., Levi, D., Knill, D. & Bavelier, D. The absolute disparity anomaly and the mechanism of relative disparities. J. Vis. 16, 2. https://doi.org/10.1167/16.8.2 (2016).
Banks, M. S., Gepshtein, S. & Landy, M. S. Why is spatial stereoresolution so low?. J. Neurosci. 24, 2077–2089 (2004).
Tyler, C. W. Stereoscopic tilt and size aftereffects. Perception 4, 187–192 (1975).
Tyler, C. W. Shape Perception in Human and Computer Vision: An Interdisciplinary Perspective 357–372 (Springer, 2013).
Tyler, C. W. & Kontsevich, L. L. Stereoprocessing of cyclopean depth images: Horizontally elongated summation fields. Vis. Res. 41, 2235–2243. https://doi.org/10.1016/S0042-6989(00)00261-3 (2001).
Goutcher, R., Connolly, E. & Hibbard, P. B. Surface continuity and discontinuity bias the perception of stereoscopic depth. J. Vis. 18, 13. https://doi.org/10.1167/18.12.13 (2018).
Ohzawa, I., DeAngelis, G. C. & Freeman, R. D. Stereoscopic depth discrimination in the visual cortex: Neurons ideally suited as disparity detectors. Science 249, 1037–1041 (1990).
Cumming, B. G. & Parker, A. J. Responses of primary visual cortical neurons to binocular disparity without depth perception. Nature 389, 280 (1997).
Berry, R. N. Quantitative relations among Vernier, real depth, and stereoscopic depth acuities. J. Exp. Psychol. 38, 708 (1948).
Pelli, D. G. Effects of Visual Noise (University of Cambridge, 1981).
Pelli, D. G. & Farell, B. Why use noise?. J. Opt. Soc. Am. A 16, 647–653. https://doi.org/10.1364/JOSAA.16.000647 (1999).
Alarcon Carrillo, S., Baldwin, A. S. & Hess, R. F. Factors limiting sensitivity to binocular disparity in human vision: Evidence from a noise-masking approach. J. Vis. 20, 9. https://doi.org/10.1167/jov.20.3.9 (2020).
Wardle, S. G., Bex, P. J., Cass, J. & Alais, D. Stereoacuity in the periphery is limited by internal noise. J. Vis. 12, 12. https://doi.org/10.1167/12.6.12 (2012).
Harris, J. M. & Parker, A. J. Efficiency of stereopsis in random-dot stereograms. J. Opt. Soc. Am. A 9, 14–24. https://doi.org/10.1364/JOSAA.9.000014 (1992).
Ding, J. et al. Global versus local internal disparity noise in stereovision. J. Vis. 23, 4979–4979. https://doi.org/10.1167/jov.23.9.4979 (2023).
Brainard, D. H. The psychophysics toolbox. Spat. Vis. 10, 433–436 (1997).
Pelli, D. G. The VideoToolbox software for visual psychophysics: Transforming numbers into movies. Spat. Vis. 10, 437–442 (1997).
Li, X., Lu, Z.-L., Xu, P., Jin, J. & Zhou, Y. Generating high gray-level resolution monochrome displays with conventional computer graphics cards and color monitors. J. Neurosci. Methods 130, 9–18 (2003).
Akaike, H. A new look at the statistical model identification. IEEE Trans. Autom. Control 19, 716–723. https://doi.org/10.1109/tac.1974.1100705 (1974).
DeAngelis, G. C. Seeing in three dimensions: The neurophysiology of stereopsis. Trends Cognit. Sci. 4, 80–90. https://doi.org/10.1016/S1364-6613(99)01443-6 (2000).
Hibbard, P. B. The orientation bandwidth of cyclopean channels. Vis. Res. 45, 2780–2785. https://doi.org/10.1016/j.visres.2005.06.006 (2005).
Bradshaw, M. F. & Rogers, B. J. Sensitivity to horizontal and vertical corrugations defined by binocular disparity. Vis. Res. 39, 3049–3056. https://doi.org/10.1016/S0042-6989(99)00015-2 (1999).
Serrano-Pedraza, I. & Read, J. C. A. Multiple channels for horizontal, but only one for vertical corrugations? A new look at the stereo anisotropy. J. Vis. 10, 10–10. https://doi.org/10.1167/10.12.10 (2010).
Regan, D., Erkelens, C. J. & Collewijn, H. Necessary conditions for the perception of motion in depth. Invest. Ophthalmol. Vis. Sci. 27, 584–597 (1986).
Erkelens, C. J. & Collewijn, H. Motion perception during dichoptic viewing of moving random-dot stereograms. Vis. Res. 25, 583–588. https://doi.org/10.1016/0042-6989(85)90164-6 (1985).
Erkelens, C. J. & Collewijn, H. Eye movements and stereopsis during dichoptic viewing of moving random-dot stereograms. Vis. Res. 25, 1689–1700. https://doi.org/10.1016/0042-6989(85)90141-5 (1985).
Collewijn, H., Steinman, R. M., Erkelens, C. J. & Regan, D. In Vision and Visual Dysfunction (ed. David Regan) 121–136 (Macmillan, 1991).
Steinman, R. M., Levinson, J. Z., Collewijn, H. & van der Steen, J. Vision in the presence of known natural retinal image motion. J. Opt. Soc. Am. A 2, 226–233. https://doi.org/10.1364/JOSAA.2.000226 (1985).
Ukwade, M. T., Bedell, H. E. & Harwerth, R. S. Stereopsis is perturbed by vergence error. Vis. Res. 43, 181–193. https://doi.org/10.1016/S0042-6989(02)00408-X (2003).
Ogle, K. N. Precision and validity of stereoscopic depth perception from double images. J. Opt. Soc. Am. 43, 906–913. https://doi.org/10.1364/JOSA.43.000906 (1953).
Blakemore, C. The range and scope of binocular depth discrimination in man. J. Physiol. 211, 599–622 (1970).
Badcock, D. R. & Schor, C. M. Depth-increment detection function for individual spatial channels. J. Opt. Soc. Am. A 2, 1211–1216. https://doi.org/10.1364/JOSAA.2.001211 (1985).
Siderov, J. & Harwerth, R. S. Stereopsis, spatial frequency and retinal eccentricity. Vis. Res. 35, 2329–2337. https://doi.org/10.1016/0042-6989(94)00307-8 (1995).
Farell, B., Li, S. & McKee, S. P. Disparity increment thresholds for gratings. J. Vis. 4, 156–168. https://doi.org/10.1167/4.3.3 (2004).
McKee, S. P., Verghese, P. & Farell, B. Stereo sensitivity depends on stereo matching. J. Vis. 5, 3–3 (2005).
Schor, C. M. & Wood, I. Disparity range for local stereopsis as a function of luminance spatial frequency. Vis. Res. 23, 1649–1654. https://doi.org/10.1016/0042-6989(83)90179-7 (1983).
Wilcox, L. M. & Hess, R. F. Dmax for stereopsis depends on size, not spatial frequency content. Vis. Res. 35, 1061–1069 (1995).
Ding, J. & Levi, D. M. A unified model for binocular fusion and depth perception. Vis. Res. 180, 11–36. https://doi.org/10.1016/j.visres.2020.11.009 (2021).
Kaye, M. Stereopsis without binocular correlation. Vis. Res. 18, 1013–1022. https://doi.org/10.1016/0042-6989(78)90029-9 (1978).
Mitchell, D. E. Qualitative depth localization with diplopic images of dissimilar shape. Vis. Res. 9, 991–994. https://doi.org/10.1016/0042-6989(69)90104-7 (1969).
Kaestner, M., Evans, M. L., Chen, Y. D. & Norcia, A. M. Dynamics of absolute and relative disparity processing in human visual cortex. NeuroImage 255, 119186. https://doi.org/10.1016/j.neuroimage.2022.119186 (2022).
Norcia, A. M., Gerhard, H. E. & Meredith, W. J. Development of relative disparity sensitivity in human visual cortex. J. Neurosci. 37, 5608–5619. https://doi.org/10.1523/JNEUROSCI.3570-16.2017 (2017).
Seemiller, E. S., Cumming, B. G. & Candy, T. R. Human infants can generate vergence responses to retinal disparity by 5 to 10 weeks of age. J. Vis. 18, 17. https://doi.org/10.1167/18.6.17 (2018).
Giaschi, D., Narasimhan, S., Solski, A., Harrison, E. & Wilcox, L. M. On the typical development of stereopsis: Fine and coarse processing. Vis. Res. 89, 65–71. https://doi.org/10.1016/j.visres.2013.07.011 (2013).
Giaschi, D., Lo, R., Narasimhan, S., Lyons, C. & Wilcox, L. M. Sparing of coarse stereopsis in stereodeficient children with a history of amblyopia. J. Vis. https://doi.org/10.1167/13.10.17 (2013).
Acknowledgements
This study was supported by Grants RO1EY030544 (J.D.) and RO1EY020976 (D.L.) from the National Eye Institute (NEI), Bethesda, MD. We thank Suzanne McKee and Adrien Chopin for their constructive comments and suggestions on an earlier version of this manuscript.
Author information
Authors and Affiliations
Contributions
J.D. designed the experiments; J.D. & H.H.L. collected the data; J.D. proposed the models and performed the modeling; J.D. drafted the manuscript; J.D. & D.M.L. revised the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Ding, J., Lu, H.H. & Levi, D.M. Absolute and relative disparity mechanisms revealed by an equivalent noise analysis. Sci Rep 14, 6863 (2024). https://doi.org/10.1038/s41598-024-57406-2
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-024-57406-2
- Springer Nature Limited