Introduction

Scientific psychology got underway in the 19th century with a strong impetus from physics and physiology: think of Helmholtz, E.H. Weber, Fechner, and Wundt among others (see Boring, 1957). Murray and Link (2021) have recently shown that quantitative ideas were more frequent and deeply seated in the developing science than historians previously realized. A central accomplishment of the time was Fechner’s psychophysics, which concerned the encoding of external energies into psychological values. Speaking broadly, if Fechner’s psychophysics can be thought of as getting from the stimulus into the head (into psychological representation via inner psychophysics; Boring, 1957; Dzhafarov & Colonius, 1999; Murray & Link, 2021), signal detection theory can be credited with getting back out of the head and into observable behavior (Ashby & Wenger, 2021; Link, 1994; Wixted, 2020). This foundational conceptualization is in part why William K. Estes, one of the giants of scientific psychology in the 20th century, considered signal detection theory to be “the most towering achievement of basic psychological research in the last half century” (Estes, 2002, p. 15).

A central distinction in signal detection theory is between the amount of perceptual or psychological evidence in favor of a response alternative and the predisposition to choose that alternative, regardless of the level of evidence in its favor. This distinction is typically quantified in classic signal detection theory in terms of sensitivity or discriminability (e.g., \(d^{\prime }\)) and response bias or response criterion (e.g., β or c, Ashby & Townsend, 1986; Ashby & Wenger, 2021; Macmillan & Creelman, 2004). Although there have been discussions of the necessity of the construct of response bias (e.g., Balakrishnan, 1999), one of the strong empirical regularities in the application of signal detection theory has been the fact that response frequencies can quite easily be shifted using manipulations such as relative trial frequencies and payoffs (Egan, 1975; Macmillan & Creelman, 2004). In fact, this result has been found even at the level of features making up more complex stimulus patterns (Townsend et al., 1988).

The distinction between levels of perceptual evidence and the decisional criterion that is applied to that evidence was maintained in the multidimensional generalization of signal detection theory known as general recognition theory (GRT, Ashby & Townsend, 1986). Because of the introduction of more than a single perceptual dimension, the originators developed several key types of invariance (and their complements) across and among the distinct dimensions. Of course, as in single dimensional signal detection theory, concepts must still be embedded in a stochastic setting, but now that setting is multidimensional.

In GRT, the distinction is made among perceptual separability (PS), perceptual independence (PI), and decisional separability (DS). Perceptual separability signifies that changing the physical level of one dimension does not disturb the marginal probability distribution on the other dimensions. Perceptual independence is defined by the property that the joint probability distribution on values from the various dimensions be independent, expressed by that joint density function being expressed as the product of the separate marginal densities. Finally, decisional separability is interpreted as the decisional criterion on any one dimension being expressed by a single value, and thereby unaffected by values on the other dimensions. This constraint implies that the decision bounds in the ambient perceptual space be expressed as planes orthogonal to the pertinent dimension and parallel to the remainder of the (by definition, linear and mutually orthogonal) subspaces.

The majority of the published applications of GRT to date have used the simplest experimental paradigm: a 2 × 2 design in which two dimensions are manipulated at two levels each. Unfortunately, as convincingly documented by Silbert and Thomas (2013, 2017), there is an identifiability problem with respect to DS in this design: In any case in which DS is violated by a rotation of the linear decision bounds while satisfying PS, there always exists an alternative model that makes precisely the same empirical predictions and satisfies decisional separability while violating PS and with accompanying changes in the within-stimulus correlations, and these two models are related by a linear transformation.

This identifiability problem prompted Silbert and Thomas to suggest that applications of GRT that rely on the 2 × 2 design should be conducted under the a priori assumption that decisional separability holds (Silbert & Thomas, 2013, 2017). Acknowledging the inherent challenges with the basic design, that approach obviates the use of one of the most valuable and long-standing assumptions of what Townsend and Ashby (1983, p. 465) called the principle of correspondent change. Specifically, if it is an empirical regularity that decisional criteria can be shifted, then the possibility of such a shift needs to be incorporated into the theory. This is what separates quantitative theory from curve fitting and assuming a priori that DS holds further seems to ignore the multitude of strong empirical regularities associated with classic manipulations such as variations in payoffs and stimulus frequency (Egan, 1975; Macmillan & Creelman, 2004; Townsend et al., 1988). And, when the possibility opens up that this linkage might be truly violated, there would be no avenue for its study. Thus, research has uncovered the important possibility that shifts in decisional operations may be contributors to effects that have previously been interpreted purely in terms of perceptual factors, such as is the case in the literatures on configural or holistic perception (e.g., Cornes et al.,, 2011; Wenger & Ingvalson, 2002; Wenger & Ingvalson, 2003). As a consequence, we sought to determine here whether, consistent with the logic of converging operations (Garner et al., 1956), there might be empirical regularities in independent measures associated with a known violation of DS.

In the present investigation, a classical illusion provides the terrain where several important goals may be pursued. The foremost goal is to further probe novel regions where bias manipulations might actually also perturb certain sensory functions. Toward this end, and following our recent work on perceptual learning (Wenger & Rhoten, 2020) we bring together within the same study GRT and systems factorial technology (SFT, Little et al.,, 2017; Townsend & Nozawa, 1995). GRT was invented primarily to investigate, as noted, dependencies among perceptual dimensions whereas SFT was designed primarily to identify mental architectures, decisional stopping rules, and workload capacity. Thus, one overriding aim was to perform experiments and allied analyses on the same individuals with the same or highly similar stimuli in such a way as to simultaneously shed light on each of the formerly isolated sets of mechanisms and properties. In so doing and beyond the proof of concept in joint application of SFT and GRT, we can thereby examine the effects of our bias manipulations on their conjoined findings.

Context: The Hering illusion

The specific experimental context for the present investigation is an illusion first described by Ewald Hering in the 19th century (Hering, 1861). The illusion is one in which one or more objectively vertical lines are perceived as being curved outward when superimposed on a background set of radial lines (see Fig. 1a). At least two physical factors appear to modulate the magnitude of the illusion: the number of radial lines and the physical congruence between the lines on either side of the center point (Hansen, 1961). The illusion offers a situation in which subjective perception does not correspond to the physical ground truth of the stimulus, which has distinct physical properties that allow for easy manipulation, and (by way of the influence of the congruence of the two sets of lines) offers a form of configurality. Thus, it gives us a chance to examine the potential function and effect of potential violations of DS in configural perception that are outside domains in which violations of DS as a source of configurality have been the source of some debate. In order to use the Hering illusion in this way, it was first necessary to determine the extent to which we could control the magnitude of perceived curvature as a function of the true level of curvature and the congruence between the left and right vertical lines in the stimuli. This was the purpose of Experiment 2.

Fig. 1
figure 1

a Example stimulus: in this example, the left and right vertical lines are objectively straight but are reliably judged to be curved outward. b A subset of the results from the rating experiment

In that preliminary experiment, we document the extent to which varying the objective curvature of the horizontal lines—from curved slightly inward to curved outward—affected the perceived curvature of each set of lines. Documenting the magnitude of the effect of actual curvature on perceived curvature allowed us to select the levels of curvature that we used in Experiment 2. In that second experiment, we varied payoffs either in favor of a curved response or in favor of a straight response in two tasks: a complete identification (CID) paradigm and a double factorial paradigm (DFP). The CID allowed us to examine performance from the perspective of GRT (Ashby & Townsend, 1986; Ashby & Wenger, 2021) in order to determine whether our payoff manipulation was successful in inducing violations of DS. The DFP allowed us to examine performance from the perspective of SFT (Little et al., 2017; Townsend & Nozawa, 1995) in order to determine if violations of DS were regularly accompanied by any changes in processing architecture, stopping rule, independence in processing rates, or capacity.

Experiment 2

The intent of Experiment 2 was to quantify the extent to which the magnitude of the Hering illusion can be affected by factorially manipulating the level of curvature present in each set of vertical lines, holding the number of radial inducing lines constant. This was done by having participants rate the perceived curvature of the left and right set of lines separately, while varying the level of actual curvature in each set of lines.

Methods

Design

This experiment was designed as a 6 (level of left curvature: -2, 0 to + 4 pixels) × 6 (level of right curvature: -2, 0 to + 4 pixels) × 2 (set of lines to be judged: left, right) factorial, with all factors manipulated within participants. With respect to curvature, the units were the amount in pixels by which the vertical lines were curved either inward (negative values) or outward (positive values) at their maximum point of curvature. A value of 0 pixels curvature identifies vertical lines that were objectively straight. The stimulus at -2 pixels was chosen on the basis of pilot work indicating that this level of inward curvature was reliably perceived to be straight.

Participants

A total of 25 participants (13 female) were recruited from the University of Oklahoma community and were compensated for their participation with a $25 gift card.

Materials

An example stimulus is presented in Fig. 1a. Stimuli were constructed of black lines on a mid-scale gray background. At a constrained viewing distance of 79 cm, the stimuli subtended 2.6 and 3.9 of visual angle, horizontally and vertically. The pair of vertical lines subtended 1.3 of visual angle, horizontally. Stimuli were presented on a 53 cm (diagonal) grayscale CRT with a refresh rate of 100 Hz. Stimulus displays and response collection were controlled by DMDX (Forster & Forster, 2003). Stimulus onsets were synchronized to the vertical refresh rate of the monitor and all responses were timed to ± 1 ms.

Procedure

The experiment was performed in a dimly lit room, and participants adapted to the dim lighting for five min before beginning the experiment. The entire experiment was completed in approximately 45 min. The session began with the experimenter explaining the procedure for making the ratings and included four example trials, two in which the participant rated the perceived curvature of the left set of lines and two in which the participant rated the perceived curvature of the right set of lines.

Each trial in the experiment was self-initiated, and participants could take short breaks at any point. Order of the side to be rated (left, right) was randomly determined for each participant. Each trial began with a centrally presented black dot signaling that the trial was ready to begin. Participants initiated the trial by pressing the space bar on the computer keyboard. A fixation cross was then presented for a duration determined by sampling from an exponential distribution with a mean of 1100 ms, censored at 700 and 1500 ms. This was replaced by a pattern pre-mask, followed by the stimulus to be rated, followed by a pattern post-mask, each for 100 ms. This was followed by a linear response scale presented in the bottom half of the screen, whose midpoint was labeled “straight” and whose left and right end-points were labeled as “curved left” and “curved right”, respectively. Participants made their curvature ratings by clicking a point on this line, and no feedback was provided. Curvature ratings were normalized to a range of -1 to 1 for each observer prior to analyses. Each level of curvature was rated ten times.

Results

The normalized curvature ratings were analyzed using two 6 (level of left curvature) × 6 (level of right curvature) repeated-measures analyses of variance (ANOVAs), one each for the ratings of the left and right sides. Results are presented in Table 1, and a subset of the ratings are plotted in Fig. 1b. The normalized responses were remarkably consistent across participants. For both the left and right sides, the main effects of curvature on each side were reliable, indicating that perceived curvature increased with actual curvature. In addition, the interaction of the two sides was reliable, indicating that the perceived curvature of the side being rated was affected by the curvature of the other side, with the interaction being to increase perceived curvature of one side if the other side was more curved. All levels of actual curvature, including curvature of 0 (objectively straight) but excluding curvature of -2 (curved inward) resulted in curvature ratings that were reliably different from 0. Ratings at curvature of -2 were all not reliably different from 0. This is the Hering illusion: lines that were objectively curved inward were reliably judged to be straight while lines that were objectively straight were reliably judged to be curved outward. Levels of curvature beyond + 1 did not reliably differ from one another on either side. Consequently, for clarity of presentation, we restrict the data in Fig. 1b to levels -2, 0, and + 1 on both sides.

Table 1 Results of the repeated-measures ANOVAs on the curvature ratings

Discussion

The results of Experiment 2 affirm that the presence and magnitude of the Hering illusion can be affected by the congruence of curvature between the two sets of lines. This is important as Experiment 2 requires that we be able to manipulate the magnitude of the illusion. In addition, this suggests a potential type of configurality that we wish to characterize in terms of independence vs. dependence in the encoded information and the forms of processing that are applied to that information.

Experiment 2

In Experiment 2, we sought to use the combined tools of GRT and SFT to determine whether an induced violation of DS (in the complete identification task, CID) would be accompanied by regular changes in characteristics of processing in the double factorial paradigm (DFP). Participants performed both tasks twice, once in a neutral payoff condition and then in one of the two biased conditions. Half of the participants were transferred to a condition which preferentially rewarded responses indicating perceived curvature and half were transferred to a condition which preferentially rewarded responses indicating an absence of perceived curvature.

Methods

Design

The CID was designed as a 2 (subjective curvature on the left: absent, present) × 2 (subjective curvature on the right: absent, present) × 2 (payoffs: neutral, biased) complete factorial, with all factors manipulated within participants. The neutral payoff condition was always performed first, and half of the participants were transferred to one of the two bias conditions. The payoff matrices for the three bias conditions in the CID are presented in Table 2.

Table 2 Payoff matrices for the CID as a function of the biasing conditions

The DFP was designed as a nested pair of 2 × 2 factorial designs. The outer factorial was a 2 (subjective curvature on the left: absent, present) × 2 (subjective curvature on the right: absent, present) factorial. The inner factorial pertained to the stimuli in which subjective curvature was present on both sides and was a 2 (ambiguity of the curvature on the left: ambiguous, unambiguous) × 2 (ambiguity of the curvature on the right: ambiguous, unambiguous) factorial, with ambiguity manipulated by way of the actual amount of curvature. All factors were manipulated within participants. As with the CID, the neutral bias condition was performed first, then half of the participants were transferred to a condition biased in favor of curvature and half were transferred to a condition biased against curvature. The payoff matrices for the three bias conditions of the DFP are presented in Table 3.

Table 3 Payoff matrixes for the DFP

Participants

Eight participants (five female) were recruited from the university community and were compensated with a $150 gift card. Data from one participant could not be used due to the participant withdrawing from the experiment before completion.

Materials

Stimuli were selected from those rated during experiment 1. Example stimuli for both tasks are presented in Figure 2. The stimulus with lines curved in by two pixels was considered subjectively straight, while the stimulus with 0 pixels of curvature was considered ambiguously curved, and the stimulus with 1 pixel of outward curvature was considered unambiguously curved. Participants viewed the images from a constrained distance of 79 cm, with the images overall subtending 2.6 and 3.9 of visual angle, horizontally and vertically, and the pair of vertical lines subtending 1.3 of visual angle horizontally. Stimuli were presented on a 53 cm (diagonal) grayscale CRT with a refresh rate of 100 Hz. Stimuli were displayed and responses were collected using DMDX (Forster & Forster, 2003). Stimulus onsets were synchronized to the vertical refresh rate of the monitor. All responses were timed to +/- 1 ms.

Fig. 2
figure 2

a Example stimuli for the CID. b Example stimuli for the DFP, including the four stimuli in the inner (nested) factorial, manipulated for selective influence on the processing times of the left and right set of lines. Note: s = slow, f = fast

Procedure

Participants completed 16 sessions lasting approximately 1 h each. They were offered three 5-min breaks during each session. Each session was completed in darkness and participants were adapted to the dark for 5 min before every session and after each break. Figure 3 summarizes the procedure for Experiment 2.

Fig. 3
figure 3

a Overview of the procedure used in experiment 2

Participants were randomly assigned to begin with either three sessions of the CID or five sessions of the DFP before moving on to complete the other task. The first eight sessions always had neutral payoffs. After both paradigms were completed, participants engaged in them again in the same order, this time with the appropriately biased payoffs.

During both the CID and DFP, participants accumulated points for their selections on the basis of the payoffs. The points were constantly shown in the bottom right of the screen and reset after each block. At the start of every block, participants were reminded of their previous block score (or, in the case of the first block of the day, their previous day’s final block score) and encouraged to do their best to beat their score. At the end of each block of trials, the participant’s block score was compared to their previous block. If the participant beat their previous score, they could select a piece of candy from a dish.

Complete identification paradigm

Participants completed two series of three CID sessions, each consisting of four blocks of 192 trials, for a total of 768 trials per session. Each of four stimuli appeared 48 times per block. During the first session of each series, participants were instructed to press one of four keys on the number pad to identify which stimulus they saw, and they were given 12 practice trials.

Each trial began with a fixation point in the center of the screen that signaled to the participant that the trial could begin. Participants pressed the space bar to begin the trial. A fixation cross was presented for a duration determined by sampling from an exponential distribution with a mean of 1000 ms, censored at 500 and 1500 ms. This was followed by a pattern pre-mask, the stimulus, and a pattern post-mask, each for 100 ms. Responses were collected, and participants heard an 800-Hz tone if their judgment of both sides was correct, a 660-Hz tone if their judgment was only correct about one side, and a 440-Hz tone if they were incorrect about both sides.

Double factorial paradigm

Participants completed two series of five DFP sessions, each consisting of four blocks of 192 trials, for a total of 768 trials per session. In each block, half of the presented stimuli appeared subjectively straight on both sides. Of the other half, 64 appeared straight on one side while the other side was unambiguously curved; 16 appeared unambiguously curved on one side and ambiguously curved on the other; eight appeared ambiguously curved on both sides; and eight appeared unambiguously curved on both sides. Participants were instructed to press a key assigned to their dominant hand if either set of lines were perceived to be curved, and a key assigned to their non-dominant hand if neither set of lines was perceived to be curved. Participants began each session with four practice trials.

Each trial began with a fixation point in the center of the screen that signaled to the participant that the trial could begin. Participants pressed the space bar to begin the trial. A fixation cross was presented for a duration determined by sampling from an exponential distribution with a mean of 1000 ms, censored at 500 and 1500 ms. This was followed by a pattern pre-mask, the stimulus, and a pattern post-mask, each for 100 ms. Responses were collected and participants were presented with an 800-Hz tone if their judgment was correct and a 440-Hz tone if their response was incorrect.

Results

Complete identification task

We first examined the data for any violations of marginal response invariance (MRI) and timed marginal response invariance (tMRI, Townsend et al.,, 2012), given the critical and non-parametric role that MRI and tMRI play in drawing inferences concerning DS and PS (Silbert & Thomas, 2013; 2017).Footnote 1 MRI holds for the i th level of the left side if

$$ \begin{array}{@{}rcl@{}} p(l_{i} | L_{i} R_{s}) & = & p(l_{i} r_{s} | L_{i} R_{s}) + p(l_{i} r_{c} | L_{i} R_{s}) \\ & = & p(l_{i} r_{s} | L_{i} R_{c}) + p(l_{i} r_{c} | L_{i} R_{c}) \\ & = & p (l_{i} | L_{i} R_{c}) \end{array} $$

where L and R indicate the left and right set of lines, respectively, and the subscripts s and c indicate straight and curved. tMRI holds for the i th level of the left side if, for all t > 0

$$ \begin{array}{@{}rcl@{}} p(l_{i}, T \leq t | L_{i} R_{s}) & = & p(l_{i} r_{s}, T \leq t | L_{i} R_{s}) + p(l_{i} r_{c}, t \leq t | L_{i} R_{s}) \\ & = & p(l_{i} r_{s}, T \leq t | L_{i} R_{c}) + p(l_{i} r_{c}, T \leq t | L_{i} R_{c}) \\ & = & p(l_{i}, T \leq t | L_{i} R_{c}) \end{array} $$

The results of these comparisons are presented in Tables 4 and 5. No violations of MRI or tMRI were obtained for any of the observers in the neutral payoff condition. A total of seven violations of MRI were obtained in the condition in which curvature was preferred, and a total of five violations of MRI were obtained in the condition in which a straight response was preferred. In each case, the inequalities in the response proportions were consistent with the intended bias in favor of either the curved/curved or straight/straight stimulus. That is, the failures of MRI in the condition in which curvature was preferred showed either an increase in the proportion of responses indicating curvature or a decrease in the proportion of responses indicating a lack of curvature. The opposite was true for the condition in which a straight response was preferred. In this case, the failures of MRI were either an increase in the proportion of straight responses or a decrease in the proportion of straight responses. A total of five violations of tMRI were obtained in the condition in which the bias was in favor of curvature, and in all five cases the violation occurred when the second dimension was curved. A total of seven violations of tMRI were obtained in the condition in which bias favored a straight response, and four of those seven cases occurred when the second dimension was straight. Thus, the MRI and tMRI results suggest potential violations of either PS or DS, but only in the biased conditions.

Table 4 Tests of marginal response invariance
Table 5 Tests of timed marginal response invariance

We next considered comparisons involving the marginal signal detection measures of discriminability (\(d^{\prime }\)) and bias (c), calculated according to the standard equal-variance gaussian model. These comparisons took the form

$$ \begin{array}{@{}rcl@{}} d^{\prime}(L, R_{s}) & = & d^{\prime}(L, R_{c}) \\ c(L, R_{s}) & = & c(L, R_{c}) \end{array} $$

These are presented for the neutral condition and the condition biased in favor of curvature in Table 6 and for the neutral condition and the condition biased against curvature in Table 7. Equality of the parameters was assessed using 95% confidence intervals (Gourevitch & Galanter, 1967). No reliable differences for either marginal measure were obtained in the neutral conditions. We found one reliable inequality for \(d^{\prime }\), for observer 6 in the condition favoring curvature. In addition, we obtained reliable inequalities for c on one dimension for all observers in both biasing conditions, with the direction of the shifts in the value of c being consistent with the biasing manipulation. That is, marginal c became more negative (liberal) in the condition favoring curvature and became more positive (conservative) in the condition favoring a lack of curvature. This suggests that our manipulation was able to induce the desired violations of decisional separability in a regular and interpretable way, although this was true for only one of the dimensions.

Table 6 Marginal signal detection parameters for the four observers who were switched to a bias in favor of curvature
Table 7 Marginal signal detection parameters for the four observers who were switched to a bias against curvature

We then examined the data for any violations of report independence (RI) and timed report independence (tRI, Townsend et al.,, 2012). RI holds for stimulus LiRj if

$$ \begin{array}{@{}rcl@{}} p(l_{i} r_{j} | L_{i} R_{j}) & = & [p(l_{i} r_{s} | L_{i} R_{j}) + p(l_{i} r_{c} | L_{i} R_{j}] \\ & & \times [p(l_{s} r_{j} | L_{i} R_{j}) + p(l_{c} r_{j} | L_{i} R_{j})] \end{array} $$

tRI holds for stimulus LiRj if

$$ \begin{array}{@{}rcl@{}} p(RT(i, j) &\leq& t | L_{i} R_{j}) \times p(RT(\cdot, \cdot)\\ &\leq& t | L_{i} R_{j}) = p(RT(i, \cdot) \leq t | L_{i}, R_{j})\\ &&\times p(RT(\cdot, j) \leq t | L_{i} R_{j}) \end{array} $$

Results for the checks on RI are presented in Table 8 and the results for tRI are presented in Table 9. No violations of RI or tRI were obtained for any of the observers in either of the biasing conditions, suggesting that there were no violations of PI in either the neutral or either one of the bias conditions.

Table 8 Tests of report independence
Table 9 Tests of timed report independence

To this point, the data suggest no violations of PI in either the neutral or either biased condition, and only a single violation of PS for one observer in the positive bias condition. In contrast, while there were no violations of DS in the neutral conditions, every observer in the biased conditions showed potential violations of DS, in directions consistent with the biasing manipulation. However, given the known weaknesses of the 2 × 2 design, we checked these inferences using a hierarchy of constrained multivariate Gaussian models (Thomas, 2001). Descriptions of the best-fitting models for each observer in each biasing condition are presented in Table 10. Consistent with the preceding analyses, the best-fitting models for the neutral conditions allowed preservation of PI, PS, and DS for all observers. For the condition favoring curvature, the models showed a single violation of PS for one observer, and consistent violations of DS on just one dimension for all observers. The same result was obtained for the condition biased against curvature: the best-fitting models for all of the observers contained a violation of DS on just one dimension.

Table 10 Best-fitting models for each of the observers in each of the bias conditions

Predicted effects of a violation of DS on capacity

Having established that we were able to induce regular violations of DS with our biasing manipulations in the CID, we can now inquire as to the effects that such violations might have in the context of the DFP. Consider that the violations of DS that we observed occurred on only one of the two dimensions. Now consider that the cells of the CID that provide the data for observing this violation are the cells in the DFP that are used to estimate capacity (see Fig. 4a) using the AND version of the capacity coefficient

$$ C_{A}(t) = \frac{\ln[F_{L}(t)] + \ln[F_{R}(t)]}{\ln[F_{LR}(t)]}, $$

where FL(t),FR(t) and FLR(t) are, respectively, the cumulative distribution functions for the conditions in which the left, right, and both sides are curved. Finally, consider that we obtained two types of violation of DS, one in favor of curved responses in the curved/curved trials (Fig. 4b), and one in favor of straight responses. From Fig. 4b and c we can see that the RTs in the DFP that will be affected should be those for the straight/curved and curved/curved stimuli. In order to obtain predictions for the effects of these violations of DS on capacity, we simulated these conditions using the linear dynamic systems modeling approach that we have used in previous work (Townsend & Wenger, 2004, 2006, 2012). Details of this modeling are presented in the Appendix and the results of the simulations are presented in Fig. 5.

Fig. 4
figure 4

Effects of a violation of DS on capacity: a Correspondence between the cells in the CID that are used to infer a violation of DS (shaded) and the cells in the DFP that are used to estimate capacity. b The shift in the decision bound in favor of a curved response in the curved/curved trials. (c) The shift in the decision bound in favor of a straight response in the curved/curved trials

Fig. 5
figure 5

Simulation results: Cumulative distribution functions (cdfs) (a) for the trials involving the straight/curved stimuli (b) for the trials involving the curved/straight stimuli, and (c) for the trials involving the curved/curved stimuli. d Predicted effects on the capacity coefficient CA(t). e Predicted error rates as a function of the activation threshold γ; the pair of vertical lines indicate the range of γ used in the simulations

Consistent with our expectations, neither of the shifts in decision bounds predicts an effect on RTs in the curved/straight condition (Fig. 5b). In the case in which the shift in the decision bound is to favor a curved response in the curved/curved condition, we see opposing effects in the cumulative distribution functions for the straight/curved and curved/curved conditions. For the straight/curved condition, the shift in the decision bound in favor of the curved response for the right side is a shift against the straight response for the right side, which results in a slowing of the RTs for the straight/curved stimuli (Fig. 5a). These are the RTs that contribute to the numerator of the capacity coefficient CA(t). For the curved/curved condition (Fig. 5c), we obtain a slight speeding-up of RTs due to statistical facilitation (Gumbel, 1958), with these being the denominator of CA(t). This slowing of one of the single-target distributions and slight speeding-up of the double-target distributions has the net effect of predicting an increase in CA(t) (Fig. 5d). In the case in which the shift in the decision bound is to favor a straight response in the curved/curved condition, we get opposite effects: a slight speeding-up of the straight/curved RTs and a substantial slowing of the curved/curved RTs, which results in a reduction in CA(t). Intriguingly, Fig. 5e reveals that in our dynamic stochastic linear systems model, fairly substantial changes in the decision criteria can occur such that processing times are significantly affected without accompanying alterations in accuracy. The outcome can be sizeable effects on CA(t) absent effects on accuracy. In sum, our simulations predict that a shift in favor of a curved response should result in an increase in capacity, while a shift in favor of a straight response should result in a reduction in capacity.

Double-factorial paradigm

Having established that we were able to induce regular violations of DS with our biasing manipulations in the CID, we can examine observers’ performance in the DFP, before and after experiencing the biasing manipulation. In order for inferences to be drawn with respect to architecture and stopping rule, we need to first establish that the distributions of RTs in the double-target condition were ordered consistent with the assumption of selective influence (Townsend & Nozawa, 1995). Results of these tests are presented for all observers in both biasing conditions in Table 11, with the simple result being that proper and reliable orderings were obtained in all cases.

Table 11 Results of the pairwise Kolmogorov–Smirnov tests of stochastic dominance

We then examined the interaction contrasts for the double target trials, first at the level of the survivor function of the RT distribution. This survivor function interaction contrast (SIC), defined as

$$ \begin{array}{@{}rcl@{}} SIC = [S_{ss}(t) - S_{sf}(t)] - [S_{fs}(t) - S_{ff}(t)] \end{array} $$

along with the mean interaction contrast (MIC), defined as

$$ \begin{array}{@{}rcl@{}} MIC = (\overline{RT}_{ss} - \overline{RT}_{sf}) - (\overline{RT}_{fs} - \overline{RT}_{ff}), \end{array} $$

where the subscripts f and s denote the fast and slow conditions, allows for strong inferences regarding processing architecture and stopping rule. The SICs for all observers who were switched from a neutral to bias in favor of curvature are presented in Fig. 6, and the SICs for those who were switched from a neutral to a bias against are presented in Fig. 7. Solid lines in each panel present the SIC and the dashed lines represent the upper and lower limits of a 95% confidence interval, as estimated by bootstrap.Footnote 2 As can be seen in these figures, the SICs for all observers in both neutral and biased conditions were overwhelmingly negative, from which we infer parallel exhaustive processing of the two sets of lines. No noticeable shifts in the SICs were noted as a function of having experienced the biasing manipulation.

Fig. 6
figure 6

Survivor function interaction contrasts for the four observers who were switched to a bias in favor of curvature. a Obs 1, neutral; b obs. 1, prefer curvature; c obs. 4, neutral, d obs. 4, prefer curvature; e obs. 6, neutral; f obs. 6, prefer curvature; g obs. 8, neutral; h obs. 8, prefer curvature

Fig. 7
figure 7

Survivor function interaction contrasts for the three observers who were switched to a bias against curvature. a Obs 2, neutral; b obs. 2, prefer straight; c obs. 7, neutral, d obs. 7, prefer straight; e obs. 10, neutral; f obs. 10, prefer straight

The results of analyzing the data in terms of the main effects and (more importantly) their interaction are presented in Table 12. The interactions for all observers in both the neutral and biased conditions were significant and the MICs were all reliably < 0. Taken together with the SICs, these results unambiguously indicate parallel exhaustive processing in both the neutral and biased conditions, with no differences in terms of the valence of the induced bias. This consistency is almost startling, given the response demands, which allowed for parallel self-terminating processing. It appears that in the presence of this illusion and the varying payoff structures participants opted to engage in exhaustive processing, consistent with notions of configural processing (Townsend & Wenger, 2015). It is quite rare to find pure exhaustive processing where it is possible to obtain perfect accuracy with a minimum time stopping rule.

Table 12 Tests of the interactions at the level of the means

We next examined the capacity coefficients, and plots of all C(t)s are presented for those transferred to a bias in favor of curvature in Fig. 8 and for those transferred to a bias against curvature in Fig. 9. In addition, Figs. 10 and 11 plot the difference between the capacity coefficients in the biased and neutral conditions for the two biasing conditions. We should note that accuracy for all observers was > 94% in all cases, consequently we used the form of the capacity coefficient that is based only on correct trials (Townsend & Nozawa, 1995; Townsend & Wenger, 2004) rather than the form that takes accuracy into account (Townsend & Altieri, 2012). In each panel of Figs. 8 and 9, the solid line represents the value of CA(t) while the dotted lines plot the upper and lower bounds of the 95% confidence interval, as estimated by bootstrap. For those transferred to the positive bias condition, CA(t) was reliably > 1 both in the neutral and the positive bias condition, and capacity increased after the transfer to the positive bias condition (see Fig. 10). For those transferred to the negative bias condition, CA(t) was reliably > 1 in the neutral condition, and decreased to no longer being reliably different from 1 for two observers, and simply decreased for the third (see Fig. 11).

Fig. 8
figure 8

Capacity coefficients for the four observers who were switched to a bias in favor of curvature: a obs. 1, neutral, b obs. 1, prefer curvature; c obs. 4, neutral; d obs. 4, prefer curvature; e obs 6., neutral; f obs. 6, prefer curvature; g obs. 8, neutral; h obs. 8, prefer curvature

Fig. 9
figure 9

Capacity coefficients for the four observers who were switched to a bias against curvature: a obs. 2, neutral, b obs. 2, prefer straight; c obs. 7, neutral; d obs. 7, prefer straight; e obs 10, neutral; f obs. 10, prefer straight

Fig. 10
figure 10

Difference in CA(t), bias - neutral, for observers who were switched to a bias in favor of curvature: a obs. 1, b obs. 4, c obs. 6, d obs. 8

Fig. 11
figure 11

Difference in CA(t), bias - neutral, for observers who were switched to a bias against curvature: a obs. 2, b obs. 7, c obs. 10

Discussion

In the present study, we sought to determine whether an induced violation of DS would be accompanied by other regular changes in RT-based measures for GRT and in any characteristics of processing in the DFP. We demonstrated in the CID that our manipulations reliably and consistently varied response frequencies in the desired directions and that these were revealed in the analysis of the response frequencies as reliable shifts in c, reliable violations of MRI and tMRI, and with multivariate gaussian model fitting concurring in the inference of violations of DS in the biased conditions. We should emphasize that the evidence from all of these measures was consistent in pointing to violations of DS in the two bias conditions. This was obtained with there being no evidence in either response frequencies or RTs suggestive of violations of PI for any of the observers in any of the conditions. In addition, there was only a single case in which there was evidence to support a violation of PS and, in this case, the change in response frequencies under the bias manipulations was consistent with the direction of the induced bias.

We further demonstrated in the DFP that, consistent with our dynamic systems modeling, the induced bias was accompanied by shifts in capacity: inducing a bias in favor of curvature was associated with an increase in capacity, and inducing a bias against curvature was associated with a decrease in capacity. No other changes in processing characteristics were noted, with all observers in both bias conditions engaging in parallel exhaustive processing, in spite of a response instruction that allowed for self-terminating processing. This relationship between variations in a decisional operator and variations in capacity was first noted in our earlier work on faces and words (Wenger & Townsend, 2006), where a model that incorporated changes in a decisional operator best explained the observed changes in capacity.

General discussion

The present results unequivocally do not and are not intended to solve the identifiability issues with the 2 × 2 version of the CID. What we can claim instead is that induced violations of DS in the CID were regularly associated with changes in capacity in the DFP, and that these two sources of evidence, along with the findings from our dynamic systems modeling, converge with respect to an inference of a violation of DS.Footnote 3

It is possible that a transformation such as those discussed by Silbert and Thomas (2013, 2017) could produce some of the same results. However, it is notable that the several bias conditions lead to quite parsimonious and regular findings in a way very consonant with the experimental manipulations. Thus, we contend that violations of DS more elegantly explain our results than would a violation of either PS or PI. This is important, as it illustrates the need for GRT to be able to consider hypothesized and actual shifts in response criteria as the basis for behavioral regularities. It simply is not sufficient to a priori assume preservation of DS in all cases.

An unexpected but intriguing result from our simulations was that we were able to identify a range of values for the decisional operator (the activation threshold) over which it was possible to see large changes in capacity without any changes in accuracy. This goes against intuition based on classic speed/accuracy tradeoffs (e.g., Dosher, 1979; Wickelgren, 1977) and to our knowledge is the first observation of its kind. Further theoretical and empirical work will be needed to understand the boundary conditions and robustness of this finding.

A violation of DS is a form of integrality that has been suggested as one of the sources for effects that have been interpreted as perceptually holistic (e.g., Wenger & Ingvalson, 2002, 2003). Here we obtained this integrality by manipulating a visual illusion. This phenomenon therefore represents a truly psychological integrality, which is important given that dimensional interactions at the level of decisional operators is distinctly something “inside the head” and not of necessity determined by the physical characteristics of the stimuli.

The present results also speak to the power of combining the measurement approaches of GRT and SFT along with the insights allowed by our linear systems modeling. We previously have shown the potential for supporting strong inferences by combining the two approaches to questions pertaining to perceptual learning (Wenger & Rhoten, 2020). We believe that the current results further reinforce this point. Taking the main analyses all together, we conclude that processing on the two sides of the Hering stimuli were perceptual independent, perceived in parallel with an exhaustive processing rule, in a super capacity fashion and in such a fashion that bias manipulations influenced processing speed without a significant effect on accuracy.

The surprising finding of super capacity while accompanied (at least in the GRT phase of the study) by perceptual independence and the rather startling implementation of a (quite inefficient) exhaustive decision stopping rule are intriguing and merit further exploration. We are in the process of constructing experimental designs which engage SFT and GRT tools over the same sets of trials. This new direction may aid in explaining some of the current puzzles.

In sum, we have shown that it is possible to induce regular violations of DS in the context of a classical illusion, and that these violations are associated in a regular and interpretable way with changes in capacity. We believe this finding suggests avenues for testing hypotheses regarding varieties of independence and dependence in perception that do not require the strong a priori assumption that DS always holds. In addition, for the most part, illusions have existed more or less as experimenta demonstranda (as Boring, 1957, expressed it) for over a century. The current explorations may help lead to more detailed knowledge concerning their information processing mechanisms.