Introduction

How do we make perceptual decisions in the visual periphery? Subjectively, it may seem that we perceive the unattended visual periphery in precise detail, and there is some evidence supporting this claim (Block, 2007, 2011; Lamme, 2004, 2006; Sperling, 1960). However, inattentional blindness (Simons & Chabris, 1999) and change blindness (Grimes, 1996; Rensink et al., 1997) experiments show that alterations to unattended elements in natural scenes are often unnoticed. Additionally, recent experiments reveal that coarse summary statistical properties can be perceived and remembered for unattended items, but individual details are lost (Ward et al., 2016). Profound deficits are also revealed when subjects are asked simple questions about color perception in the periphery (see Box 2, Cohen, Dennett, & Kanwisher, 2016). Thus, while perceptual deficits in the periphery are evident, a precise characterization of this deficit remains elusive, and overall the idea of impaired peripheral perception seems at odds with the subjective impression that we perceive the visual world in relatively uniform detail.

Previous work has addressed this question within the framework of signal detection theory. Results show that observers tend to use liberal detection biases when evaluating peripheral or unattended stimuli, such as grating patterns embedded in noise (Rahnev et al., 2011; Solovey et al., 2015); that is, participants were more likely to say that items were presented in peripheral or unattended locations, even when they were not. These results were interpreted to reflect a subjective sense of inflated phenomenology in the unattended periphery because detection biases could in principle reflect both subjective perception and decisional or response strategies (Witt et al., 2015). It was argued that a decisional or cognitive account of these biases is less likely because results remain consistent even when subjects are given feedback and training (Rahnev et al., 2011; Solovey et al., 2015).

If liberal biases reflect inflated visual phenomenology, we should expect these results to emerge in more ecologically valid settings, too. This question is important, because as noted by previous research, findings from artificial tasks may not be representative of the perceptual and decisional strategies used by the brain in more ecologically valid settings (Felsen & Dan, 2005). Additionally, research indicates that while most processing of real-world scene information is intact with diminished attention, the depth of processing of this information does depend on attention (Groen et al., 2016). Therefore, results from attentional cueing tasks based on a few simple stimuli on a gray background may not generalize well to other settings where the visual scene is complex. To address the question of whether liberal detection biases are evident when making judgments in the unattended periphery, we conducted a study where subjects completed a simulated driving task while they looked for (i.e., detected) specific colors of pedestrians’ clothing in the visual periphery. Unlike previous studies (Rahnev et al., 2011; Solovey et al., 2015), the stimuli were not degraded by noise or reduced contrast; instead, near-threshold task difficulty was created by having the pedestrians move at a fast speed. We reasoned that if the previous results were not due to idiosyncratic strategies adopted in artificial psychophysics experiments, here, subjects should exhibit similar liberal detection biases in the unattended periphery as well, which could be explained by the variance-reduction model (Rahnev et al., 2011; Solovey et al., 2015), as shown in Fig. 1. This model assumes a single unified criterion across attention levels, and proposes reduced trial-by-trial variability of the internal perceptual signal when attended. Thus, the criterion is more liberal when stimuli are unattended.

Fig. 1
figure 1

The variance-reduction model (Rahnev et al., 2011). This schematic demonstrates how participants may exhibit more liberal perceptual criteria when attention is low. Specifically, the model assumes that attention produces smaller trial-by-trial variability in the internal perceptual signal. When the amount of attention devoted to processing a particular location or feature is high, the variance of the target absent (gray dashed lines) and target present (black solid lines) distributions is lower than when the amount of allocated attention is low. The model also assumes that a single unified criterion (the black dashed line) is used across attention levels, which is supported by previous research (Gorea & Sagi, 2000). Thus, the model accounts for more liberal detection criteria under low attention compared to high attention

Methods

Participants

Twenty-six individuals (15 female, 11 male) participated in Experiment 1; mean age = 20.8, SD = 3.0. Fifteen individuals (12 female, three male) participated in Experiment 2; mean age = 21.8, SD = 5.5. We conducted a power analysis based on previous research (i.e., (Rahnev et al., 2011)), and determined that the number of participants should be greater or equal to 15 (see Supplementary Methods). No statistical analyses reported in the paper were performed on partial data. All participants were students at the University of California-Los Angeles. All of them had normal or corrected-to-normal vision, were 18 years or older, and did not have a history of seizures, epilepsy, stroke, or head trauma. Each participant gave informed consent and received either credits or monetary compensation of $10 per hour. This study was approved by the UCLA Institutional Review Board.

Stimuli and apparatus

Stimuli were generated using Unreal Engine 4 (Epic Games, Cary, NC, USA) to create game-like and more ecologically valid environments and stimuli. Participants were instructed to use four keys to accelerate (key ‘W’), brake (key ‘S’) and steer (key ‘A’ for turning left and key ‘D’ for turning right), similar to the conventions in many racing games. The overall experience of stimuli was similar to 3D-racing video games. Participants were instructed to drive a vehicle along a straight two-lane road in a small town (Fig. 2a) and stop at the stoplight at the first intersection. There were no other vehicles on the road. The vehicle was parked 82 m (in virtual distance) away from the stop line at the intersection at the beginning of each trial. When the vehicle was 43 m away from the stop line, in 80% of the trials, a pre-cue textbox was displayed for approximately 4 s in the upper center of the screen, reading either “Attention to the LEFT” or “Attention to the RIGHT” (Fig. 2b). Participants were informed that if a pre-cue textbox appeared, there was a 75% chance that the item they would have to answer a detection question about would be at the same location as indicated by the pre-cue (“valid” trials), and there was 25% chance that the item they would have to answer a detection question about would be at a different location as indicated by the pre-cue (“invalid” trials). They were instructed to allocate attention equally to both the left and right locations if no pre-cue text was presented (“baseline” trials), which accounted for 20% of the total trials.

Fig. 2
figure 2

The task design of both experiments. a The start of each trial. Vehicles started from a parked position near the end of a straight, two-lane road in a small town, 82 m away from the stop line of the intersection. b The attention cue. When the vehicle was 43 m away from the intersection, in 80% of the trials, a pre-cue textbox was displayed in the upper central portion of the screen, which indicated the direction that subjects were to attend when stopped at the intersection. c Fixation validation. Before stimuli were presented, each participant’s fixation was continuously monitored to ensure he/she was fixating on the red stoplight for one full second. The white dotted circle indicates the position of the red light. d Presentation of stimuli. Two people wearing shirts of different colors ran out from behind two telephone booths positioned on each side of the street, and then turned around before getting halfway across the intersection. The two white dotted circles indicate the positions of the two people. e The yes/no detection question. This question was presented 1 s after the stimuli disappeared. f The confidence question. After answering the yes/no question, participants were also asked to report their confidence level (1-4) regarding their choice

Upon reaching the intersection, the vehicle was automatically moved to a fixed position to ensure the stimuli were presented in identical fashion on the screen on each trial. After the vehicle was fully stopped, the pre-cue text disappeared. Then, the participant’s fixation was continuously monitored to ensure the participant was fixating on the red light for 1 s (we set the tolerance of distance between fixation and center of the red light as 2.5 in visual angle) (Fig. 2c). For details of eye tracking, see Supplementary Methods. The stimuli of interest in this task were two running male individuals, with shirt colors randomly drawn from seven potential colors for Experiment 1 (yellow, red, orange, green, blue, purple and gray) or 11 potential colors for Experiment 2 (red, pink, orange, yellow, green, blue, purple, brown, white, gray, and black) on each trial (Fig. 2d). These male pedestrians quickly ran out from behind telephone booths simultaneously at the speed of 17.5 /second after the vehicle had stopped for 1 s, and then quickly ran back behind the telephone booths. They were presented for 300 ms in total; 1000 ms after the stimuli disappeared, a yes-no question (e.g., “Was the shirt color of the person on the left yellow?” or ”Was the shirt color of the person on the right yellow?”) was presented (Fig. 2e). The probability that the correct choice was “Yes” or “No” were both 50%, and participants were told this explicitly in advance. In Experiment 1, the target color was always yellow, while in Experiment 2, the target color was randomly picked from the 11 possible colors in each trial. After answering the yes-no question, participants were also asked to report their confidence level regarding their choice from 1 (low confidence) to 4 (high confidence) (Fig. 2f).

Stimuli were shown on a 24-in. monitor (about 19 inches wide) with 60-Hz refresh rate and resolution of 1280 × 1024 pixels. Each shirt was approximately 1.1× 1.0 in visual angles, and was located 15.4 - 17.6 away from fixation while the stimuli were moving. Participants were seated 60 cm from the screen, and viewed the screen freely with their head unrestrained.

Procedure

In both experiments, all participants were required to complete two sessions; the two sessions occurred approximately 1 week apart. The first session included eye tracker calibration, 15 practice trials, and 210 experiment trials. The second session included eye tracker calibration and 270 experiment trials. Participants were instructed to take a break of up to 60 s after every 30 trials.

Results

Experiment 1: evaluating detection biases for one specific color

To ensure participants were fixating as instructed, we analyzed their eye gaze at 17 ms (i.e., time of one frame) before stimuli were presented, and during 300 ms of stimuli presentation. Participants’ fixations when stimuli were presented from every trial in Experiment 1 are illustrated in Fig. 3a, which shows that subjects were fixating properly when stimuli were presented. The range of the target eccentricity at the onset of the stimulus is 15.52 - 20.28; the 99% confidence interval of the target eccentricity is (16.53, 19.22). Participants’ fixations during stimuli presentation are illustrated in Fig. 3b, which shows that the stimuli were presented in the periphery of our participants’ visual fields in nearly all of the trials. Trials in which participants directed their gaze towards the target were discarded, and are not included in the analyses in this manuscript.

Fig. 3
figure 3

Participants’ fixations in Experiment 1. a Participants’ fixations at 17 ms (i.e., time of one frame) before stimuli (the two male pedestrians) were presented, from every trial in Experiment 1. Each black point represents a fixation from one of the two eyes, and fixation points from both eyes are plotted in the figure. Almost all fixations are on or near the stoplight in every trial. b Participants’ fixations during stimulus presentation (300 ms total) in every trial in Experiment 1. Each black point represents an average fixation of two eyes. Only about 0.28% of fixations are directed to the stimuli of interest (as located in black dash-dotted frames on the sides of the screen), while 98.42% of fixations were properly located near the traffic light (the black dashed frame in the middle)

Next, before conducting signal detection theoretical (SDT) analyses, we assessed the equal variance assumption from SDT and found significant deviations in some subjects (Supplementary Methods, Supplementary Fig. 1). Because of this, in subsequent SDT analyses we used the measures da (sensitivity) and ca (i.e., the criterion, or response bias) to account for unequal variances; this was possible because in addition to Yes/No answers, we also collected confidence ratings and could perform a full ROC analysis with multiple criteria points in ROC space for each subject.

For our primary analyses, we conducted Mauchly’s test and repeated measures ANOVAs to analyze the effects of attention on the percentage of correct responses, da and ca. Overall, performance was similar across attention conditions (Fig. 4a), and the percentage of correct trials did not significantly differ across attention levels, F(2,50) = 0.97, p = .39. This indicates the effects of attentional cuing were likely modest. However, performance was still highest in the valid attention trials and lowest in the invalid attention trials.

Fig. 4
figure 4

Behavioral results from Experiment 1. a The percentage of correct responses across attention conditions. While subjects exhibited the best performance for valid attention trials and the lowest performance for invalid trials, the conditions were not significantly different from one another. bda and ca across attention conditions. da values were quite consistent across attention conditions. ca under inattention (i.e., invalid trials) was significantly lower than ca in the valid or baseline conditions, providing evidence for a liberal detection bias. Bars represent the average values across subjects; error bars represent the standard error of the mean. ∗∗p < .01, p < .05, NS: not significant

Similar to the effect of attention on the percentage correct, the average da was quite consistent across attention conditions (Fig. 4b) and da did not significantly differ across attention levels, F(2,50) = 0.43, p = .65. However, when we compared ca across attention conditions, ca values were significantly different across attention levels, F(2,50) = 7.16, p = .002. Post hoc paired t tests indicated that the difference of ca between the valid condition and invalid condition was significant, t(25) = 3.20, p = .004; the difference of ca between baseline condition and invalid condition was also significant, t(25) = 2.70, p = .012; however, the difference of ca between valid condition and baseline condition was not significant, t(25) = 0.91, p = .37. This provided evidence that participants adopted more liberal criteria for making detection judgments when the target was unattended and presented in periphery.

Experiment 2: evaluating detection biases in the periphery for an array of colors

In Experiment 1, the detection question that was asked on every trial involved a constant target: a person in a yellow shirt. One could argue that a liberal detection bias may be observed due to the sheer fact that subjects may have a tendency to respond “Yes” whenever they are uncertain. That is, our primary finding of a liberal detection criterion could be driven by a confirmation bias at the decisional level. To test if this is the case, we designed Experiment 2 in a specific manner: we changed the target color on each trial, and participants only knew of the color they were to detect when they were prompted by the detection question at the end of the trial. This way, participants could not anticipate what target to detect beforehand. If the liberal detection bias we observed in Experiment 1 were due to a confirmation bias in responding, we would still see the effect here. On the other hand, we would not expect similar results if they were contingent on the fact that in Experiment 1 subjects were able to form an expectation for the target color before they were prompted.

As in Experiment 1, we evaluated participants’ fixations at 17 ms before stimuli were presented and during stimuli presentation. The fixation positions indicated that stimuli were indeed presented in the periphery of our participants’ visual fields in nearly all trials (Supplementary Fig. 3). The range of the target eccentricity at the onset of the stimulus is 15.72 - 21.05; the 99% confidence interval of the target eccentricity is (16.40, 19.67). We discarded trials in which participants directed their gaze towards the target.

We evaluated the percent correct in each of the three attention conditions from Experiment 2, and also calculated da and ca to measure the sensitivity and response criterion for each subject. Specifically, we conducted Mauchly’s test and repeated measures ANOVAs to evaluate the effect of attention on the percent correct, da and ca. Overall, attention increased the percentage of correct responses, as participants exhibited the highest performance in the valid attention condition and the lowest performance in the invalid condition (Fig. 5a). The percentage of correct trials was significantly different across attention levels, F(2,50) = 5.53, p = .009. Post hoc paired t tests indicated that the difference in the percentage of correct responses between the valid condition and invalid condition was significant, t(14) = 2.99, p = .01; the difference in the percentage of correct responses between the valid condition and baseline condition was also significant, t(14) = 2.39, p = .031, which suggested that participants performed better when allocating full attention to stimuli, rather than distributing attention across the two locations; the difference in the percentage of correct responses between the valid condition and baseline condition was not significant, t(14) = 1.52, p = .15.

Fig. 5
figure 5

Behavioral results for Experiment 2. a The percentage of correct responses across attention conditions. The percentage of correct responses for valid trials was significantly larger than the percentage correct for baseline and invalid trials. bda and ca across attention conditions. Similar to the effect of attention on percentage of correct responses, da was significantly greater in the valid attention condition compared to the invalid condition. ca did not significantly differ across attention conditions, which indicated that participants used similar internal criteria to make perceptual judgments in all attention conditions. Bars represent means, error bars represent SEM. p < .05, NS: not significant

The average da showed a similar trend compared to the percentage of correct responses: da increased with attention significantly (Fig. 5b), F(1.45,20.28) = 4.54, p = .033 (the degrees of freedom were corrected using Greenhouse–Geisser estimates of sphericity, ε = .72). Post hoc paired t tests indicated that the difference of da between the valid condition and invalid condition was significant, t(14) = 2.46, p = .028, while the difference of da between the baseline condition and invalid condition was not significant, t(14) = 2.04,p = .06; the difference of da between the valid condition and baseline condition was also not significant, t(14) = 0.79, p = .44. As for the response criteria ca, the results showed that ca did not significantly differ across attention levels, F(2,28) = 0.52, p = .6. Importantly, these response criteria were positive, meaning they were much more conservative than those observed in Experiment 1. This finding indicates that our results in Experiment 1 were not due to a sheer confirmation bias at the decisional level (see Discussion).

Discussion

In this study, we investigated how subjects detected colorful stimuli in the unattended periphery in a simulated driving task. In our first experiment, on each trial, we asked observers whether an individual wearing a shirt with a specific color (yellow) had been presented at a specific location. Results showed that observers exhibited higher numbers of false alarms (Supplementary Fig. 4) (i.e., saying a yellow shirt was present, even when it was not) in unattended locations in the visual periphery, compared to locations that were fully or partially attended. This tendency to use liberal detection criteria in unattended or peripheral locations has been shown previously in artificial settings (Rahnev et al., 2011; Solovey et al., 2015). Here, we confirmed the hypothesis that this detection bias also extends to more ecologically valid stimuli and tasks, which can be interpreted as providing further support for the variance reduction model (See Fig. 1). In our second experiment, on each trial, we asked observers whether an individual wearing a shirt with a particular color had been presented at a specific location, but in this experiment, we varied the target color randomly from trial to trial. Results showed that subjects used relatively conservative perceptual criteria (i.e., were relatively reluctant to say a color was present) when making detection-related judgments in this experiment, regardless of the amount of attention that was allocated to a given location.

One may notice that observers showed dramatic differences in the placement of the perceptual criteria between our two experiments. Here, we posit two factors that may account for these differences. First, it is possible that differences in criterion between the two experiments may stem from whether the stimulus dimension is known in advance. Specifically, the variance reduction model (Rahnev et al., 2011; Solovey et al., 2015) assumes a single unified criterion across attention levels, which means that this model works based on an a priori, well-defined stimulus dimension on which the subject can place the criterion to do the detection. If the feature to be detected can only be known after the presentation of stimulus (which was the major change in Experiment 2), one is not able to place the criterion in the same location for both the cued and uncued stimuli over many trials. As such, we would not predict the inflation effect in Experiment 2, and this is exactly what we obtained. Second, it is also possible that the difference in perceptual criteria between two experiments is due to a subjective or metacognitive expectation effect. That is, in Experiment 2, participants knew what to detect only after the stimuli disappeared, so one possible reason subjects used conservative criteria in Experiment 2 is that the task may simply feel more difficult to perform. The conservative criteria obtained in Experiment 2 further confirms that the effect of attention on detection bias in Experiment 1 is unlikely due to a confirmation bias at the decisional level, i.e., a general tendency to answer “yes” to any question, when uncertain.

One important question that remains is what it means when people say they see the target more often. Traditionally, it is thought that much of peripheral vision is ‘filled in’ via top-down mechanisms (Komatsu, 2006). However, it has also been reported that people trust unreliable, filled-in percepts more than percepts based on external input (Ehinger et al., 2017), suggesting that filling-in may not be the complete mechanism to explain peripheral phenomenology, and that decisional or metacognitive mechanisms are also involved. In our study, perhaps the findings of detection bias in the unattended periphery can also be interpreted as congruent with this account involving mechanisms at the decisional or metacognitive level. Importantly, just because the effect is to be thought of at the ‘decisional’ level does not mean that this is unrelated to perceptual phenomenology; criterion effects can also reflect subjective percept (Phillips, 2016; Witt et al., 2015). This interpretation is in line with previous findings that people tend to overestimate their ability to detect changes in change blindness experiments (Levin et al., 2000), and in a sense, people are not fully aware of their decreased capacity for color perception in the periphery (Cohen et al., 2016).

In terms of practical implications, if people tend to confirm what they expect whenever they don’t attend, this raises an important issue regarding driving safety. Many people are optimistic and tend to expect things to be positive (Sharot, 2011, Sharot, Guitart-Masip, Korn, Chowdhury, & Dolan, 2012, Sharot, Riccardi, Raio, & Phelps, 2007), so they may tend to mistakenly detect hazards that present in unattended periphery as no danger, while what is actually happening is they just don’t see the hazards. Similar results have been found in previous studies where drivers showed conservative criteria in hazard detection regardless of driving experience (Ventsislavova et al., 2016; Wallis & Horswill, 2007). Hazard detection is a special detection task, because the penalty for a miss and a false alarm is different: a miss may cause a crash, while a false alarm may only lead to unnecessary brakes; thus, the criterion in hazard detection task is closely related with driving safety. In our research, the task is to detect pedestrian when the vehicle has stopped. However, in real driving, most of the hazards are presented while driving, which is quite different from our task. Future research should more systematically address how inattention affects hazard detection judgment while driving. Specifically, while the impact of distractions (e.g., due to phone calls and text messaging) on driving performance is well known (Haigney et al., 2000; Rumschlag et al., 2015), the impact on specific aspects of the perceptual decision making process remains relatively unexplored.

One limitation of our study is that the attention had a minimal effect on perceptual performance in Experiment 1, despite the overall trend that the performance increased with attention. While at first it may seem that attention was not allocated properly, we note that attention produced significant effects on reaction time in both experiments, as shown in Supplementary Figs. 6 and 7, indicating that attention changed at least one behavioral outcome in the hypothesized manner. Another limitation is that the differences in criteria or perceptual performance between baseline and the other two attention conditions were not always significant. Future research should consider using a larger sample size to increase the power to produce stronger conclusions.

This study used a simulated driving task, which is more ecologically valid than several previous investigations. However, we acknowledge that the experience of our simulated driving is quite different from real driving in some aspects. The experience of the task was similar to playing a video game instead of driving in a driving simulator. Also, in real driving, the drivers are not usually warned about the directions of upcoming objects, like in our task. However, we think this study represents an important step towards using more ecologically valid paradigms in psychophysical investigations, and will motivate future research to increase use of ecologically valid tasks.