Active visual search in naturalistic environments reflects individual differences in classic visual search performance

Botch, Thomas L.; Garcia, Brenda D.; Choi, Yeo Bi; Feffer, Nicholas; Robertson, Caroline E.

doi:10.1038/s41598-023-27896-7

Article
Open access
Published: 12 January 2023

Volume 13, article number 631, (2023)

Scientific Reports

Thomas L. Botch¹,
Brenda D. Garcia¹,
Yeo Bi Choi¹,
Nicholas Feffer^2,3 &
…
Caroline E. Robertson¹

Abstract

Visual search is a ubiquitous activity in real-world environments. Yet, traditionally, visual search is investigated in tightly controlled paradigms, where head-restricted participants locate a minimalistic target in a cluttered array that is presented on a computer screen. Do traditional visual search tasks predict performance in naturalistic settings, where participants actively explore complex, real-world scenes? Here, we leverage advances in virtual reality technology to test the degree to which classic and naturalistic search are limited by a common factor, set size, and the degree to which individual differences in classic search behavior predict naturalistic search behavior in a large sample of individuals (N = 75). In a naturalistic search task, participants looked for an object within their environment via a combination of head-turns and eye-movements using a head-mounted display. Then, in a classic search task, participants searched for a target within a simple array of colored letters using only eye-movements. In each task, we found that participants’ search performance was impacted by increases in set size—the number of items in the visual display. Critically, we observed that participants’ efficiency in classic search tasks—the degree to which set size slowed performance—indeed predicted efficiency in real-world scenes. These results demonstrate that classic, computer-based visual search tasks are excellent models of active, real-world search behavior.

Introduction

Locating an object in a cluttered environment is a ubiquitous visual behavior. The mechanisms by which humans accomplish visual search have been comprehensively studied in traditional computer-based settings using both artificial arrays¹ and complex scene images². Yet, little is known about whether the principles of visual search revealed by these studies extend to active, self-directed exploration in real-world environments, and whether individual performance in both traditional and naturalistic contexts is limited by common factors.

Classic, computer-based studies have identified numerous factors that govern visual search performance³. For example, search is limited by the similarity between the visual features of a target (e.g., color, shape, size) and the array of distractors in which it is embedded⁴. A key component of these studies is the use of minimalistic, simplified stimulus arrays, which allow experimenters to systematically manipulate one factor of interest (e.g., color), while controlling for others (e.g., shape, size), and to measure the impact of this isolated factor on performance. This approach has provided insights into the mechanisms underlying visual search and inspired multiple formal and conceptual models of the behavior^1,5,6. Further, these models underpin frameworks for understanding diverse cognitive processes including attention^4,6,7,8,9, reward^10,11, and decision-making¹².

However, the computer-based approach contains two key drawbacks that limit generalization to real-world search behavior^13,14,15. First, artificial stimuli lack the complex visual statistics and structural cues present within real-world scenes¹⁴. Recent computer-based studies investigating search in complex scene images demonstrate that the structure of the visual environment supplements attentional guidance beyond basic factors probed in paradigms with minimalistic stimuli^16,17 by engaging episodic and semantic memory^18,19 and guiding eye-movements to visual targets^20,21. Second, computer-based approaches engage working memory differently from active, immersive contexts²². During active exploration, working memory operates across multiple spatial reference frames to guide attention²³. Thus, naturalistic paradigms present a valuable opportunity to validate models of human behavior derived in traditional laboratory settings and extend these models to the conditions and demands of everyday life^24,25.

Indeed, decades of research have established many connections between visual search in laboratory settings and in real-world environments^26,27,28,29,30,31. In particular, researchers have characterized visual search performance in multiple professional contexts including radiology^32,33,34, airport security^35,36, and driving³⁰. These studies have revealed numerous features of computer-based visual search that translate to everyday settings. For example, these studies have shown that experienced radiologists are both faster and more accurate at detecting abnormalities in medical images than naïve observers^37,38,39. In the context of airport security, individual differences in search speed and accuracy measured on a computer-based app have been shown to predict target detection at TSA checkpoints⁴⁰. Interestingly, not all aspects of the laboratory are paralleled in real-world environments. For example, because radiologists and airport security officers encounter targets at lower rates in occupational settings, as compared with laboratory paradigms, error rates (misses) are relatively higher and false alarm rates lower in these occupational settings, regardless of expertise^28,38. Together, these studies show important parallels of visual search performance across computer-based and real-world contexts.

Virtual reality (VR) offers complementary opportunities to investigate visual behavior in naturalistic contexts. Similar to real-world settings, head-mounted VR displays allow researchers to study search in active conditions, where working memory can guide search across spatiotopic reference frames^23,41. However, in contrast to real-world settings, VR enables researchers to present diverse sets of stimuli with ease, manipulate specific environmental features of these stimuli, and explore the contributions of these factors to visual search performance. Recent studies have investigated active visual search behavior using head-mounted VR^42,43,44. These studies again highlight the importance of environmental structure (e.g., scene layout, semantics) in shaping active visual search strategies^45,46,47,48. However, these studies have largely employed minimalistic, computer-rendered virtual environments as stimuli, where the experimenter can manipulate scene content and structure to identify regularities that facilitate attentional guidance in active settings. Because such rendered stimuli do not contain real-world visual content, these studies are subject to the first limitation of the classic paradigms described above: they cannot address the degree to which the statistical regularities of real-world scenes impact search in active, naturalistic settings.

Here, we leverage advances in VR technology to study the common factors limiting visual search in classic, computer-based paradigms and immersive scenes with real-world visual content. We specifically focused on one key factor that limits search performance in classic studies, set size: the number of items within a visual array. Increasing set size impairs search performance in both artificial arrays^49,50 and images of complex scenes^21,51,52. However, it remains unclear whether set size effects analogously limit behavioral performance during active exploration of real-world environments, where environmental structure and memory are available to aid attentional guidance^53,54. Further, to our knowledge, whether individual differences in search efficiency in artificial displays predict naturalistic search performance in real-world environments has never been explored.

Thus, our study aimed to answer two questions: (1) does set size limit both classic and naturalistic search, and (2) is search efficiency on classic, computer-based search tasks predictive of active search performance in real-world scenes? Participants (N = 75) completed two tasks: (1) a classic, computer-based conjunctive search paradigm with arrays varying in set size and (2) a naturalistic, VR-based search paradigm with immersive, real-world environments varying in levels of visual clutter⁵⁵. In both tasks, we characterized the impact of set size on visual search performance. We also tested whether participants’ search efficiency was related across the two paradigms (classic and naturalistic).

Methods

Participants

Seventy-five adults participated in two experiments (49 females; mean age 21.55 ± 3.31 SD years). Participants were recruited based on (1) having normal or corrected-to-normal vision and no colorblindness, (2) having no neurological or psychiatric conditions, and (3) having no history of epilepsy. We selected our sample size based on comparable studies^46,47, and no participants were excluded from the analysis. Written consent was obtained in accordance with the Declaration of Helsinki via a protocol approved by the Dartmouth College Ethics Committee for the Protection of Human Subjects (CPHS).

Remote data collection

Participants received a standalone head-mounted display (Oculus Quest 2, www.oculus.com, single fast-switch LCD, 1832 × 1920 px per eye; ~90° field of view; 72 Hz refresh rate) preconfigured with the ManageXR (www.managexr.com) device management software. Experiments were built in Unity version 2018.4.12f1 (www.unity.com) with custom scripts written in C#. Experimental data were collected through a custom data transfer pipeline, written in C# and PHP, that transmitted data from the HMD to lab servers.

Experiment 1: Naturalistic visual search

Exp. 1—Stimuli and set size manipulation

In the naturalistic search experiment, stimuli consisted of 360° “photospheres” of real-world scenes, sourced from an online photo sharing website (www.flickr.com). We curated 54 photospheres with four criteria to minimize the complications of defining set size in real scenes⁵². First, we selected photospheres of indoor scenes, as outdoor scenes contain few segmented regions, which may not be representative of the true set size. Second, we ensured the photospheres did not contain humans to avoid the possibility that humans are a unique object category. Third, we confirmed that each photosphere contained a “singleton” target object: an object that appeared only once inside a given photosphere. Fourth, given the importance of depth to scene processing in early visual areas of the brain⁵⁶, we ensured that all photospheres had comparable depth. To this end, we estimated the depth of each photosphere using the big-to-small (BTS) algorithm⁵⁷.

We adopted the concept of visual clutter as a proxy for set size in real-world scenes^49,55 and approximated the visual clutter of each photosphere using the proto-object segmentation algorithm⁵⁸. Subsequently, we divided the photospheres into three bins (18 photospheres each) based on the estimated clutter measurements (low, medium, and high clutter) and ensured that the average clutter of each bin significantly differed from the others (Fig. 1A). The average depth of photospheres in each bin did not significantly differ between bins (Fig. 1B).
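The binning step can be illustrated with a minimal sketch. It assumes an equal-sized split on sorted clutter scores; the source states only that the 54 photospheres were divided into three bins of 18 based on estimated clutter, so the exact rule, the function name `bin_by_clutter`, and the random scores below are all hypothetical.

```python
import random


def bin_by_clutter(clutter_scores, n_bins=3):
    """Sort scene ids by clutter score and split them into equal-sized bins.

    Returns a list of bins ordered from lowest to highest clutter.
    An equal-sized split is an assumption, not the authors' documented rule.
    """
    if len(clutter_scores) % n_bins != 0:
        raise ValueError("number of scenes must divide evenly into bins")
    ordered = sorted(clutter_scores, key=clutter_scores.get)
    size = len(ordered) // n_bins
    return [ordered[i * size:(i + 1) * size] for i in range(n_bins)]


# Hypothetical clutter values for 54 scenes (placeholders, not real data).
rng = random.Random(0)
scores = {f"scene_{i:02d}": rng.random() for i in range(54)}
low, medium, high = bin_by_clutter(scores)
```

With real clutter estimates in place of the random placeholders, each bin would hold 18 scenes, and the between-bin difference in mean clutter could then be tested as described in the text.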

Target object locations were balanced across photospheres within each clutter bin. For each scene, the yaw of each photosphere was randomly rotated such that the target object was located in one of three quadrants of the immersive environment relative to the participant’s initial facing direction: (1) to the left of the participant, (2) in front of the participant, or (3) to the right of the participant. This resulted in an equal distribution of target object locations relative to the participant across the three possible quadrants (6 photospheres per quadrant), and across the clutter bins (18 photospheres per quadrant).
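As a rough illustration of this balancing, the sketch below assigns the 18 scenes of one clutter bin evenly to the three quadrants and computes a yaw rotation that moves the target into its assigned quadrant. The angular bounds of each quadrant, the helper names, and the scene ids are assumptions made for the example; the source does not specify them.

```python
import random

# Assumed yaw ranges in degrees relative to the initial facing direction.
# The source names the quadrants but does not give their exact bounds.
QUADRANTS = {
    "left": (-135.0, -45.0),
    "front": (-45.0, 45.0),
    "right": (45.0, 135.0),
}


def wrap_deg(angle):
    """Wrap an angle in degrees to the interval [-180, 180)."""
    return (angle + 180.0) % 360.0 - 180.0


def assign_quadrants(scene_ids, rng):
    """Assign an equal number of scenes to each quadrant, in random order."""
    if len(scene_ids) % len(QUADRANTS) != 0:
        raise ValueError("scenes must divide evenly across quadrants")
    labels = list(QUADRANTS) * (len(scene_ids) // len(QUADRANTS))
    rng.shuffle(labels)
    return dict(zip(scene_ids, labels))


def yaw_rotation(target_yaw_deg, quadrant, rng):
    """Return a yaw offset that places the target inside the given quadrant."""
    lo, hi = QUADRANTS[quadrant]
    desired = rng.uniform(lo, hi)
    return wrap_deg(desired - target_yaw_deg)


rng = random.Random(1)
scenes = [f"low_clutter_{i:02d}" for i in range(18)]  # hypothetical ids
assignment = assign_quadrants(scenes, rng)
```

Repeating the assignment for each of the three clutter bins gives 6 scenes per quadrant within a bin and 18 per quadrant overall, matching the counts reported above.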

Exp. 1—Paradigm

On each trial of the naturalistic visual search experiment (54 trials), participants were presented with a photosphere via the head-mounted display (HMD) for a maximum of 30 s, or until the controller trigger was pressed indicating detection of the target object (Fig. 2A; Supplemental Video S1). In all scenes, an occluding wall obstructed the 90° immediately behind the participant such that the 270° in front of the participant was visible. Accordingly, participants were informed that the area behind them would not be visible and instructed to explore the forward, left, and right portions of the photosphere. To mitigate confusion during the real-world visual search task, we informed participants that the target object would always be present inside the virtual environment.

Before each trial, participants were presented with a pre-trial fixation target at screen-center to ensure participants entered each photosphere facing the same direction. Participants were required to align their head-center with the target for 3 s. Subsequently, participants were presented with a conjunctive word cue (e.g., green bottle) describing the target object in the following photosphere. Participants were instructed to “find the target as quickly as possible”. To report the target, participants centered their head on the target (specifically, they centered a light gray circle, which was locked to screen-center, on the target) and pressed the controller trigger. A response was considered correct if the participant’s head coordinate was within a 7.5° visual angle radius from target center when the trigger was pressed, and reaction time was calculated as the time of the trigger press relative to trial start. After pressing the trigger, participants were given feedback on the accuracy of their response. The gray, head-locked circle would turn green if the participant selected the correct object and would turn red if the participant selected an incorrect object. After each trial, participants were returned to a virtual home environment where they were informed of their reaction time and instructed to take a break. A mandatory break occurred after each quarter of the experiment (14 trials) to allow participants to rest their eyes.
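The correctness criterion amounts to a great-circle distance check between the head direction and the target center. The following is a minimal sketch of that check, assuming both are expressed as (yaw, pitch) pairs in degrees; the coordinate convention and function names are illustrative and not taken from the authors' Unity code.

```python
import math


def direction(yaw_deg, pitch_deg):
    """Convert yaw/pitch in degrees to a unit vector (x right, y up, z forward)."""
    yaw, pitch = math.radians(yaw_deg), math.radians(pitch_deg)
    return (
        math.cos(pitch) * math.sin(yaw),
        math.sin(pitch),
        math.cos(pitch) * math.cos(yaw),
    )


def angular_distance_deg(a, b):
    """Angle in degrees between two (yaw, pitch) directions."""
    dot = sum(x * y for x, y in zip(direction(*a), direction(*b)))
    # Clamp to guard against floating-point values slightly outside [-1, 1].
    return math.degrees(math.acos(max(-1.0, min(1.0, dot))))


def is_correct(head, target, radius_deg=7.5):
    """True if the head direction lies within radius_deg of the target center."""
    return angular_distance_deg(head, target) <= radius_deg
```

For example, a head direction of (10, 0) and a target at (15, 0) are 5° apart and would count as correct, whereas a target at (20, 0) would not.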

At the start of the study, participants were shown a set of instructions orienting them to the task. Following the instructions, participants completed two practice trials to ensure familiarity with the task. Participants were highly accurate during practice trials (mean accuracy: 84%), indicating comprehension of the task.

Experiment 2: Classic visual search

Exp 2—Stimuli and set size manipulation

In the classic visual search experiment, stimuli consisted of letter arrays, which were presented on a gray background around a central fixation point (Fig. 2B). The letters in the array had two feature dimensions: form (Ts and Ls) and color (red and black). Arrays spanned 25° × 25° visual angle, and letters within the array were randomly distributed around a central fixation point and spaced from others by 2° visual angle. Displays had three potential set size conditions: 5, 15, or 25 letters.
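One simple way to generate such an array is rejection sampling, sketched below. It interprets the 2° spacing as a minimum center-to-center separation, which is an assumption, and the function name and parameters are hypothetical rather than the authors' implementation.

```python
import math
import random


def sample_positions(n_items, extent_deg=25.0, min_sep_deg=2.0,
                     rng=None, max_tries=10000):
    """Draw n_items (x, y) positions in degrees, centered on fixation.

    Candidates are drawn uniformly in an extent_deg x extent_deg square and
    rejected if they fall closer than min_sep_deg to an accepted position.
    """
    rng = rng or random.Random()
    half = extent_deg / 2.0
    positions = []
    tries = 0
    while len(positions) < n_items:
        tries += 1
        if tries > max_tries:
            raise RuntimeError("could not place all items; relax constraints")
        x, y = rng.uniform(-half, half), rng.uniform(-half, half)
        if all(math.hypot(x - px, y - py) >= min_sep_deg
               for px, py in positions):
            positions.append((x, y))
    return positions


# One hypothetical array per set size condition.
arrays = {n: sample_positions(n, rng=random.Random(n)) for n in (5, 15, 25)}
```

At these densities (at most 25 items in a 25° × 25° region) rejections are rare, so the retry cap is only a safeguard.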

Exp 2—Paradigm

On each trial of the classic conjunctive search task (180 trials), participants were instructed to report the presence/absence of a target letter (a red T) using a keypad. Note, the target letter shared a feature dimension with each type of distractor (black Ts and red Ls). There were two trial types, target present or target absent, which each occurred 50% of the time. On trials without a conjunction target, an additional distractor was added at random.
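The trial structure can be sketched as follows. The source specifies 180 trials, three set sizes, and an even split of target-present and target-absent trials; the equal allocation of 30 trials per set size and trial type, and the random choice between the two distractor types, are assumptions made for illustration.

```python
import random

TARGET = ("T", "red")
DISTRACTORS = [("T", "black"), ("L", "red")]  # each shares one feature with the target


def make_items(set_size, target_present, rng):
    """Build the (form, color) items for one array.

    On target-absent trials an extra distractor takes the target's place,
    so the total number of items always equals set_size.
    """
    n_distractors = set_size - 1 if target_present else set_size
    items = [rng.choice(DISTRACTORS) for _ in range(n_distractors)]
    if target_present:
        items.append(TARGET)
    rng.shuffle(items)
    return items


def make_trials(set_sizes=(5, 15, 25), reps=30, rng=None):
    """Return a shuffled trial list: reps trials per set size and trial type."""
    rng = rng or random.Random()
    trials = [
        {"set_size": s, "target_present": p, "items": make_items(s, p, rng)}
        for s in set_sizes
        for p in (True, False)
        for _ in range(reps)
    ]
    rng.shuffle(trials)
    return trials


trials = make_trials(rng=random.Random(0))
```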

Each trial lasted for a maximum of 10 s or until a keypress. Before each trial, participants were shown a black fixation cross and required to press a button to start the trial. Participants were instructed to fixate on the cross until trial start, after which point they were free to move their eyes. Participants were instructed to “find the target as quickly as possible” and to “press 4 if the target is present or 6 if the target is absent”. Participant reaction time was calculated as the time of the button press relative to trial start. Following each trial, participants were given feedback on the accuracy of their response (a green check for correct responses and a red X for incorrect responses). A mandatory break occurred every 45 trials to allow participants to rest their eyes.

At the start of the study, participants were shown a set of instructions orienting them to the task. Following the instructions, participants completed a set of practice trials (12 trials) to ensure familiarity with the task. Participants were highly accurate during practice trials (mean accuracy: 91%), indicating comprehension of the task.

Statistical analyses

For all statistical tests, an alpha level of 0.05 was used to assess significance, tests were two-tailed, and we applied Bonferroni correction for multiple comparisons where appropriate. All analyses were conducted in the R statistical programming environment⁵⁹. Effect sizes were calculated using the effectsize package⁶⁰. For each task (naturalistic/classic), we built a linear mixed-effects model to evaluate the predictivity of condition (low, medium, or high set size) on reaction time (RT) using the lme4 package⁶¹. In each model, we included a fixed effect of condition. Additionally, we included a within-subject random effect of condition to account for individual variation in (a) baseline reaction times (random intercepts) and (b) individual efficiency (random slopes). Thus, we were able to separately estimate group-level and subject-level effects of the impact of condition on RT.
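One way to write the model described above is given below. This is a sketch under the assumption that condition is coded as a numeric predictor c (for example 0, 1, 2 for low, medium, high); the source does not state the coding, and all symbols are introduced here for illustration. Here i indexes participants and j indexes trials.

```latex
\begin{aligned}
\mathrm{RT}_{ij} &= (\beta_0 + u_{0i}) + (\beta_1 + u_{1i})\, c_{ij} + \varepsilon_{ij}, \\
(u_{0i},\, u_{1i})^{\top} &\sim \mathcal{N}(\mathbf{0},\, \Sigma), \qquad
\varepsilon_{ij} \sim \mathcal{N}(0,\, \sigma^{2})
\end{aligned}
```

Under this reading, β₀ and β₁ are the group-level intercept and slope, u₀ᵢ captures a participant's baseline RT, and u₁ᵢ captures that participant's deviation from the group-level set size effect. In lme4 formula syntax, a model of this shape would typically be written as `RT ~ condition + (condition | subject)`, though the authors' exact specification is not given in the text.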

Results

To investigate whether classic findings of visual search extend to naturalistic settings, we developed a novel paradigm in which participants searched for real-world objects inside of 360° real-world scenes. For each visual search task (naturalistic/classic), we evaluated the extent to which condition (low, medium, or high clutter/set size) predicts reaction times (RT) using a linear mixed-effects model. We hypothesized that greater set sizes would result in slower RTs in each task, and that individual estimates of this effect of set size on RT (search efficiency) would correlate across tasks (naturalistic / classic).

Naturalistic visual search performance

We first examined the relationship between visual clutter levels and search performance inside immersive, real-world scenes. As predicted, we found that participants were faster and more accurate at locating the target in less-cluttered as compared with more-cluttered scenes. Combining data across our participants, we found a significant correlation between clutter level and reaction times to correctly detect a target (r_s = 0.595, p < 0.001). This correlation was significant in all three sections of the environment: left, front, and right of the participant (left frame: r_s = 0.62, p < 0.001; front frame: r_s = 0.74, p < 0.001; right frame: r_s = 0.51, p = 0.032). Importantly, a one-way ANOVA on the fixed effect of clutter revealed a significant main effect on reaction times across participants (Fig. 3A; F_(2,368.76) = 187.42, p < 0.001, η_p² = 0.5). An additional one-way ANOVA demonstrated a main effect of condition on individual participant false alarm rate (F_(2,222) = 63.1, p < 0.001, η_p² = 0.36). Overall, these results suggest that visual clutter modulates visual search performance inside real-world scenes.

Classic visual search performance

We next evaluated the relationship between set size and search performance in a classic visual search paradigm. For target present and target absent trials, we used separate linear mixed-effects models to evaluate the fixed effect of set size on RT while accounting for the random effect of subject. A one-way ANOVA conducted on the fixed effect of set size revealed a significant main effect of set size on RT across participants for both target present (Fig. 3B; F_(2,116.08) = 463.04, p < 0.001, η_p² = 0.89) and target absent trials (Fig. 3C; F_(2,105.09) = 309.66, p < 0.001, η_p² = 0.85). A separate one-way ANOVA demonstrated a main effect of set size on individual participant false alarm rates (F_(2,222) = 6.40, p = 0.002, η² = 0.05). In sum, these results dovetail with previous findings of classic visual search paradigms that demonstrate the impact of set size on visual search performance⁴.

Reliability of search efficiency

Before examining the relationship between performance on the two experimental paradigms, we established the reliability of search efficiency: the impact of set size on a participant’s RT. For each task, we split each participant’s RTs in half within each level of set size. We next fit a linear mixed-effects model for each half-split of RT to estimate search efficiency, the random slope of condition for each participant. We calculated reliability (ρ*) as the Pearson correlation between search efficiency in one half and the other, corrected with the Spearman-Brown prediction formula to estimate the full-length task reliability. We found low reliability for naturalistic search efficiency (ρ* = 0.293) and high reliability for classic search efficiency on both target present (ρ* = 0.947) and target absent trials (ρ* = 0.947).
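The Spearman-Brown correction for a split into two halves is ρ* = 2r / (1 + r), where r is the correlation between the half-test estimates. A minimal sketch of the calculation follows; the function names are illustrative, and the inputs would be the per-participant efficiency estimates from each half.

```python
import math


def pearson_r(x, y):
    """Pearson correlation between two equal-length sequences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mx, my = sum(x) / len(x), sum(y) / len(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for constant input")
    return sxy / math.sqrt(sxx * syy)


def spearman_brown(r_half):
    """Predict full-length reliability from a half-test correlation."""
    return 2.0 * r_half / (1.0 + r_half)


def split_half_reliability(half_a, half_b):
    """Spearman-Brown corrected correlation between two half-test estimates."""
    return spearman_brown(pearson_r(half_a, half_b))
```

Inverting the formula, the reported ρ* = 0.293 for naturalistic search corresponds to an uncorrected half-test correlation of roughly 0.17.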

Relating performance on naturalistic and classic visual search tasks

Having established the reliability of search efficiency within each task, we next investigated the relationship of search performance between the two tasks. For each task, we used a linear mixed-effects model to derive search efficiency: the random slope of condition fit to each participant’s RT. Importantly, we accounted for variability of RT within each task by modelling random intercepts for each participant.

We found a significant relationship between search efficiency on the naturalistic search task and on target present trials of the classic visual search task (Fig. 4A: r_s = 0.36, p = 0.002). However, the relationship between naturalistic and classic visual search was attenuated on target absent trials (Fig. 4B: r_s = 0.14, p = 0.23). We next compared individual efficiency in each quadrant (left, front, right) of the naturalistic visual search task with each trial type of the classic visual search task. Interestingly, efficiency in the front quadrant of the naturalistic visual search task was significantly related to efficiency on both target present and target absent trials (present: r_s = 0.27, p = 0.02; absent: r_s = 0.29, p = 0.012). While we also observed a significant relationship between efficiency in the right quadrant and target present trials (r_s = 0.28, p = 0.012), this relationship did not hold when considering target absent trials (r_s = 0.02, p = 0.85). Furthermore, we found no relationship between efficiency in the left quadrant and either classic search trial type (present: r_s = 0.15, p = 0.19; absent: r_s = 0.09, p = 0.46). Together, these results suggest that efficiency on a classic visual search task, indexed by a set size manipulation, predicts efficiency in naturalistic visual search, indexed by a clutter manipulation in complex, visual scenes.
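Assuming r_s denotes Spearman's rank correlation, the cross-task comparison reduces to ranking each participant's efficiency estimate within each task and correlating the ranks. A self-contained sketch is shown below; the function names are illustrative, and it does not compute p-values.

```python
import math


def average_ranks(values):
    """Rank values from 1..n, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2.0 + 1.0
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def spearman_rs(x, y):
    """Spearman rank correlation: Pearson correlation of the average ranks."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    rx, ry = average_ranks(x), average_ranks(y)
    mx, my = sum(rx) / len(rx), sum(ry) / len(ry)
    sxy = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    sxx = sum((a - mx) ** 2 for a in rx)
    syy = sum((b - my) ** 2 for b in ry)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for constant input")
    return sxy / math.sqrt(sxx * syy)
```

In use, `x` would hold each participant's naturalistic search efficiency and `y` the same participants' classic search efficiency, in matching order.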

Discussion

We find that visual search in immersive, real-world environments bears remarkable similarities to classic search in two important senses. First, classic and naturalistic search performance are both limited by set size: just as classic search efficiency is limited by the number of distractors in the visual display, naturalistic search efficiency is limited by a real-world analogue of set size, visual clutter. Second, individual differences in search efficiency are related in both tasks: participants with steeper costs of set size in artificial arrays of letters were more severely impacted by visual clutter in real-world environments. Together, these findings suggest that classic search is an excellent model of search efficiency within real-world environments.

Relating individual performance between computer-based and naturalistic settings is central to identifying the cognitive factors and task strategies that facilitate visual search^3,62. Differences in visual search performance have been demonstrated across development and healthy aging^63,64, clinical diagnoses^65,66, and expertise⁶⁷. Further, the task of visual search is highly relevant to performance in various professional settings (e.g., radiology, airport security)^34,35,36,37,68. For instance, previous research has shown a relationship between airport security officers’ search performance on a computer-based app and their detection of violations at an airport checkpoint, where people with faster and more accurate search within the app were better at detecting prohibited items at TSA checkpoints⁴⁰. While studies within professional settings begin to establish connections between computer-based paradigms and naturalistic experience, both the examined populations (e.g., experts) and sampled contexts (e.g., TSA checkpoints) limit the generalizability of these results to diverse, real-world environments.

Advances in virtual reality (VR) technology present a promising avenue to investigate visual behavior within naturalistic stimuli and contexts while simultaneously maintaining experimental control^42,43,44. First, VR enables researchers to exact similar rigor as in computer-based studies (e.g., trial length) without placing physical limitations (e.g., head-restraint) on the complex repertoire of participants’ naturalistic behavior. Second, researchers can leverage VR to efficiently investigate behavior across a wide range of diverse settings (e.g., beaches, parks, libraries), likely increasing the real-world generalizability of findings. Taken together, the use of VR empowers researchers to construct more representative models of naturalistic experience. Accordingly, an increasing number of studies employ VR headsets to investigate visual functions, providing essential connections between computer-based findings and naturalistic behavior. Yet, few studies have sought to relate models of visual functions, such as visual search, that are derived from behavior measured in traditional, computer-based paradigms to analogous behavior measured in real-world settings.

Recent studies investigating visual search using head-mounted displays highlight, in particular, that active behavior recruits memory to aid search performance in naturalistic settings. Active exploration of virtual environments prior to search has been shown to improve search performance by engaging spatial memory^46,69, a benefit not seen for explicit memorization⁷⁰. For example, one study demonstrates that spatial memory aids search by restricting attention to relevant areas of the scene⁴⁷: when the location of a target object was changed from a learned location, participants continued to initially fixate on the learned location. Further, interaction with objects in virtual environments bolsters memory of target object locations: participants are faster to locate objects they arranged within a room compared with objects arranged by others⁴⁵. By utilizing VR to investigate visual search, these studies reveal the contributions of action in and interaction with virtual environments on search performance. Our results extend prior research on active search by generalizing the well-known set size effect to a diverse set of real-world scenes, and by demonstrating a predictive relationship between an individual’s search efficiency in artificial and naturalistic contexts.

Certainly, our experimental paradigm has shortcomings. First, in contrast to many studies of visual search in which eye-tracking measures are employed, we were only able to use a combination of head-tracking data and keypress reaction times. This method is undoubtedly noisier than measuring eye-tracking reaction times in each task. However, given the close coupling of head and eye movements⁷¹ and the presence of set size effects within both paradigms, we do not believe a different measurement would drastically alter our results. Second, while the classic search paradigm demonstrated high split-half reliability, the naturalistic search paradigm exhibited relatively low split-half reliability. Despite this low reliability, our results show a relationship between an individual’s search efficiency on the two visual search tasks. We hypothesize that the magnitude of this relationship would increase with more naturalistic search trials. Future studies are needed to test this hypothesis, as well as to understand behavioral changes across a continuum of stimulus naturalism moving from well-controlled psychophysical displays to naturalistic settings.

In sum, we find that set size analogously limits visual search performance in both classic, computer-based visual search and immersive, real-world scenes. Further, individual search efficiency on a classic search task predicts search efficiency in a naturalistic search task. These findings suggest that individual search performance is limited by common properties in artificial and naturalistic contexts and have important implications for relating models of vision to real-world behavior.

Data availability

Requests for materials should be directed to T.L.B.

References

Treisman, A. M. & Gelade, G. A feature-integration theory of attention. Cognit. Psychol. 12, 97–136 (1980).
Article CAS Google Scholar
Wolfe, J. M., Võ, M.L.-H., Evans, K. K. & Greene, M. R. Visual search in scenes involves selective and nonselective pathways. Trends Cogn. Sci. 15, 77–84 (2011).
Article Google Scholar
Wolfe, J. M. Visual search: How do we find what we are looking for?. Annu. Rev. Vis. Sci. 6, 539–562 (2020).
Article Google Scholar
Wolfe, J. M. & Horowitz, T. S. Five factors that guide attention in visual search. Nat. Hum. Behav. 1, 0058 (2017).
Article Google Scholar
Wolfe, J. M. Guided search 2.0 a revised model of visual search. Psychon. Bull. Rev. 1, 202–238 (1994).
Article CAS Google Scholar
Geisler, W. S. & Cormack, L. K. Models of Overt Attention (Oxford University Press, 2011). https://doi.org/10.1093/oxfordhb/9780199539789.013.0024.
Book Google Scholar
Eckstein, M. P. Visual search: A retrospective. J. Vis. 11, 14–14 (2011).
Article Google Scholar
Lindsay, G. W. Attention in psychology, neuroscience, and machine learning. Front. Comput. Neurosci. 14, 29 (2020).
Article Google Scholar
Peelen, M. V. & Kastner, S. Attention in the real world: Toward understanding its neural basis. Trends Cogn. Sci. 18, 242–250 (2014).
Article Google Scholar
Anderson, B. A. Social reward shapes attentional biases. Cogn. Neurosci. 7, 30–36 (2016).
Article Google Scholar
Maunsell, J. H. R. Neuronal representations of cognitive state: Reward or attention?. Trends Cogn. Sci. 8, 261–265 (2004).
Article Google Scholar
Rust, N. C. & Cohen, M. R. Priority coding in the visual system. Nat. Rev. Neurosci. 23, 376–388 (2022).
Article CAS Google Scholar
Henderson, J. Human gaze control during real-world scene perception. Trends Cogn. Sci. 7, 498–504 (2003).
Article Google Scholar
Tatler, B. W., Hayhoe, M. M., Land, M. F. & Ballard, D. H. Eye guidance in natural vision: Reinterpreting salience. J. Vis. 11, 5–5 (2011).
Article Google Scholar
Hayhoe, M. M. Vision and Action. 27 (2017).
Biederman, I., Glass, A. L. & Stacy, E. W. Searching for objects in real-world scenes. J. Exp. Psychol. 97, 22–27 (1973).
Potter, M. C. Meaning in visual search. Science 187, 965–966 (1975).
Võ, M.L.-H. & Wolfe, J. M. When does repeated search in scenes involve memory? Looking at versus looking for objects in scenes. J. Exp. Psychol. Hum. Percept. Perform. 38, 23–41 (2012).
Võ, M.L.-H. & Wolfe, J. M. The interplay of episodic and semantic memory in guiding repeated search in scenes. Cognition 126, 198–212 (2013).
Castelhano, M. S. & Heaven, C. The relative contribution of scene context and target features to visual search in scenes. Atten. Percept. Psychophys. 72, 1283–1297 (2010).
Castelhano, M. S. & Henderson, J. M. Initial scene representations facilitate eye movement guidance in visual search. J. Exp. Psychol. Hum. Percept. Perform. 33, 753–763 (2007).
Draschkow, D., Kallmayer, M. & Nobre, A. C. When natural behavior engages working memory. Curr. Biol. 31, 869-874.e5 (2021).
Draschkow, D., Nobre, A. C. & van Ede, F. Multiple spatial frames for immersive working memory. Nat. Hum. Behav. 6, 536–544 (2022).
Felsen, G. & Dan, Y. A natural approach to studying vision. Nat. Neurosci. 8, 1643–1646 (2005).
Leopold, D. A. & Park, S. H. Studying the visual brain in its natural rhythm. Neuroimage 216, 116790 (2020).
Wolfe, J. M. Rethinking the basic-applied dichotomy. Cogn. Res. Princ. Implic. 1, 1 (2016).
Brunyé, T. T., Drew, T., Weaver, D. L. & Elmore, J. G. A review of eye tracking for understanding and improving diagnostic interpretation. Cogn. Res. Princ. Implic. 4, 7 (2019).
Wolfe, J. M. Use-inspired basic research in medical image perception. Cogn. Res. Princ. Implic. 1, 17 (2016).
Clark, K., Cain, M. S., Adamo, S. H. & Mitroff, S. R. Overcoming hurdles in translating visual search research between the lab and the field. In The Influence of Attention, Learning, and Motivation on Visual Search (eds Dodd, M. D. & Flowers, J. H.) 147–181 (Springer, New York, 2012). https://doi.org/10.1007/978-1-4614-4794-8_7.
Blacker, K. J., Peltier, C., McKinley, R. A. & Biggs, A. T. What versus how in visual search: Effects of object recognition training, strategy training, and non-invasive brain stimulation on satellite image search. J. Cogn. Enhanc. 4, 131–144 (2020).
See, J. E., Drury, C. G., Speed, A., Williams, A. & Khalandi, N. The role of visual inspection in the 21st century. Proc. Hum. Factors Ergon. Soc. Annu. Meet. 61, 262–266 (2017).
Drew, T., Williams, L. H., Aldred, B., Heilbrun, M. E. & Minoshima, S. Quantifying the costs of interruption during diagnostic radiology interpretation using mobile eye-tracking glasses. J. Med. Imaging 5, 1 (2018).
van der Gijp, A. et al. How visual search relates to visual diagnostic performance: A narrative systematic review of eye-tracking research in radiology. Adv. Health Sci. Educ. 22, 765–787 (2017).
Adamo, S. H., Ericson, J. M., Nah, J. C., Brem, R. & Mitroff, S. R. Mammography to tomosynthesis: Examining the differences between two-dimensional and segmented-three-dimensional visual search. Cogn. Res. Princ. Implic. 3, 17 (2018).
Biggs, A. T., Cain, M. S., Clark, K., Darling, E. F. & Mitroff, S. R. Assessing visual search performance differences between Transportation Security Administration Officers and nonprofessional visual searchers. Vis. Cogn. 21, 330–352 (2013).
Mendes, M., Schwaninger, A. & Michel, S. Can laptops be left inside passenger bags if motion imaging is used in X-ray security screening? Front. Hum. Neurosci. 7, 654 (2013).
Clancy Dollinger, S. M. Individual differences in visual search performance among medical technologists. Personal. Individ. Differ. 17, 297–300 (1994).
Evans, K. K., Georgian-Smith, D., Tambouret, R., Birdwell, R. L. & Wolfe, J. M. The gist of the abnormal: Above-chance medical decision making in the blink of an eye. Psychon. Bull. Rev. 20, 1170–1175 (2013).
Williams, L. H. & Drew, T. What do we know about volumetric medical image interpretation?: A review of the basic science and medical image perception literatures. Cogn. Res. Princ. Implic. 4, 21 (2019).
Mitroff, S. R., Ericson, J. M. & Sharpe, B. Predicting airport screening officers’ visual search competency with a rapid assessment. Hum. Fact. 60, 201–211 (2018).
Haskins, A. J., Mentch, J., Botch, T. L. & Robertson, C. E. Active vision in immersive, 360° real-world environments. Sci. Rep. 10, 14304 (2020).
Doucet, G., Gulli, R. A. & Martinez-Trujillo, J. C. Cross-species 3D virtual reality toolbox for visual and cognitive experiments. J. Neurosci. Methods 266, 84–93 (2016).
Scarfe, P. & Glennerster, A. Using high-fidelity virtual reality to study perception in freely moving observers. J. Vis. 15, 3 (2015).
Draschkow, D. Remote virtual reality as a tool for increasing external validity. Nat. Rev. Psychol. 1, 433–434 (2022).
Draschkow, D. & Võ, M.L.-H. Scene grammar shapes the way we interact with objects, strengthens memories, and speeds search. Sci. Rep. 7, 16471 (2017).
Beitner, J., Helbing, J., Draschkow, D. & Võ, M.L.-H. Get your guidance going: Investigating the activation of spatial priors for efficient search in virtual reality. Brain Sci. 11, 44 (2021).
Li, C.-L., Aivar, M. P., Tong, M. H. & Hayhoe, M. M. Memory shapes visual search strategies in large-scale environments. Sci. Rep. 8, 4324 (2018).
Marek, N. & Pollmann, S. Contextual-cueing beyond the initial field of view—A virtual reality experiment. Brain Sci. 10, 446 (2020).
Neider, M. B. & Zelinsky, G. J. Exploring set size effects in scenes: Identifying the objects of search. Vis. Cogn. 16, 1–10 (2008).
Palmer, J. Set-size effects in visual search: The effect of attention is independent of the stimulus for simple tasks. Vis. Res. 34, 1703–1721 (1994).
Henderson, J. M., Chanceaux, M. & Smith, T. J. The influence of clutter on real-world scene search: Evidence from search efficiency and eye movements. J. Vis. 9, 32–32 (2009).
Wolfe, J. M., Alvarez, G. A., Rosenholtz, R., Kuzmova, Y. I. & Sherman, A. M. Visual search for arbitrary objects in real scenes. Atten. Percept. Psychophys. 73, 1650–1671 (2011).
Bar, M. Visual objects in context. Nat. Rev. Neurosci. 5, 617–629 (2004).
Võ, M.L.-H., Boettcher, S. E. & Draschkow, D. Reading scenes: How scene grammar guides attention and aids perception in real-world environments. Curr. Opin. Psychol. 29, 205–210 (2019).
Rosenholtz, R., Li, Y. & Nakano, L. Measuring visual clutter. J. Vis. 7, 17 (2007).
Kravitz, D. J., Peng, C. S. & Baker, C. I. Real-world scene representations in high-level visual cortex: It’s the spaces more than the places. J. Neurosci. 31, 7322–7333 (2011).
Lee, J. H., Han, M.-K., Ko, D. W. & Suh, I. H. From big to small: Multi-scale local planar guidance for monocular depth estimation. arXiv:1907.10326 [cs] (2020).
Yu, C.-P., Samaras, D. & Zelinsky, G. J. Modeling visual clutter perception using proto-object segmentation. J. Vis. 14, 4–4 (2014).
R Core Team. R: A language and environment for statistical computing (2013).
Ben-Shachar, M., Lüdecke, D. & Makowski, D. Effectsize: Estimation of effect size indices and standardized parameters. J. Open Source Softw. 5, 2815 (2020).
Bates, D., Mächler, M., Bolker, B. & Walker, S. Fitting linear mixed-effects models using lme4. J. Stat. Softw. 67 (2015).
Biggs, A. T., Kramer, M. R. & Mitroff, S. R. Using cognitive psychology research to inform professional visual search operations. J. Appl. Res. Mem. Cogn. 7, 189–198 (2018).
Hommel, B., Li, K. Z. H. & Li, S.-C. Visual search across the life span. Dev. Psychol. 40, 545–558 (2004).
Woods, A. J. et al. The development of organized visual search. Acta Psychol. (Amst.) 143, 191–199 (2013).
Plaisted, K., O’Riordan, M. & Baron-Cohen, S. Enhanced visual search for a conjunctive target in autism: A research note. J. Child Psychol. Psychiatry 39, 777–783 (1998).
O’Riordan, M. A., Plaisted, K. C., Driver, J. & Baron-Cohen, S. Superior visual search in autism. J. Exp. Psychol. Hum. Percept. Perform. 27, 719–730 (2001).
Abernethy, B. & Russell, D. G. The relationship between expertise and visual search strategy in a racquet sport. Hum. Mov. Sci. 6, 283–319 (1987).
Lanagan-Leitzel, L. K., Skow, E. & Moore, C. M. Great expectations: Perceptual challenges of visual surveillance in lifeguarding. Appl. Cogn. Psychol. 29, 425–435 (2015).
Li, C.-L., Aivar, M. P., Kit, D. M., Tong, M. H. & Hayhoe, M. M. Memory and visual search in naturalistic 2D and 3D environments. J. Vis. 16, 9 (2016).
Helbing, J., Draschkow, D. & Võ, M.L.-H. Search superiority: Goal-directed attentional allocation creates more reliable incidental identity and location memory than explicit encoding in naturalistic virtual environments. Cognition 196, 104147 (2020).
Freedman, E. G. Coordination of the eyes and head during visual orienting. Exp. Brain Res. 190, 369–387 (2008).

Acknowledgements

This work was supported by a grant from The Neukom Institute for Computational Science at Dartmouth College to C.E.R. Thanks to Ruth Rosenholtz for helpful feedback on experimental design. Thanks to Adam Steel for insightful comments on the manuscript. Thanks to A.J. Haskins, Anna Mynick, and Allie Burrows for helpful discussion.

Author information

Authors and Affiliations

Department of Psychological and Brain Sciences, Dartmouth College, Hanover, NH, 03755, USA
Thomas L. Botch, Brenda D. Garcia, Yeo Bi Choi & Caroline E. Robertson
Department of Computer Science, Dartmouth College, Hanover, NH, 03755, USA
Nicholas Feffer
Department of Computer Science, Stanford University, Stanford, CA, 94305, USA
Nicholas Feffer

Contributions

T.L.B. and C.E.R. designed research; T.L.B. and N.F. developed experimental paradigms; T.L.B., B.D.G., and Y.B.C. collected data; T.L.B. and C.E.R. analyzed data; T.L.B. and C.E.R. wrote the paper.

Corresponding author

Correspondence to Thomas L. Botch.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Botch, T.L., Garcia, B.D., Choi, Y.B. et al. Active visual search in naturalistic environments reflects individual differences in classic visual search performance. Sci Rep 13, 631 (2023). https://doi.org/10.1038/s41598-023-27896-7

Received: 31 October 2022
Accepted: 10 January 2023
Published: 12 January 2023
DOI: https://doi.org/10.1038/s41598-023-27896-7
Springer Nature Limited
