The Multiple Object Avoidance (MOA) task measures attention for action: Evidence from driving and sport

Mackenzie, Andrew K.; Vernon, Mike L.; Cox, Paul R.; Crundall, David; Daly, Rosie C.; Guest, Duncan; Muhl-Richardson, Alexander; Howard, Christina J.

doi:10.3758/s13428-021-01679-2

The Multiple Object Avoidance (MOA) task measures attention for action: Evidence from driving and sport

Open access
Published: 16 November 2021

Volume 54, pages 1508–1529, (2022)
Cite this article

Download PDF

You have full access to this open access article

Behavior Research Methods Aims and scope Submit manuscript

The Multiple Object Avoidance (MOA) task measures attention for action: Evidence from driving and sport

Download PDF

Andrew K. Mackenzie ORCID: orcid.org/0000-0002-6818-2838¹,
Mike L. Vernon¹,
Paul R. Cox²,
David Crundall¹,
Rosie C. Daly¹,
Duncan Guest¹,
Alexander Muhl-Richardson³ &
…
Christina J. Howard¹

3312 Accesses
8 Citations
12 Altmetric
Explore all metrics

Abstract

Performance in everyday tasks, such as driving and sport, requires allocation of attention to task-relevant information and the ability to inhibit task-irrelevant information. Yet there are individual differences in this attentional function ability. This research investigates a novel task for measuring attention for action, called the Multiple Object Avoidance task (MOA), in its relation to the everyday tasks of driving and sport. The aim in Study 1 was to explore the efficacy of the MOA task to predict simulated driving behaviour and hazard perception. Whilst also investigating its test–retest reliability and how it correlates to self-report driving measures. We found that superior performance in the MOA task predicted simulated driving performance in complex environments and was superior at predicting performance compared to the Useful Field of View task. We found a moderate test–retest reliability and a correlation between the attentional lapses subscale of the Driving Behaviour Questionnaire. Study 2 investigated the discriminative power of the MOA in sport by exploring performance differences in those that do and do not play sports. We also investigated if the MOA shared attentional elements with other measures of visual attention commonly attributed to sporting expertise: Multiple Object Tracking (MOT) and cognitive processing speed. We found that those that played sports exhibited superior MOA performance and found a positive relationship between MOA performance and Multiple Object Tracking performance and cognitive processing speed. Collectively, this research highlights the utility of the MOA when investigating visual attention in everyday contexts.

Can Three-Dimensional Multiple Object Tracking Training Be Used to Improve Simulated Driving Performance? A Pilot Study in Young and Older Adults

Article Open access 24 April 2023

Studying visual attention using the multiple object tracking paradigm: A tutorial review

Article 05 June 2017

Video Games

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

General introduction

Performance in everyday tasks, such as driving and sport, requires appropriate allocation of attention to task-relevant information and the ability to inhibit task-irrelevant information. Yet the ability of this attentional control varies across individuals. Where, for example, there are differences in the speed of attentional processing, the number of objects one can attend to or the ability to successfully inhibit attentional information. There are countless tasks designed to target these and other attentional components, often with the aim to assess an individual’s attentional control and how this relates to performance in more complex tasks. The overall aim of this research was to investigate a novel, open-source, visual-attention task called the Multiple Object Avoidance (MOA) task to assess visual attention function and demonstrate its relatedness to attention in everyday tasks. This is a visuomotor task that was developed with the aim of creating a more active (i.e. involving visuomotor control) Multiple Object Tracking (MOT) task – a task that is often used in attention research given the proposed attentional similarities to complex everyday tasks. The MOA task was originally developed in response to previous research in driving. Mackenzie and Harris (2017) found that MOA performance positively predicted driving performance and eye-movement scanning in a driving task. The findings in that study highlighted the potential importance of such a task in predicting driving behaviour but had several limitations including the absence of an open-source version of the task. As such, this research is presented that continues the line of MOA and driving literature before also exploring the utility of MOA in further everyday domains; that of sport.

We first discuss the importance and development of the MOA in relation to “active vision” in the next section. In Study 1, we discuss the literature on visual attention and driving behaviour with a specific focus towards a more “action-related” visual assessment of these aspects. In this study, we aimed to explore the efficacy of the MOA task in predicting driving performance and hazard perception when driving. In Study 2, we explored the MOA in a sporting domain and aimed to investigate MOA performance differences in those that play sports – a population that has often been found to exhibit superior visual attentional function – and those that do not play sports. In order to establish a degree of construct validity we also aimed to investigate the attentional relatedness of the MOA to other cognitive tasks argued to be important in sporting performance.

A role for active visual attention tasks and the development of the MOA

Given the attentional complexity of “everyday tasks” (such as driving and sport), it is unlikely a task measuring a single attentional domain would predict overall task performance (Bowers et al., 2013; Liebherr et al., 2019). Often, the relatedness, or lack thereof, of the cognitive task or battery to the attentional demands of the everyday task limits its efficacy in predicting task performance. In the case of driving and sport, whilst elements of, for example, selective attention or executive control are important and could be assessed using tasks such as Stroop tasks or inhibitory response tasks, one must also have the ability to sustain and divide attention to dynamic stimuli (Alberti et al., 2014; Bowers et al., 2011; Mackenzie & Harris, 2017) which these types of selective attention tasks do not capture.

One task that may capture the range of attentional complexity in sport and driving is the Multiple Object Tracking task (MOT) (e.g. Cavanagh & Alvarez, 2005; Pylyshyn & Storm, 1988). In a simple MOT task, target and distractor objects are presented on-screen. Observers are asked to attend to all the targets. The target objects are usually denoted as such by a temporary increase in visual salience of the object e.g. by flashing or changing colour. Observers must continue to divide their attention across all target objects as they and the distractors move around the visual scene. Once the objects have stopped, observers must identify which of the objects were originally the targets. One might hypothesise that performance on this task correlates to complex everyday behaviour; in tasks that involve sustained and divided attention to multiple dynamic stimuli whilst ignoring distractor stimuli. Indeed, this has been found in a number of studies where poorer performance in an MOT task correlated to poorer performance on road tests, and poorer ability to detect pedestrians during simulated driving (Alberti et al., 2014; Bowers et al., 2011). Importantly, in the work of Alberti et al., (2014), MOT was a stronger predictor of pedestrian detection than the Useful Field of View (UFOV) task, which is a more reduced task that does not capture the sustained and dynamic elements of attention in everyday tasks. Michaels et al. (2017) investigated the relationship between individuals’ perceptual-cognitive capacity in a MOT task and driving behaviour. They found that individuals who performed more poorly in the MOT task were at a higher crash risk – particularly in older adults. Collectively, these results highlight the link between visual attentional function and task performance, and also the possible importance of using a more dynamic and sustained attention assessment in predicting behaviour.

Mackenzie and Harris (2017) argued that whilst the MOT likely captures attentional properties involved in driving more than tests such as the UFOV, it is still relatively passive in nature because there is no active, visuomotor interaction during the motion phase. Indeed, one of the better strategies to use in order to be successful in the MOT task is to make fewer eye movements and covertly attend to the stimuli by attempting to fixate centrally between the moving targets (Fehd & Seiffert, 2008; Oksama & Hyönä, 2016; Zelinsky & Neider, 2008), so that even the active exploration with the eyes is reduced. In driving, for example, there is a visuomotor element in controlling the vehicle (Kountouriotis et al., 2012; Land & Lee, 1994; Lehtonen et al., 2014) and one must make many eye movements to successfully identify hazards (Konstantopoulos et al., 2012; Underwood et al., 2002, 2005). Eye movements, attention, and action are often intrinsically linked (Hommel, 2010; Humphreys et al., 2010) particularly in everyday settings, including driving (Land, 2006; Tatler et al., 2011) and sport (Land & McLeod, 2000). In addition, different eye-movement strategies are observed between tasks involving action (visuomotor control) and their passive analogies, e.g. ‘real life’ versus video (Foulsham et al., 2011; Mackenzie & Harris, 2015; Risko et al., 2012). Thus, we argue, visual attention tasks incorporating the more active elements of attention may better predict performance.

Attempts have previously been made at capturing this more action-related element of visual attention by developing an interactive MOT task or iMOT (Thornton et al., 2014; Thornton & Horowitz, 2015). In this task, the individual must use a touch screen to move objects and prevent them colliding with each other. Following from this research, Mackenzie and Harris (2017) identified a task similar in nature that did not involve a touch screen element and only involved the control of one object (similar to driving) using a mouse. A non-touch screen design prevented obstruction of the screen from hands and arms. They termed this task the Multiple Object Avoidance (MOA) task. In this task, an individual controls one object (user-controlled object). Three other objects (red hazard balls) are present on screen and begin moving around. The task is to have the blue object avoid these red hazard objects that would move in a predictable, vector-like fashion. As the individual continues to manoeuvre the user-controlled object to avoid the hazard objects, the task gets increasingly harder as more red balls are added (one added every 10 s).

Arguably, a task like this involves similar attentional components to those used in complex everyday tasks. Namely, sustained attention to dynamic stimuli, divided attention, active vision, visuomotor control, and planning ability (i.e. the ability to predict the motion of the objects). In Mackenzie and Harris' (2017) work, performance on this MOA task significantly predicted driving performance and also predicted more effective horizontal spread of visual search – eye movement behaviour we typically see in more experienced drivers (Crundall & Underwood, 1998; Konstantopoulos et al., 2010; Konstantopoulos et al., 2012; Underwood et al., 2011). This relationship was stronger for MOA than a standard MOT task and was also stronger during more complex scenes; scenes that would intuitively involve more scanning type eye movements to detect hazards. They explain this relationship by suggesting that the active nature of the MOA which involves many eye movements to be successful may represent the eye movements one makes when driving and searching for hazards. This is important given that inattention and failures to scan the road are often contributing factors to accidents (Dingus et al., 2006; Lee, 2008).

The broad aims of this research are to replicate and extend Mackenzie and Harris (2017) and investigate how MOA predicts driving and driving related behaviours using a newly developed open-source version of the MOA task (Study 1) and begin exploring how MOA performance might differentiate between those with varying sporting expertise (Study 2).

Study 1: The Multiple Object Avoidance task and driving behaviour

Driving is a complex visuomotor everyday task. It requires the ability to control the vehicle whilst also attending to possible hazards. One’s own visual attentional functioning (that is, performance within specific facets of visual attention e.g. divided attention, speed of processing, working memory capacity etc.), is often therefore a predictor of driving behaviour and driving performance. For example, better ability within these visual attentional components relates to better driving overall and the individual elements of driving, e.g. hazard perception (Wood et al., 2016), vehicle control (Aksan et al., 2017; Louie & Mouloua, 2019) and, importantly, road accidents (Karimi et al., 2015). Thus, the importance of investigating and evaluating visual attention tasks that may help to predict, assess, or even train, driving behaviour is highlighted. In this study, we aimed to replicate and extend the results of Mackenzie and Harris (2017) by exploring the MOA task’s ability to predict simulated driving performance and hazard perception behaviour, and also how it may relate to other measures used in driving such as the Useful Field of View and the Driving Behaviour Questionnaire.

Measuring visual attentional functioning and driving performance

The relationship between visual attentional function and driving ability is evident in a number of studies where superior driving performance is predicted by superior performance in tasks measuring, for example, overall executive functions (Pope et al., 2016), processing speed (Ross et al., 2016) and sustained attention (Tabibi et al., 2015). Early work demonstrates how the Useful Field of View (UFOV) test (Ball et al., 1990) relates to driving behaviour and performance – particularly in older adults. Broadly, this test involves a range of executive functions measuring the ability to process multiple (divided attention) rapidly presented pieces of information (speed of visual processing) whilst ignoring distractors (executive control). Better performance in this test seems correlated to better driving performance, at least in older adults (Ball et al., 1993; Bedard et al., 2008; Clay et al., 2005; Owsley & McGwin Jr., 2010). The link between visual attention function and driving found with many UFOV studies (and other attention task studies) can perhaps be explained by the attentional similarities in what is required in driving and the UFOV task. In driving, one must also be able to process visual information effectively (e.g. hazards), divide attention to several elements of the environment (e.g. control of the vehicle, looking out for hazards etc) and, importantly, ignore distractors. One may argue that if a driver exhibits better attentional function, then they are better able to handle these attentional demands of the road.

It is, however, important to also highlight that some of the relationship between visual-cognitive tools and driving performance in older adults may simply reflect normal ageing. Bédard et al. (2016) investigated this idea using the ANT (Attentional Network Task) and UFOV tasks by running correlations between age and task performance within certain age groups (under 65 and over 65) rather than using the full age range of participants. When age was partialled out, correlations in task performance within these age groups disappeared. In the case of UFOV we know that visual processing speed (which the UFOV largely measures) is a cognitive function where the variability in processing speed is lower within younger populations, the decline is measurably marked with age and there are large differences between younger and older populations (Guest et al., 2015, 2017). It is therefore unsurprising that such a task would capture attentional and driving differences when used across these age groups. The research by Bédard et al. (2016) demonstrates that a large amount of the variability in task performance is simply accounted for by age. It also highlights issues in developing cognitive tasks to predict driving behaviour. We attempt to address this limitation here by developing a tool that correlates with driving performance within a younger adult population where variability of cognitive decline due to normal aging is unlikely to contribute to the variability in task performance.

Nevertheless, there has been some evidence that links performance in paradigms utilising static or brief presentation of stimuli to driving behaviour within adult (non-older) populations. Paradigms such as, for example, the Deceleration Detection Flicker Task, a task that measures ability to respond to a perceived reduction in driver headway (see Crundall, 2009; Lee et al., 2020), and the Attentional Network Task (ANT), a task that measures attention alerting, attention orienting and executive control (Fan et al. 2002, 2009). Weaver et al. (2009) conducted a study comparing performance on the ANT to both simulated and on-road driving performance. They found moderate relationships between overall ANT performance and driving scores (although this was stronger for simulated driving). However, the strength of the relationships between the individual attentional components and driving scores were quite weak. This may be surprising given that these attentional components would likely be used in driving where one must, for example, be vigilant for oncoming hazards (attention alerting), orient attention to potentially hazardous areas (attention orienting) and attend to hazardous areas whilst ignoring non-hazardous areas of the scene (executive control). However, in the study, they used a more general measure of driving (i.e. starting, stopping, signal violations, right of way violations) and a non-hazardous driving route that may not have been sensitive to measure performance in specific driving tasks where these attentional function components are arguably more vital (e.g. hazard perception). Roca et al. (2013) therefore investigated how performance in a version of the ANT predicted attention to specific hazardous events (hazards predicted by a single precursor). They found that attention orienting specifically was the best predictor of safe driving behaviour during specific hazardous events. Therefore, one might argue that tasks such as the ANT might do well in predicting behaviour during more specific and low occurrence road events (e.g. hazards) but may not capture the attentional complexity in more general driving. Our aim is to therefore extend beyond identifying how a task might predict specific driving events and develop a task that might better predict general driving performance. We believe, given the MOA’s targeting of active and divided attention to dynamic stimuli, it is a suitable candidate.

Aims and hypotheses

Mackenzie and Harris (2017) attempted to address the limitations of paradigms utilising static or brief presentation of stimuli as assessments of visual attention in relation to driving. Namely, limitations surrounding these tasks’ inability to capture general driving behaviour and surrounding the minimal contribution of more sustained and dynamic elements of visual attention in such tasks. They proposed the MOA task (Section 1.1.1) and found performance in this task did well in predicting driving performance in complex (e.g. urban) driving environments and also visual scanning, where better performance in MOA predicted better driving performance and wider horizontal scanning. However, there were a number of limitations identified and these are addressed in this first study. Importantly, we address these limitations using a new open-source version of the MOA task.

The first aim (1a) was to replicate the previous work of Mackenzie and Harris (2017) investigating how well performance in the MOA predicts simulated driving performance and to extend this by making a comparison with the predictive power of the UFOV; a more frequently used measure of attention in the driving literature. This was done by measuring performance on the MOA and UFOV and correlating scores, via regression, with driving performance in a driving simulator. We hypothesised that better MOA performance would predict better simulated driving performance.

The second aim (1b) was to identify how well performance in this task predicts hazard perception behaviour. Mackenzie and Harris (2017) previously claimed that being able to predict horizontal spread of visual search was an advantage in the MOA as this eye movement behaviour may help to identify hazards. However, there were no hazards present in that study to provide evidence for this. Therefore, we investigated if performance in the MOA (and UFOV) predicted the time to first fixate hazards during the simulated drive. We hypothesised that better MOA performance would predict earlier hazard first fixation times and this relationship would be stronger for the MOA than the UFOV because of the increased oculomotor activity required during the MOA.

The third aim (1c) was to investigate some measures of reliability and validity in the MOA. We did this by investigating the test–retest reliability (after 6 months) and investigated the concordant validity of this test with typically used self-report measures in driving using the Driving Behaviour Questionnaire (DBQ). We hypothesised there to be modest test–retest reliability. We also hypothesised there to be moderate correlations between the MOA and scores on the DQB where higher MOA scores would correlate to fewer instances of self-reported driving errors for each subscale of the DBQ.

Method

Participants

Forty-two participants (11 males; 31 females) with a mean age of 23.26 years (SD = 4.35) took part in this study. All participants held a valid driver’s licence (M = 4.44 years; SD = 4.05), drove on the left (e.g. United Kingdom) and had normal or corrected-to-normal vision (via contact lens). Participants were paid in £20 shopping vouchers for their participation. A sample calculation was conducted in R using the package pwr (v.1.3-0), which contains functions for basic power calculations using effect sizes and notations from Cohen (1988). A predicted effect size of Cohen f² = 0.4 was used using Mackenzie and Harris’ (2017) previous data where significant effect sizes ranged from f² = 0.20 and f² = 0.59. At a more conservative power level of 0.95, alpha error probability of 0.05 and three modelled predictors (theoretically; two predictors and one covariate), a sample of 45 participants would be recruited. For this, we note the limitation of being underpowered here in obtaining this effect size. For a power level of 0.8, alpha error probability of 0.05 and three modelled predictors, a sample of 29 participants would be recruited. For the MOA retesting, 28 of the participants were successfully recruited (nine males). There was no significant difference between the ages of participants between the original 42 participants (M = 23.26, SE = 0.67) and the 28 participants who returned for retesting MOA ((M = 24.39, SE = 0.87), t(56.14) = – 1.04, p = 0.30). Additionally, the difference in driving experience (years) between the original 42 participants (M = 4.44, SE = 0.63) and the 28 participants who returned retesting MOA (M = 5.41, SE = 0.86) was not significant (t(53.54) = – 0.92, p = 0.36). Ethical approval was given by Nottingham Trent University College Research Ethics Committee.

Stimuli and apparatus

Visual attention tasks

Multiple Object Avoidance (MOA) task

This task was programmed using Python and the packages: pygame, numarray, numeric, and numpy. Initially, four circles are presented on screen, each 40 pixels in diameter (~10.6 mm) size. One of these is blue and three are red. The blue circle is controlled by the participant using a mouse and the objective is to avoid the red hazard circles touching the blue circle as they move around the screen. There is an initial delay of 1 s where the red circles begin to move but are unfilled and unable to collide with the user’s blue circle. This is to give the participants time to identify the stimuli and their trajectory. The hazard circles are then filled in and, at this point, can collide with the blue user-controlled circle. After 10 seconds another moving red circle is added to the display. It initially appears as an unfilled circle for 1 s and is unable to collide with the user’s blue circle before being filled in red completely. All red circle movements followed predictable straight-line vector movements after an initial random trajectory. That is, a red circle will move in a straight line until it connects with either the edge of the screen or another red ball and ‘bounce’ off this object at an angle consistent with 2D vector physics. No red object moves in a random pattern after the initial trajectory and, as such, all movements are theoretically predictable. Speeds for each red circle are randomised and can range from any number from 0 to 680 pixels per second. A new red circle is added every 10 s, thereby increasing the difficulty of the task with more objects to track and avoid (see Fig. 1 for a sequential representation). The time (in seconds) that a participant can avoid colliding with a red circle is recorded as the score for the trial, with higher numbers reflecting greater proficiency at the task. In this study, participants completed ten trials; two practice trials and eight recorded trials. A mean of these final eight trials is taken as the measure of MOA performance. The task window is displayed at a size of 800 by 800 pixels. The task was presented on a 17.5-inch CTX EX951F monitor (Chuntex Electronic Co., Ltd., Taipei, Taiwan) with a refresh rate of 85 Hz.

Useful Field of View (UFOV)

Version 7 of the Useful Field of View task was used (Brain HQ, Posit Science). There are three subtasks. Subtask 1 one measures speed of processing to a single object. An image of either a car or truck appears in the centre of the display screen and it is the participant’s task to identify which object is presented. The duration for which the stimuli is presented at varies, in a stepped fashion, depending on the accuracy of identification (where better performance per trial results in shorter presentation durations). Subtask 2 measures divided attention where the central task remains the same as in Subtask 1, but the participant must identify where another target is on screen. Subtask 3, which measures divided attention amongst distractors, is the same as Subtask 2 but a number of distractors (triangle shaped stimuli) appear in the field of view (Fig. 2). Processing speed, as measured by the software depending on stimuli presentation duration, is used as a measure of performance in each subtask. Of interest for this study are the scores from subtask 3. Arguably Subtasks 1 and 2 are more relevant for Older Adult drivers and a ceiling performance was observed here in our sample for these subtasks. Subtask 3 – divided attention amongst distractors – is more relevant to attention in driving in a younger adult population and will show variance across younger participants. Note: presentation durations and processing speed calculations were all controlled by the UFOV software. Experimenters did not have access to the raw data for these and scores are not given for each trial individually. Performance is a measure of processing speed and is measured as the minimum amount of time required to correctly process the visual information where better performance results in lower minimum speed (in ms). The task was presented on a 17.5-inch CTX EX951F monitor (Chuntex Electronic Co., Ltd., Taipei, Taiwan) with a refresh rate of 85 Hz and at a screen resolution of 1280 x 1024.

Driving simulation

A RijSchoolSimulator driving simulator, developed by Carnetsoft, was used for the driving simulation aspect of the study, as used in previous studies on driving behaviour (Roca et al., 2018; Tejero et al., 2019). The hardware includes a Logitech G27 control set featuring an 11-inch leather wrapped driving wheel; 6-speed gear shifter (including reverse); steel accelerator, brake, and clutch pedals (Fig. 3a). Three display monitors provide a 210-degree horizontal field of view from the cabin in frontal and side view positions. Mirrors, dashboard, and road environment were all displayed across a three-screen panel display (Fig. 3b). The software allows for the preparation and testing of behavioural experiments. The graphic capabilities of the simulator are able to portray a 3D world, rear-view and side-view mirrors, as well as visual and sound effects to simulate changing weather conditions (see Fig. 3 for a representation of the visual field).

The software allows for interactive traffic in the form of moving vehicles as well as animated pedestrians and animals. Participants completed three driving routes of varying complexity. The urban route consisted of typical inner-city driving involving crossroads, single-lane traffic, traffic lights, pedestrians etc. This was the most complex route. The next complex route was a suburban carriageway. This consisted of both single and dual lane roads, junctions and a number of speed limit changes. The final, and least complex route was the inter-city motorway. This consisted of a multi-lane carriageway in a straight line. There was a moderate level of traffic throughout. Driving performance was tracked by the driving simulator throughout. This was a point-deduction system (as opposed to a demerit-based point system that has been previously used e.g. Mackenzie & Harris, 2017; Weaver et al., 2009) where points were deducted, starting from a score of 10, for driving error. Driving assessment included elements such as speed control, lane changing, rules of priority, gear changing, overtaking, steering, indicator usage, and negotiating roundabouts. All scores were controlled by the software. Each of the individual assessment tasks are rated as on a decimal scale from 0 to 10. Scores were recorded for each route and for all three combined (as an average of the three routes). Three hazards were programmed in the urban route. These were pedestrian-based hazards where a pedestrian would step into the road (Fig. 3).

Eye movement recording

Eye movements were recorded using SMI eye tracking glasses (ETG2), sampling binocularly at 60 Hz. The environment is captured using a forward-facing camera sampling at 60 fps. A standard one-point calibration was used using a circular target presented on-screen before each drive. Participants were free to move their head naturally as they drove. Eye movements were automatically overlaid onto forward-facing video by the eye tracking hardware.

The times to first fixate the hazards and hazard precursor were taken as our measures of hazard perception. The precursor for the three hazards are behavioural in nature (Crundall et al., 2012) whereby the precursor is the same stimulus as the hazard and behaves in a manner that allows for future projection of the hazard nature. In this instance, the pedestrians walk towards the road without slowing down, stopping or looking at approaching traffic. The time taken to fixate on precursors has been shown to discriminate between experienced and inexperienced drivers (Crundall et al., 2012). The precursor period is defined as the time between when the pedestrians enter the image and the frame before they step onto the road. Hazard onset times began on the frame pedestrians stepped into the road.

Eye movements were manually coded using “semantic gaze mapping”. This is a method used in real-time eye movement capture to attribute eye movements to a semantically meaningful area of interest. In this instance, the semantic categories of ‘hazard’ and ‘precursor’ were identified. The experimenter manually identified which fixations landed on each of the hazards and precursors and would, using the SMI BeGaze software, assign these to the area of interest. First fixation times for hazards were calculated as the time to first fixate on the hazard minus the hazard onset time. First fixation times for precursors were calculated as the time to first fixate on the precursor minus the time in which the precursor is first available in the field of view.

Questionnaire measures

The Driving Behaviour Questionnaire (DBQ) was used to measure self-report driving behaviour (Reason et al., 1990). We used the 28-item questionnaire with four subscales: Aggressive Violations, Ordinary violations, Attentional lapses and Errors. Each item describes a particular driving behaviour and participants rate on a Likert scale from 0 to 5 how often they exhibit the behaviour. The scales have been found to have reasonable internal consistency with alphas ranging from 0.65 to 0.86 (Oreyzi & Haghayegh, 2010) and has demonstrated evidence of construct validity in on-road behaviour (Zhao et al., 2012).

Procedure

Participants completed the driving simulation, the MOA and the UFOV tasks. Participants either performed the driving simulation first or the MOA and UFOV tasks first and this was counterbalanced. The order in which the tasks were completed was counterbalanced for each participant. Breaks were given between each component of the study.

For the driving task, participants were instructed in how to use the driving simulator. This included how to use the steering wheel and pedals, the gears, the vehicle indicators etc. Participants were asked to follow UK road rules as they would when driving on real roads (stopping at red lights, giving way, maintaining lane positioning, etc). They were instructed they would be completing three designated routes and were to follow the auditory satellite navigation to navigate the route. This navigation comprised simple directional instructions such as “turn left at the next intersection”. These directions were given well in advance of having to make any manoeuvres. Eye-movement calibration was conducted before each route. The order in which the routes were completed were randomised for each participant.

For the UFOV, participants were instructed to identify, using a mouse, the object appearing in the middle of the screen (either a car or a truck) and, in the case of Subtasks 2 and 3, were asked to identify where the secondary target appears in the periphery. For the MOA task, participants were instructed to control the blue circle, with the mouse, and avoid the red circles that appear and move around the screen. They were told that the task would get increasingly harder as more circles were added on the screen and the trial would end if they collided with any red circle. Ten trials (two practice trials) were completed.

For the test–retest measure, participants were recalled 6 months after the initial testing phase to complete another MOA testing phase. The same MOA testing procedure as described above was used.

Statistical design

Hierarchical linear regressions were conducted to investigate how MOA and UFOV performance predicted simulated driving performance and first fixation times (note, first time MOA scores were used and not test–retest scores). UFOV performance was initially entered into the models followed by MOA performance. Driving experience (years) was added into each model as a covariate. A paired samples t test and correlation were conducted to determine test–retest reliability after a 6-month period. Correlations were conducted to determine any relationships between the DQB subscales and MOA or UFOV.

Results

All data and R scripts are available on the OSF. Link: https://osf.io/3gdcv/?view_only=d400ccc4769149fd9125300d8cb165ce

Driving performance

An overall measure of driving performance was used here and was a point deduction system where a higher score suggests better driving performance. Driving performance was measured for each of the three courses separately and combined (as an average of the three routes). Driving experience was used as a covariate in these models. The relationship between driving experience and overall driving performance was not a linear relationship but rather a logarithmic relationship and therefore driving experience will be modelled as log transformed. A linear regression revealed that an increase in (log) driving experience predicted better overall driving performance (F(1,36) = 4.95, R² = 0.1, p = 0.03). For the MOA task, individual trial performance was measured as the time (in seconds) until the participant-controlled blue circle collided with one of the hazard red circles. Ten trials were completed by participants with two initial practice trials. Performance was averaged across the remaining eight trials. To provide evidence that the number of trials used here is suitable to reliably measure actual performance, a one-way ANOVA was used to examine the variability in performance across the order of the trials. Performance across the trials was not significantly different overall (F(7,287) = 1.32, p = 0.24) or between Trial 1 and Trial 8 (t(41) = – 1.56, p = 0.13).

Participants’ scores for Subtask 3 in the UFOV were used (calculated by the software) where a lower value suggests better performance (faster speed of processing). Descriptive statistics for driving and task performance can be viewed in Table 1. Four participants did not complete the driving tasks in full, so their data was not used for the analyses involving driving scores. Two participants did not successfully finish the MOA task and one participant did not successfully complete the UFOV task, so their data were not used for any analyses.

Table 1 Descriptive statistics and correlations (r values) of driving, task performance and driving experience

Full size table

Table 1 highlights that MOA performance correlates with all driving tasks. Hierarchical linear regressions were conducted with performance on the different driving routes as outcome variables and UFOV and MOA performance as predictors. For each of the hierarchical linear regressions, the first model featured UFOV performance as a standalone predictor variable and (log) driving experience used a covariate (although this is merely a theoretical distinction; both act as predictors). The second model featured both UFOV and MOA performance as predictor variables, and (log) driving experience as a covariate.

Overall driving performance was not significantly predicted by UFOV performance and driving experience (F(2,34) = 2.64, p = 0.09, adjusted R² = 0.08). When MOA performance was added to the model, the model significantly predicted overall driving performance (F(3,33) = 3.48, p = 0.027, adjusted R² = 0.17). The difference between the first and second model was significant (F(1,33) = 4.61, p = 0.039). An increase in MOA performance predicted an increase in driving performance.

For the urban route, UFOV performance and driving experience significantly predicted driving performance (F(2,34) = 4.99, p = 0.013, adjusted R² = 0.18). However, this was largely driven by driving experience rather than UFOV performance (Table 2). Adding MOA performance to the model improved the model fit (F(3,33) = 5.41, p = 0.004, adjusted R² = 0.27). There was a significant difference between the two models (F(1,33) = 5.05, p = 0.031). An increase in MOA performance predicted an increase in driving performance.

Table 2 Summary of regression models

Full size table

The models predicting driving performance for the suburban route were non-significant. This was the case when UFOV performance and driving experience were the predictors (F(2,34) = 0.68, p = 0.512, adjusted R² = – 0.02), and when MOA performance was added as a predictor (F(3,33) = 1.69, p = 0.188, adjusted R² = 0.05). The difference between the two models was non-significant (F(1,33) = 3.60, p = 0.067). The models predicting driving performance for the motorway route were also non-significant. This was found when UFOV performance and driving experience were predictors (F(2,34) = 2.73 p = 0.08, adjusted R² = 0.09), and when MOA performance was added as a secondary predictor (F(3,33) = 2.26, p = 0.1, adjusted R² = 0.1). There was no significant difference between the first and second model (F(1,33) = 1.28, p = 0.27).

Hazard perception (time to first fixate)

Recorded eye movement data were analysed to examine the relationship between effective eye movements during hazard perception, and the two attention tasks. The time to fixate on hazards were calculated (TTF) for both the precursor and hazard stimuli. A Pearson’s correlation of TTF for precursor eye movements found no significant relationship when paired with either UFOV (p = 0.81) or MOA performance (p = 0.59). In comparison, when examining TTF for hazard onsets, a Pearson’s correlation was significant for MOA scores R(32) = – 0.41, p = .02 (Fig. 4a) and UFOV (R(32) = 0.44, p = 0.01 (Fig. 4b). Better performance in these tasks correlated with faster detection of hazards. There was also a significant relationship between (log) Driving Experience and hazard fixation times with increased driving experience correlating with faster detection times (R(32) = – 0.54, p = 0.001)

To further investigate the relationships between effective eye behaviour (TTF performance) and UFOV and MOA performance, hierarchical regressions were conducted with TTF for the hazard onset period as the outcome variable. As in the hierarchical regressions of driving performance, the UFOV scores were entered as a single predictor in the first model with (log) driving expertise a covariate and then with MOA performance added as a secondary predictor in the second model. In the first model, UFOV performance and driving experience predicted TTF (F(2,29) = 8.31, p = 0.001, adjusted R² = 0.32); (UFOV: β = 0.29, t = 1.88, p = 0.07); (Driving experience: β = – 0.44, t = – 2.80, p = 0.01).

TTF performance was also significantly predicted when MOA performance was added as a second predictor along with UFOV performance and driving experience (F(3,28) = 6.30, p = 0.002, adjusted R² = 0.34). There was no significant difference between the two models (F(1,28) = 0.82, p = 0.19). In the second model, UFOV (β = 0.30, t = 1.96, p = 0.06), MOA (β = – 0.23, t = – 1.35, p = 0.19), and driving experience (β = – 0.32, t = – 1.84, p = 0.08) were not significant predictors.

Test–retest reliability

For both test stages of the MOA task, performance was measured by the time (seconds) in which the target object collided with another object across eight trials. Descriptive statistics of the performance across the two tasks are shown in Table 3. Performance across trials during did not significantly differ overall F(7,216) = 0.88, p = 0.52), or between Trial 1 and Trial 8 (t(27) = – 0.43, p = 0.67 suggesting no evidence for general improvement across trials within the testing session.

Table 3 Descriptive statistics for the Multiple-Object Avoidance (MOA) at both time points

Full size table

Two methods of analysis were employed to assess the consistency and reliability of the MOA task over time. Paired t tests were used to examine the difference in mean response time over time. There was no significant difference between MOA0 and MOA1 performance, t(27) = 1.15, p = 0.26. Pearson’s correlation coefficients were calculated to examine the reliability of individual subjects across both samples. Performance between the two tasks were positively correlated, r(27) = 0.42, p = 0.027. Figure 5 shows the relationship of MOA performance between MOA0 and MOA1.

Relationship with Driving Behaviour Questionnaire

MOA performance correlated with the Lapses subscale (r = – 0.35, p = 0.03). As performance in MOA increased the number of instances of attentional lapses decreased. There was no relationship between MOA performance and the number of Errors (r = – 0.21, p = 0.2), the number of Aggressive Violations (r = 0.06, p = 0.73) and the number of Ordinary Violations (r = – 0.12, p = 0.47). UFOV performance correlated with the Ordinary Violations (r = – 0.45, p = 0.006) and weakly with the Errors subscale (r = – 0.33, p = 0.05). Interestingly, as performance in the UFOV task decreased (slower processing speed) the number of instances of Ordinary Violations and Errors also decreased. There was no relationship for the Lapses subscale (r = – 0.31, p = 0.06), and the Aggressive Violations subscale (r = – 0.3, p = 0.07).

Discussion

The overall aim of this study was to replicate and expand on Mackenzie and Harris' (2017) exploration of the relationship between MOA performance and driving behaviour. The first aim was to replicate previous findings and investigate how well MOA performance predicted simulated driving performance and compare this to the predictive power of the UFOV task. We showed that the MOA does well in predicting driving performance, particularly in complex environments and does better than the UFOV at predicting driving performance. The second aim was to investigate how well the MOA predicts hazard perception; as measured by first fixation eye movements (TTF). We hypothesised MOA performance would more strongly correlate to first fixation times than the UFOV task given the active nature of the MOA. We found a relationship between MOA performance and TTF, but this was more robust for UFOV. The third aim was to investigate MOA task test-re-test reliability and explore its convergent validity with a self-report driving measure of visual attention when driving. We observed the hypothesised correlation in performance between testing and re-testing stages; suggesting reliability of this test to measure attentional function. We also found evidence of convergent validity with the subscale of the Driving Behaviour Questionnaire that specifically measures attentional lapses.

The MOA, attention, driving performance and hazard perception

We find here that attentional function, as measured by the MOA task, significantly predicted simulated driving performance, replicating the previous result of Mackenzie and Harris (2017). This is line with other studies that demonstrate the relationship between superior attentional function as measured by reduced attention tasks and improved driving performance (Karimi et al., 2015; Owsley & McGwin Jr., 2010; Roca et al., 2013; Weaver et al., 2009; Wood et al., 2016). If one demonstrates superior attentional function in a reduced task, then it seems unsurprising this would extend to a more complex task that may also involve these attentional components to a degree given what we know about cognitive transfer (Peng & Miller, 2016; Posner et al., 2015). If one is better able to, for example, divide attention across task operations such as vehicle control and identifying potential hazards, sustain their attention to the important aspects of the driving environment and effectively ignore irrelevant stimuli, they likely would be a better driver.

Previous research has discussed how some experimentally-reduced tasks such as the UFOV and ANT measure only properties of attention to brief stimulus presentation (Bowers et al., 2011; Mackenzie & Harris, 2017) where stimuli are presented for up to several hundred milliseconds. Tasks such as Multiple Object Tracking, which provide a measure of sustained divided attention to dynamic stimuli, are arguably more representative of driving due to their more temporally extended nature. We go further with the MOA where we argue that it also represents the more active visual attentional function involved in driving. In the MOA, one must actively be in visuomotor control of an object; a task we suggest targets the more active visual elements of everyday tasks (see Hommel, 2010; Humphreys et al., 2010; Land, 2006; Land & Lee, 1994; Mackenzie & Harris, 2015; Tatler et al., 2011). In addition, in MOA as in driving, one must divide attention to relevant stimuli, but the relevant importance of the objects changes in real-time during the MOA task. Only those objects that are either near or judged to potentially collide with the user’s object are directly relevant. This divided attention involved in controlling an object and predicting the behaviour (e.g. motion) of other objects could possibly be analogous to controlling a vehicle while predicting potentially hazardous events. The similarities in attentional processing between MOA and driving may explain the predictive power of the MOA task here. Supporting this claim may be the finding that this attention task correlates with the attentional lapses subscale on the DQB, suggesting convergent validity in the MOA’s ability to measure attentional function in relation to driving performance.

It is important to note that, whilst performance on the MOA predicted overall general driving performance, this was largely driven by driving performance during the most complex environment. As the complexity of the driving environment decreased, so did the predictive power of MOA performance. This is perhaps unsurprising however given the routes used in this study. This finding indeed mimics the eye movement finding in Mackenzie and Harris (2017) where MOA performance predicted increased visual scanning during more complex drives. The complex urban environment would demand more attentional resources in order to, for example, detect and respond to pedestrians walking across the road, searching intersections before committing to cross, turning across traffic (right turn in the UK) etc. In comparison to a simple and straight two-lane motorway where traffic is more regular and (arguably) more predictable; where maintaining lane positing may be more of the priority in order to drive successfully. As such, it would make sense for a relatively demanding visual attention task such as the MOA to predict performance during a drive that would place more of a demand on visual attention function.

Concerning the relationship between MOA performance and hazard perception we find some evidence that better performance was related to earlier hazard fixation times (TTF). The MOA requires one to make many eye movements; but mainly eye movements to open space to which the users’ target will imminently be moved, and to the hazard circles. This type of top-down intentional eye movement behaviour may mimic one looking ahead to where they are manoeuvring the vehicle and searching for hazards on the road, and thus, may explain the link we find here. However, the evidence for a strong predictive relationship was not observed when UFOV performance is included. UFOV performance had a stronger relationship to TTF than MOA (albeit not significant in the full regression model). The responsive nature rather than the predictive nature of the hazards and the speed of visual processing nature of the UFOV may explain this effect. There was no evidence that task performance (either MOA or UFOV) predicted fixations towards hazard precursors. In other words, participants were able to make similarly timed eye movements to the pre-onset hazards irrespective of visual attentional function. It was only after the pedestrians began to step into the road (and thus became fully developed hazards) that UFOV performance better related to fixation times. If an individual has faster visual processing speed (as measured by UFOV) then this may better allow them to make a re-fixation to the hazard when they acknowledge that the pedestrian is now a fully developed hazard. Whilst there could be an element of visual processing speed in the MOA, arguably this is better measured by the UFOV.

However, the predictive effect was not strong for either task. It would be of interest to investigate the relationship between MOA ability and the ability to make fixations to less subtle hazards, e.g. environmental hazards (Crundall et al., 2012; Shahar et al., 2010). We also highlight the limitation in our measure of TTF. Future work could investigate the relationship between MOA performance and the visual processing of a range of hazard types, where, for example fixation durations might offer more sensitive insights into hazard processing. For example, they may offer insights into processing times and indeed offer insights where longer processing might result in subsequent inattention to other relevant environmental cues (Velichkovsky et al., 2002). One might argue, however, that hazard fixations are not a good measure of hazard perception at all. It would be interesting to correlate MOA performance on more sensitive measures of hazard perception, or specifically, hazard prediction, such as the What Happens Next test (Crundall, 2016; Jackson et al., 2009) – a task that arguably better capture the ability to understand hazardous situations.

In summary, Study 1 has measured the ability of MOA task performance to predict simulated driving performance relative to the UFOV. It does this well for complex driving environments and the results highlight the potential for future work to be carried out in a number of further areas such as hazard prediction. Study 2 now investigates the MOA’s ability to measure individual differences in visual attention function in relation to sporting expertise whilst also investigating elements of construct validity raised in Study 1 by investigating MOAs relationship to cognitive processing speed and object tracking.

Study 2: The MOA in sport and its potential attentional composition

The MOA was initially developed to measure visual attention function in relation to driving behaviour. As with Multiple Object Tracking (MOT) tasks, the use of the MOA to investigate visual attention function in other ‘everyday’ tasks might be attractive to researchers given the attentional similarities between these tracking tasks. In this study, we therefore aim to investigate how the MOA might discriminate visual attentional function between individuals who play sports and individuals who do not whilst also investigating its possible attentional relatedness to other tasks often reported as being important in sporting performance: object tracking and cognitive processing speed.

Individual differences in visual attention in sport

The ability to control attention is important during complex activity such as sport (Engle, 2002). In many “strategic” sports (Voss et al., 2010), such as football or basketball, environments are typically visually noisy and require individuals to attend to multiple objects, inhibit task-irrelevant information, store object location information and make fast decisions. One might argue that an individual who engages in these types of sport might therefore exhibit superior attentional function in these domains or, indeed, vice versa where individuals who have superior attentional function exhibit superior sporting performance. There is mounting evidence for individual differences in lab-based measures of visual attentional control – both within sporting experience (i.e. high and low skilled players) and across sporting experience (sports players and non-sports players). For example, working memory capacity in basketball players (Furley & Memmert, 2012), temporal processing in tennis players (Overney et al., 2008), attentional window in soccer players (Scharfen & Memmert, 2019) and inattentional blindness in basketball players (Furley et al., 2010).

In their meta-analysis, Voss et al., (2010) report a medium effect for differences in processing speed (measured in a number of ways) across the literature. Arguably, fast processing times are needed for most sports players in order to react and make decisions in real-time. Importantly however, sporting type seemed to drive the differences between sporting and non-sporting individuals. The effect was stronger for what was termed “interceptive sports” (a sport that requires coordination between the body or a held implement and an object in the environment, e.g. badminton) compared to strategic sports (a sport that usually involves varied situations and requires the simultaneous processing of information regarding teammate, opponents, ball position etc., e.g. football).

Concerning object tracking and divided attention, similar results are observed. Howard et al., (2018) showed that those who engaged in team ball sport showed superior performance in a modified MOT task as well as a rapid visual presentation task, with both tasks requiring sustained attention to rapidly changing stimuli. Similarly, Qiu et al., (2018) report that elite basketball players outperformed intermediate and non-athletes during a MOT type task. Furthermore, Harris et al., (2020) found a similar result in football and rugby players. They argue this expertise is linked with an increased processing capacity rather than a more effective perceptual-cognitive strategy. In explaining the differences in attentional processing between sporting and non-sporting individuals in general, one might favour the “deliberate practice” view (e.g. Ericsson et al., 1993) where one obtains expertise through effortful and continuous engagement in a task. In other words, if one continually engages in tasks that target divided attention mechanisms or require fast processing speed, one improves in this attentional domain. However, see Hambrick et al., (2016) for a detailed discussion on the possible direction of causality. Irrespective of current debates in the field, with these individual differences described above, one could hypothesise the MOA might allow us to discern differences in visual attention function between those with different levels of sporting expertise. We test this here and aim to investigate any differences in MOA performance between individuals who play interoceptive/strategic sports and those that do not. Given the individual differences in attentional performance described above, we hypothesised that individuals who played sport would outperform those who do not.

In addition, we have suggested, in Study 1, that the two measures of visual attention function discussed above that are important in sport (object tracking and processing speed), might be attentional components involved in MOA performance, and we have offered these as explanations that might explain the efficacy of MOA in predicting driving behaviour and hazard perception performance. To provide some support to these claims, we used Study 2 as an opportunity to also investigate the relationship between these individual measures of attention and MOA performance. This would help to provide evidence of the proposed attentional composition of the MOA – providing some evidence of construct validity – but also aids in establishing the task beyond driving as these proposed attentional components are likely to be involved sport. This study aimed therefore to also investigate the contribution of sustained divided attention and inhibitory control (as assessed by the MOT task) and cognitive processing speed (assessed by a digit symbol substitution task) on MOA performance. This was investigated statistically in a hierarchical regression by examining the variability of MOA performance that can be explained by cognitive processing speed and object tracking ability. We hypothesised that MOA performance should positively correlate with MOT performance, given previous findings and the similar nature of spatiotemporal tracking involved, and also positively correlate with DSST performance, given the visually demanding nature of the MOA.