Auditory cognition and perception of action video game players

Stewart, Hannah J.; Martinez, Jasmin L.; Perdew, Audrey; Green, C. Shawn; Moore, David R.

doi:10.1038/s41598-020-71235-z

Auditory cognition and perception of action video game players

Article
Open access
Published: 01 September 2020

Volume 10, article number 14410, (2020)
Cite this article

Download PDF

You have full access to this open access article

Scientific Reports

Auditory cognition and perception of action video game players

Download PDF

Hannah J. Stewart^1,2,
Jasmin L. Martinez^1,3,
Audrey Perdew¹,
C. Shawn Green⁴ &
…
David R. Moore^1,5,6

7751 Accesses
8 Citations
5 Altmetric
Explore all metrics

Abstract

A training method to improve speech hearing in noise has proven elusive, with most methods failing to transfer to untrained tasks. One common approach to identify potentially viable training paradigms is to make use of cross-sectional designs. For instance, the consistent finding that people who chose to avidly engage with action video games as part of their normal life also show enhanced performance on non-game visual tasks has been used as a foundation to test the causal impact of such game play via true experiments (e.g., in more translational designs). However, little work has examined the association between action video game play and untrained auditory tasks, which would speak to the possible utility of using such games to improve speech hearing in noise. To examine this possibility, 80 participants with mixed action video game experience were tested on a visual reaction time task that has reliably shown superior performance in action video game players (AVGPs) compared to non-players (≤ 5 h/week across game categories) and multi-genre video game players (> 5 h/week across game categories). Auditory cognition and perception were tested using auditory reaction time and two speech-in-noise tasks. Performance of AVGPs on the visual task replicated previous positive findings. However, no significant benefit of action video game play was found on the auditory tasks. We suggest that, while AVGPs interact meaningfully with a rich visual environment during play, they may not interact with the games’ auditory environment. These results suggest that far transfer learning during action video game play is modality-specific and that an acoustically relevant auditory environment may be needed to improve auditory probabilistic thinking.

Training with an auditory perceptual learning game transfers to speech in competition

Article Open access 21 September 2021

Does the Noise Matter? Exploring Salient Audio Components in Video Prompting Interventions

Article 05 October 2017

Action Video-Game Training and Its Effects on Perception and Attentional Control

Introduction

Computer-based sensory and cognitive training has long held the promise of dramatic improvements on real world abilities. However, finding an auditory training task that successfully leads to improved speech perception in noise, a frequently reported auditory disability^1,2, remains elusive. Two problems have frequently occurred when training paradigms meant to improve such skills have been examined via carefully controlled experiments. The paradigms have either: (A) failed to produce benefits above and beyond those seen from placebo control conditions e.g., Ref.³; or (B) produced improvements on trained tasks, but with little improvement on untrained tasks, particularly those that were quite different from the trained task. Indeed, in a recent review of training studies aiming to improve auditory performance in adults with hearing loss, Ferguson and Henshaw⁴ clearly noted a general trend of improvement in the trained task (‘on-task’ learning) with little or no improvement in off-task abilities. Off-task abilities similar to those trained improved in some cases, a process termed ‘near transfer’, but there was little evidence of ‘far transfer’ to complex, off-task abilities. As an example of this latter situation, in one RCT⁵, training on a speech (phoneme) discrimination task⁶ produced robust on-task learning and some limited far transfer to auditory and visual divided attention and working memory tasks, but no generalized benefits for speech perception in noise.

In contrast to these often null or variable results, there has been a series of positive results demonstrating that training on one particular type of video game—dubbed action video games—produces enhanced, far transfer of visual cognition and visual perception abilities compared to non-players^7,8. Action video games require the player to collect objects and avoid obstacles while battling enemies and maintaining their game character’s health and lives. The games are typically fast paced and require skills in hand to eye coordination and fast reaction times. Examples of such games are first and third person shooters—e.g., Call of Duty and Gears of War. A significant body of work in this domain has established that a causal relation exists between the act of playing action video games and the observed enhancements via controlled intervention studies (i.e., where individuals are specifically trained on either an action video game or a control video game e.g.,^9,10,11,12). Yet much of the work in the field has been cross-sectional in nature e.g.,^13,14,15. In such designs, the perceptual or cognitive skills of individuals who choose to play a great deal of action video games as part of their daily life (referred to as ‘action video game players’ or AVGPs) are contrasted against those of individuals who do not play such games (here labeled as ‘non-players’ or NPs). Although such cross-sectional designs cannot be used to infer a causal relation, the cross-sectional methodology has the advantage that because action video games are the most popular video game bought and played in America¹⁶, it is relatively easy to identify and recruit AVGPs. Given the extreme cost and difficulty of running full intervention studies, cross-sectional designs are thus often utilized by researchers to determine whether a full scale intervention is warranted (i.e., if AVGPs do not show enhanced performance on a given measure as compared to NPs, despite typically having played hundreds if not thousands of hours of action video games, it would seem unlikely that a training study where individuals are asked to play, at most tens of hours of action video games, would produce a significant effect).

One perceptual task that illustrates far transfer of training on action video games is the Multiple Object Tracking (MOT) task¹⁷. The basic MOT task consists of mentally labeling, continuously monitoring, then identifying the colour of up to 16 moving dots. Although the stimuli are far removed from popular ‘first person shooter’ action video games, the results nonetheless show that not only do AVGPs outperform NPs on this task^18,19,20,21, but that deliberate action video game training also produces similar benefits, indicating that the relationship is causal¹⁰.

With AVGPs showing improvement on a wide range of skills assessed on tasks far removed from video game environments (for reviews see:^22,23) it has been suggested that action gaming is training ‘probabilistic thinking’²⁴. Auditory training paradigms have often attempted to train a restricted set of tasks and skills (e.g., speech phonemes⁵). This in turn often produces (at best) near transfer to outcome measures very similar or identical to the trained task. Instead, Bavelier et al. propose that action video game training induces a form of ‘learning to learn’ whereby individuals become generally better at learning to extract task relevant statistics. As a result, they are in turn better able to use a wide variety of task-relevant information occurring outside the trained game while ignoring distracting task-irrelevant information, as demonstrated by far transfer to complex tasks dissimilar to the trained task. Identification of a target in a noisy environment is a common challenge in sensory perception and pathology, for instance, attending to the relevant speaker while ignoring background speech. In terms of mechanisms, a test of the hypothesis of improved probabilistic thinking would be that the enhanced ability of AVGPs crosses modalities, evidenced by improved auditory cognition and perception.

There have been limited investigations into cross-modality improvements among AVGPs. Tetris, a visual puzzle game, has been found to improve frequency discrimination and auditory working memory when play was interspersed with a frequency discrimination task²⁵. Interestingly, auditory cognition and perception did not improve when the participants were exposed to the tetris stimuli without the gaming environment. One AVGP study designed an auditory perceptual task to match a visual perceptual task²⁶. Both tasks required a spatial decision about a target while the signal-to-noise ratio was manipulated. In the visual task the participants had to decide in which direction the majority of dots on a video display were moving for different levels of motion coherence. In the auditory task they had to decide in which ear they heard a target tone (the volume of which was adjusted between trials) while ignoring broadband noise in both ears. Both tasks showed that AVGPs were significantly faster than NPs at making these decisions, particularly at lower signal-to-noise ratios, while showing roughly equivalent levels of accuracy. This result suggests that AVGPs also have improved auditory cognition and perception, supporting cross-modal learning. However, the interpretation of this study is limited by the fact that the white noise masker would only have interfered with the target to the extent that it covers the same time/frequency regions. An informational masker, such as speech babble, would provide a more ecologically-valid masker requiring additional processes such as object formation and selection and linguistic processing^27,28.

As in previous studies of video game players we used a self-report measure of the number of hours a range of gaming categories (e.g., first/third person shooter, turn-based strategy, music games, etc.) were played in the current year and previous years. Consistent with previous work demonstrating that such questionnaires only support the division of gamers into broad categories of play time²⁹, we used four gaming classifications (full definitions in Table 1): AVGPs who played almost exclusively first/third person shooters; tweeners (TWs) who played multi-genre video games, typically online³⁰; others (OTs) who do not fit a clear definition of gaming; and NPs who played at most 5 h a week across all game categories.

Table 1 Categorization rules: with four possible formula options for categorisation of action video game players (AVGPs) and one formula option each for categorisation of tweeners (TWs) and non-players (NPs) based on weekly hours of play during the past year, and prior to the past year. Participants that did not fall into these three categories were labelled as others (OTs).

Full size table

This case–control study aimed to expand our understanding of the extent to which action video game experience is associated with cross-modal differences. We did this by examining performance on a variety of auditory tasks with varying demands, from simple RT based auditory attention to complex speech-in-babble identification. A finding of cross-modal differences in AVGPs as compared to NPs could prompt controlled intervention studies using action video games in therapeutic auditory training procedures. Thus moving away from training multiple specific mechanisms (whether auditory or cognitive) and towards the broader and potentially more motivating training offered by action video game play e.g.³¹. The participants were grouped by pre-existing gaming experience and received no study-related training. It was hypothesized that, compared to NPs, AVGPs would have better visual cognition and perception and better auditory cognition and perception. We expected that, across the tasks, TWs and OTs would perform numerically better than NPs but not as high as the genre-pure AVGPs.

Results

We tested 80 participants (Table 2) with a range of gaming experience in the past year, and prior to the past year (see Fig. 1). Using strict categorization rules we separated the participants into AVGPs, TWs, OTs, and NPs (Table 1).

Table 2 Descriptives of participants tested including gender and age (M mean, SD standard deviation).

Full size table

Visual multiple object tracking (MOT)

We measured the ability of participants (n = 80; Tables 1, 2; Fig. 1) to perform the MOT (Fig. 2A). As noted previously, the MOT has previously been shown to demonstrate transfer of learning derived from action video game play. As expected, accuracy for detection of targets decreased and reaction time increased as the number of targets increased from 1–7 (Fig. 3).

Replicating previous work, AVGPs performed better than NPs (a history of at most 5 h a week experience across game categories). For accuracy, group (AVGPs, TWs, OTs and NPs) and number of blue dots (set size) were analyzed in a 4 × 7 repeated measures analysis of variance (ANOVA; Fig. 3A). Mauchly’s Test of Sphericity indicated that the assumption of sphericity had been violated, χ²(20) = 75.40, p < 0.001, therefore degrees of freedom were corrected using Greenhouse–Geisser estimates of sphericity. A main effect of group was observed, F(3, 75) = 5.79, p = 0.001, η_p² = 0.19, with AVGPs performing more accurately than the NPs (p = 0.004, η_p² = 0.40), as expected. TWs and OTs were more accurate than the NPs (p = 0.006, η_p² = − 0.37; p = 0.004, η_p² = − 0.40 respectively). A main effect of set size was also observed (F(4.69, 351.88) = 40.71, p < 0.001, η_p² = 0.35) with accuracy decreasing as the set size increased. Group interacted with set size, F(14.08, 351.88) = 1.94, p = 0.021, η_p² = 0.072). Post hoc t tests (all p-values are Bonferroni corrected) showed that this interaction was led by AVGPs performing more accurately than NPs at set size 3 (p = 0.025) and 5 (p = 0.005) and NPs less accurately than OTs and TWs at set size 7 (both p = 0.001).

The same design of ANOVA was run on reaction time (RT; Fig. 3B). Mauchly’s Test of Sphericity indicated that the assumption of sphericity had been violated, χ²(20) = 232.81, p < 0.001, therefore degrees of freedom were corrected using Greenhouse–Geisser estimates of sphericity. There was a main effect of set size, F(2.46, 174.53) = 13.30, p < 0.001, η_p² = 0.16, with slower reaction times for larger set sizes. No main effect of group was observed (F = 0.80, η_p² = 0.033, BF₁₀ = 0.90) and no group interaction with set size (F = 1.05, η_p² = 0.042, BF_{10MainEffects}/BF_{10Interaction} > 100).

Test of Attention in Listening (TAiL)

TAiL (Fig. 2B) was developed as a simple, quick, multifaceted, RT-based index of attention modulation of auditory perception^32,33. It measures the ability to focus on a task dimension, tone frequency or location, and ignore an irrelevant (distracting) dimension. Here, AVGPs and NPs were similarly distracted by task-irrelevant auditory information and their ability to deal with conflicting auditory information (Fig. 4). Univariate ANOVAs showed no group differences in distraction (attend-frequency: p = 0.58, η_p² = 0.026, BF₁₀ = 0.15; attend-location: p = 0.93, η_p² = 0.006, BF₁₀ = 0.087) or attend-frequency conflict resolution (p = 0.23, η_p² = 0.056, BF₁₀ = 0.34). A group difference was found for attend-location conflict resolution (F(3, 75) = 2.94, p = 0.038, η_p² = 0.10). This was led by NPs being significantly more conflicted than OTs (p = 0.029, η_p² = 0.75).

Bamford–Kowal–Bench Speech-in-Noise (BKB-SiN)

The BKB-SiN presents simple sentences having 3–5 key words against a 4 talker ‘babble’ speech masker^34,35. Performance is measured by the speech–noise ratio (SNR) required to attain 50% correct key word responses (SNR-50). All groups achieved similarly sensitive, low SNRs, indicating good listening in noise performance (Fig. 5A). A univariate ANOVA showed no significant differences between the groups (p = 0.80, η_p² = 0.013, BF₁₀ = 0.10).

Listening in Spatialized Noise-Sentences (LiSN-S)

Another test of speech hearing in noise, the LiSN-S^36,37 measures ability to hear and recall spoken target sentences against a background of distracting talkers (Fig. 2C). The talkers may be the same or different voices, or come from the same or different directions. By subtracting performance on two versions of each condition, the LiSN-S achieves a degree of isolation between the auditory and cognitive contribution to each of three indices, Talker, Spatial and Total advantage³⁷. All groups scored similarly on the standardized Talker, Spatial and Total Advantage scores, with higher scores indicating better performance (Fig. 5B). Univariate ANOVAs found no significant group differences (Talker: p = 0.34, η_p² = 0.044, BF₁₀ = 0.23; Spatial: p = 0.48, η_p² = 0.033, BF₁₀ = 0.17; Total: p = 0.12, η_p² = 0.075, BF₁₀ = 0.54).

Listening environments during play

As part of the background questionnaire completed during recruitment we collected data on how the participants played and interacted with their video games. During first person shooter games, 47% of AVGPS and 43% of OTs used headphones and 53% of AVGPs, 47% of OTs and 47% of TWs used open-field loudspeakers. In action role-playing games, 33% of AVGPs and 28% of OTs used headphones and 67% of AVGPs, 69% of OTs and 53% of TWs used loudspeakers (Fig. 6A). However, a range of listening environments was used. Only 13% of AVGPs and 3% of OTs used surround sound during first person shooters and 7% of AVGPs and 3% OTs during action role-playing games. Of those that responded, the majority (53% AVGPs, 41% OTs, 24–29% TWs) never used surround sound and about a third (33% AVGPs, 41% OTs, 24% TWs) only sometimes used the feature (Fig. 6B). Through discussions with participants, we found that while playing gamers (AVGPs, TWs and OTs) often simultaneously listened to a separate and irrelevant auditory source (e.g., television, podcasts).

Discussion

This study assessed whether action video game play is associated with changes in visual and/or auditory skills. In terms of visual skills, pre-existing AVGPs were found to maintain better performance at higher set sizes as compared to NPs on the visual MOT task, replicating the highly cited finding that extensive action video game play is associated with enhanced visual cognition and perception⁸. However, we did not find any differences in performance on the auditory tasks between groups, suggesting that action video game play is not associated with a cross-modal benefit. Bavelier and colleagues (2012) proposed that all tasks AVGPs improve on may share the fundamental computational principle of making a decision based upon limited information in noise. However, we found this does not hold true for limited auditory information in noise, implying that ‘learning to learn’ has narrower boundaries than previously suggested and that probabilistic thinking is modality-specific. It may be that the probabilistic thinking process used to make judgements with limited visual information in noise is not adequate when dealing with the auditory system.

Green and colleagues (2010) found better performance by AVGPs on an auditory task compared to NPs. However, there are key differences in the function of the outcome measure used in that study compared to those used in the study reported here. First, TAiL is perceptually more demanding than the auditory tone location task used by Green. In TAiL, the task-relevant and -irrelevant information are part of the same sound object, while in Green’s task, the task-relevant (pure tone) and -irrelevant (white noise) information were separate sound objects. Second, the white noise used by Green et al. would have created energetic masking. The BKB-SiN and LiSN-S tasks used in this current study have speech babble maskers that create both informational and energetic masking^27,28 thus generating ecologically-valid everyday listening environments.

Improvement in speech-in-noise ability has been found in a randomized, double-blind study³⁸ showing that training on an audiomotor game leads to improved speech-in-noise ability in elderly hearing-impaired players, while training on an auditory working memory game does not. In Whitton’s study the training game involved interacting with auditory task-relevant stimuli where the participants monitored auditory feedback as they moved their finger through a virtual soundscape on a tablet device. The goal of the game was to complete a hidden puzzle by finding and rotating pieces. Participants improved their performance by monitoring the deviations between their expected and actual auditory feedback. Key to Whitton’s (2017) study, feedback was given through subtle variations in sound level, pitch or modulation rate, while ignoring task-irrelevant auditory information in the form of speech babble. It could be argued that, during play, participants were directly training auditory probabilistic thinking with task-relevant auditory stimuli. However, the task-irrelevant speech babble created informational and energetic masking mirroring the outcome measure (speech-in-noise). Therefore, using the definitions of Ferguson and Henshaw⁴, this training would be labeled as near transfer, while in the study reported here we were looking for evidence of far transfer training effects.

To be categorized as an AVGP in this study and previous studies e.g.,^8,30,39 the participants had to be heavy players of action video games. This was defined as playing for, what some may consider, an extreme number of hours each week and over a prolonged period of time (see Table 1). Using these definitions we replicated the previous finding that AVGPs (n = 15) showed superior performance on the MOT task compared to NPs. However, we also extended our analysis into TWs (n = 17) and OTs (n = 32) to cover a wider range of action video game experience. Similar to the results of³⁰ we found that, at least numerically, the TWs and OTs performed in between the AVGPs and NPs. This thus contributes to new directions in this field exploring the impact of groups beyond just the typical AVGP and NP populations that have been the focus of much of the literature thus far (e.g., to players of different genres or to intermediate players)^30,39,40.

Musicians (professional and amateaur) have been found to have superior speech in noise perception compared to non-musicians e.g.,^41,42,43; but see^44,45. These musicians complete extensive auditory training by playing/writing/conducting music for many hours each week and over a prolonged time. Strait and Kraus⁴⁶ suggest the interactive auditory environment musicians experience to be the key to their auditory learning. It may be that musicians are the auditory parallel to AVGPs in that musical training directly trains auditory probabilistic thinking in a similar way action video games trains visual probabilistic thinking.

The findings from our questionnaire on previous game play experience (Tables 1, 2; Fig. 1) suggest that a possible alternative reason for the absence of cross-modal benefit is that, for these gamers, interaction with the auditory information differed from interaction with the visual information in their gaming environment. This difference may stem from the fact that in the majority of video games visual information is task-relevant, while the auditory information is not. For example, visual information such as where the enemy is hiding or where an explosive was thrown is vital for successful game performance, whereas auditory information such as the sound of an explosion is considered an additional effect, rather than ‘life or death’ information within the game. Importantly, the auditory information usually lacks appropriate sound cues that, in this example, may include interaural differences indicative of the location of the explosion. Some games, for example those with surround sound, do make it possible to locate the source of the sound. However, the vast majority of our gamers reported that they did not have or listen to such meaningful sound.

A category of game where the auditory information is needed to successfully compete is audio games for the blind. In these ‘video-less’ games, images are replaced with musical cues and navigation by voice prompts, making audition the task-relevant modality. To maintain high performance, the auditory information in these games is vital. A game such as blind cricket (https://www.audiogamehub.com) could be used to assess the modality effects of game training by providing a controlled environment where the levels of visual and auditory information can be manipulated, for example by contrasting training using full visual graphics, reduced visual graphics (e.g., black and white with lower resolution), and only auditory cues. Such an investigation would provide a further assessment of whether probabilistic thinking requires modality-specific training.

Limitations

This was a case–control study to assess the potential value of using action video games to train speech in noise abilities. Two limitations arise from the retrospective design of the study in terms of gaming experience. First, the participants’ auditory cognition and perception prior to their gaming experience were unknown, and therefore not controlled. Controlled intervention studies have found a causal relation between playing action video games and enhancement of visual cognition and perception e.g.,^9,10,11,12. However, we were unable to assess whether prior auditory cognitive and perceptual ability affected auditory learning. Second, we used a questionnaire to gather data on the participants’ gaming history. Questionnaires have been shown to have a bias for both under and over-reporting prior behaviour; diaries and game play timers provide more accurate measurement e.g.,⁴⁷. There is also evidence that the more games a participant plays the larger the discrepancies in their questionnaire responses²⁹. However, questionnaires are able to reliably categorize the two ends of behavior^48,49. In order to include a wide range of gaming experience in our analysis we expanded the categories investigated from the typical AVGP and NP to include TW (multi-genre game playing, mostly online) and OT (gaming experience not fitting a clear definition).

Conclusion

While we replicate the finding that extensive action video game play is associated with better performance in the visual cognitive domain, we did not find a benefit for auditory cognition and perception. This suggests that the underlying probabilistic thinking video games are thought to improve may not be supramodal. If training probabilistic thinking is indeed modality-specific, then training using rich auditory information within a game format may lead to far transfer on auditory tasks. However, the acoustic characteristics of the game chosen may prove to be key.

Methods

Participants

A total of 85 individuals were recruited into this study through word of mouth and by using Institutional Review Board (IRB)-approved advertisements and materials via print, electronic, social and digital media at Cincinnati Children’s Hospital locations, and in the local and regional area. Five participants did not have normal hearing acuity (pure tone thresholds < 20 dB HL bilaterally at all octave frequencies from 250 to 8,000 Hz⁵⁰ and did not go on for further testing. The remaining 80 participants’ ages ranged from 18–30 (M = 25.07 years, SD = 3.72 years, 29 females and 51 males) (Table 2). All procedures were approved by the IRB at Cincinnati Children’s Hospital Medical Center. At recruitment participants consented to completing screening questionnaires covering background information and gaming experience. Informed written consent was obtained from each participant prior to testing and they were compensated for their time and effort. All experiments were performed in accordance with relevant guidelines and regulations.

Grouping

Participants were grouped into AVGPs, TWs, OTs and NPs by the number of hours spent playing different categories of games each week during the past year, and prior to the past year. The definitions of these groups can be found in Table 1, along with the break-down of gaming experience, regardless of categorization, in Fig. 1. We actively recruited participants that fitted the AVGP, TW and NP categories.

Equipment

Participants were tested individually in a sound-attenuated booth. All tests were presented on a PC, with a 21 inch flat screen monitor placed in front of the participant at full screen brightness. All auditory stimuli were presented through Sennheiser 25 circumaural headphones. The MOT was presented using MATLAB v2016a. TAiL⁵¹ and LiSN-S³⁶ were presented through their own, stand-alone software. The BKB-SiN task⁵² was played from its auditory recording. A horizontally placed custom made three choice button box was used to record reaction times in the MOT and TAiL. A hand print was placed in front of the button box to serve as a base for participants to place their dominant hand preceding each trial.

Stimuli and procedure

Four tests were administered in a single testing session lasting approximately 2 h. The initial test was counterbalanced across participants using a Latin square design.

Visual Multiple Object Tracking (MOT)¹⁷

Each trial began with 16 dots moving in a random, continuous manner within a circular, grey background (Fig. 2A). Stimuli consisting of yellow and blue dots were presented for 10 s. Participants were instructed to focus on the central fixation cross. After 2 s, the blue dots turned yellow. Four seconds later one dot, the target stimulus, turned white and the participant was prompted to indicate the original color of the white dot. Participants continued to fixate on the cross throughout each trial while responding as quickly and accurately as possible. Participants each underwent one experimental block of 40 trials. Average RT and accuracy were calculated for each participant for 1 to 7 blue dots.

Test of Attention in Listening (TAiL)³³

In each trial participants were presented with two 100 ms pure tones (gated on/off by 10 ms cos ramps) at 70 dB SPL with an inter-stimulus-interval of 300 ms (Fig. 2B). Tone pairs ranged from 476.2 to 6,187.5 Hz and were always at least 2.1 equivalent rectangular bandwidths (~ 4 semitones) apart. Using the button box, participants were tasked to indicate the correct response as quickly and accurately as possible. If the trial’s tones had the same task-relevant information (i.e. pitch in the attend-frequency condition; or ear presentation in the attend-location condition) participants were instructed to press the right button on the button box. If the trial’s tones differed in task-relevant information the participants were instructed to press the left button on the button box.

RT (from correct trials only) and accuracy were calculated for each TAiL condition for distraction and conflict resolution measures. Trials where the participant responded in less than 200 ms or longer than 2,500 ms were discarded in case of preemptive responding and interruption in performance. Distraction measures were calculated as the difference of responding to trials where the task-irrelevant information changed and trials where it did not, regardless of the task-relevant information (i.e. in the attend-location task: the difference between different frequency and same frequency trials, regardless of the location). Conflict resolution was calculated as the difference between incongruent and congruent trials (i.e. difference between trials where only one sound property changed and trials where both the sound properties changed or stayed constant).

Prior to testing, participants underwent practice trials for each condition in which they received a pass (60% correct) or fail. If passed, participants proceeded to do 3 blocks of 40 trials each for both conditions, a total of six blocks alternating between conditions. If failed, participants were given two more opportunities to complete the practice trials. If they still failed, they did not complete the blocks of TAiL testing.

Bamford–Kowal–Bench Speech-in-Noise (BKB-SiN)³⁵

The BKB-SiN test is a standardized speech perception test utilizing a simultaneous four talker babble noise to simulate a realistic listening environment. The recording was presented with the babble noise at 65 dB SPL through binaural headphones with one sentence at each signal to noise ratio (SNR) ranging from + 21 to − 6 dB in 3 dB intervals. Participants repeated the target sentence with the tester marking their responses following standard BKB-SiN scoring.

Listening in Spatialized Noise-Sentences (LiSN-S)³⁶

In this standardized test (Fig. 2C), a target signal was broadcast binaurally through headphones along with two other distracting signals. Both the target and distracting signals consisted of sentences spoken in American-English by an adult female. The target (T) and distractor (D1, D2) voices were manipulated with respect to talker (same voice, different voices) and direction (0°, ± 90° azimuth), creating four different listening conditions. From these four listening conditions, three difference scores were calculated: Talker advantage (different voices – same voice); Spatial advantage (different directions – same direction); and Total advantage (different voices and directions – same voices and directions).

Participants were asked to repeat the sentences of the target voice only. Distracting sentences remained constant at 55 db SPL. After each correct trial the target voice descended in level (4 dB), but if the participant incorrectly repeated back over 50% of the sentence the level increased (by 2 dB). The LISN-S software calculated the difference scores for each participant.

Data availability

The dataset generated during and analysed during the current study are available at GitHub: https://github.com/stewarthannahj/VGPs.git.

References

Moore, D. R. et al. Relation between speech-in-noise threshold, hearing loss and cognition from 40–69 years of age. PLoS ONE 9, e107720 (2014).
ADS PubMed PubMed Central Google Scholar
Motlagh-Zadeh, L. et al. Extended high-frequency hearing enhances speech perception in noise. Proc. Natl. Acad. Sci. https://doi.org/10.1073/PNAS.1903315116 (2019).
Article PubMed Google Scholar
Saunders, G. H. et al. A randomized control trial: supplementing hearing aid use with listening and communication enhancement (LACE) auditory training. Ear Hear. 37, 381–396 (2016).
PubMed Google Scholar
Ferguson, M. & Henshaw, H. How does auditory training work? Joined-up thinking and listening. Semin. Hear. 36, 237–249 (2015).
PubMed PubMed Central Google Scholar
Ferguson, M. A., Henshaw, H., Clark, D. P. A. & Moore, D. R. Benefits of phoneme discrimination training in a randomized controlled trial of 50-to 74-year-olds with mild hearing loss. Ear Hear. 35, e110 (2014).
PubMed PubMed Central Google Scholar
Moore, D. R., Rosenberg, J. F. & Coleman, J. S. Discrimination training of phonemic contrasts enhances phonological processing in mainstream school children. Brain Lang. 94, 72–85 (2005).
PubMed Google Scholar
Bediou, B. et al. Meta-analysis of action video game impact on perceptual, attentional, and cognitive skills. Psychol. Bull. 144, 77 (2018).
PubMed Google Scholar
Green, C. S. & Bavelier, D. Action video game modifies visual selective attention. Nature 423, 534–537 (2003).
ADS CAS PubMed Google Scholar
Green, C. & Bavelier, D. Action video game experience alters the spatial resolution of vision. Psychol. Sci. 18, 88–94 (2007).
CAS PubMed PubMed Central Google Scholar
Green, C. S. & Bavelier, D. Effect of action video games on the spatial distribution of visuospatial attention. J. Exp. Psychol. Hum. Percept. Perform. 32, 1465–1478 (2006).
PubMed PubMed Central Google Scholar
Feng, J., Spence, I. & Pratt, J. Playing an action video game reduces gender differences in spatial cognition. Psychol. Sci. 18, 850–855 (2007).
PubMed Google Scholar
Li, R., Polat, U., Makous, W. & Bavelier, D. Enhancing the contrast sensitivity function through action video game training. Nat. Neurosci. 12, 549 (2009).
PubMed PubMed Central Google Scholar
Colzato, L. S., Van Leeuwen, P. J. A., Van Den Wildenberg, W. & Hommel, B. DOOM’d to switch: superior cognitive flexibility in players of first person shooter games. Front. Psychol. 1, 8 (2010).
PubMed PubMed Central Google Scholar
Torner, H. P., Carbonell, X. & Castejón, M. A comparative analysis of the processing speed between video game players and non-players. Aloma Rev. Psicol. Ciènc. Educ. Esport 37 (2019).
Schubert, T. et al. Video game experience and its influence on visual attention parameters: an investigation using the framework of the Theory of Visual Attention (TVA). Acta Psychol. (Amst.) 157, 200–214 (2015).
Google Scholar
Entertainment Software Association. Essential facts about the computer and video game industry (2015).
Pylyshyn, Z. W. & Storm, R. W. Tracking multiple independent targets: evidence for a parallel tracking mechanism. Spat. Vis. 3, 179–197 (1988).
CAS PubMed Google Scholar
Dobrowolski, P., Hanusz, K., Sobczyk, B., Skorko, M. & Wiatrow, A. Cognitive enhancement in video game players: the role of video game genre. Comput. Hum. Behav. 44, 59–63 (2015).
Google Scholar
McDermott, A. F., Bavelier, D. & Green, C. S. Memory abilities in action video game players. Comput. Hum. Behav. 34, 69–78 (2014).
Google Scholar
Oei, A. C. & Patterson, M. D. Enhancing cognition with video games: a multiple game training study. PLoS ONE 8, e58546 (2013).
ADS CAS PubMed PubMed Central Google Scholar
Trick, L. M., Jaspers-Fayer, F. & Sethi, N. Multiple-object tracking in children: the ‘Catch the Spies’ task. Cogn. Dev. 20, 373–387 (2005).
Google Scholar
Palaus, M., Marron, E. M., Viejo-Sobera, R. & Redolar-Ripoll, D. Neural basis of video gaming: a systematic review. Front. Hum. Neurosci. 11, 259–276 (2017).
Google Scholar
Bavelier, D. & Green, C. S. Enhancing attentional control: lessons from action video games. Neuron 104, 147–163 (2019).
CAS PubMed Google Scholar
Bavelier, D., Green, C. S., Pouget, A. & Schrater, P. Brain plasticity through the life span: learning to learn and action video games. Annu. Rev. Neurosci. 35, 391–416 (2012).
CAS PubMed Google Scholar
Zhang, Y.-X., Tang, D.-L., Moore, D. R. & Amitay, S. Supramodal enhancement of auditory perceptual and cognitive learning by video game playing. Front. Psychol. 8, 1086 (2017).
PubMed PubMed Central Google Scholar
Green, C. S., Pouget, A. & Bavelier, D. Improved probabilistic inference as a general learning mechanism with action video games. Curr. Biol. 20, 1573–1579 (2010).
CAS PubMed PubMed Central Google Scholar
Brungart, D. S. Informational and energetic masking effects in the perception of two simultaneous talkers. J. Acoust. Soc. Am. 109, 1101–1109 (2001).
ADS CAS PubMed Google Scholar
Shinn-Cunningham, B. G. Object-based auditory and visual attention. Trends Cogn. Sci. 12, 182–186 (2008).
PubMed PubMed Central Google Scholar
Green, C. S. et al. Playing some video games but not others is related to cognitive abilities: a critique of Unsworth et al. (2015). Psychol. Sci. 28, 679–682 (2017).
PubMed Google Scholar
Dale, G. & Green, C. S. Associations between avid action and real-time strategy game play and cognitive performance: a pilot study. J. Cogn. Enhanc. 1, 295–317 (2017).
Google Scholar
Kollins, S. H. et al. A novel digital intervention for actively reducing severity of paediatric ADHD (STARS-ADHD): a randomised controlled trial. Lancet Digit. Health 2, e168–e178 (2020).
Google Scholar
Stewart, H. J. & Amitay, S. Modality-specificity of selective attention networks. Front. Psychol. 6, 1826 (2015).
PubMed PubMed Central Google Scholar
Zhang, Y.-X., Barry, J. G., Moore, D. R. & Amitay, S. A new Test of Attention in Listening (TAIL) predicts auditory performance. PLoS ONE 7, e53502–e53502 (2012).
ADS CAS PubMed PubMed Central Google Scholar
Bench, J., Kowal, A. & Bamford, J. The BKB (Bamford–Kowal–Bench) sentence lists for partially-hearing children’. Br. J. Audiol. 13, 108–112 (1979).
CAS PubMed Google Scholar
Brungart, D. S., Sheffield, B. M. & Kubli, L. R. Development of a test battery for evaluating speech perception in complex listening environments. J. Acoust. Soc. Am. 136, 777–790 (2014).
ADS PubMed Google Scholar
Cameron, S., Glyde, H. & Dillon, H. Listening in Spatialized Noise—Sentences Test (LiSN-S): normative and retest reliability data for adolescents and adults up to 60 years of age. J. Am. Acad. Audiol. 22, 697–709 (2011).
PubMed Google Scholar
Cameron, S. & Dillon, H. Development of the listening in spatialized noise-sentences test (LISN-S). Ear Hear. 28, 196–211 (2007).
PubMed Google Scholar
Whitton, J. P., Hancock, K. E., Shannon, J. M. & Polley, D. B. Audiomotor perceptual training enhances speech intelligibility in background noise. Curr. Biol. 27, 3237-3247.e6 (2017).
CAS PubMed PubMed Central Google Scholar
Dale, G., Kattner, F., Bavelier, D. & Green, C. S. Cognitive abilities of action video game and role-playing video game players: data from a massive open online course. Psychol. Pop. Media Cult. https://doi.org/10.1037/ppm0000237 (2019).
Article Google Scholar
Dale, G. & Green, C. S. The changing face of video games and video gamers: future directions in the scientific study of video game play and cognitive performance. J. Cogn. Enhanc. 1, 280–294 (2017).
Google Scholar
Strait, D. L., Parbery-Clark, A., Hittner, E. & Kraus, N. Musical training during early childhood enhances the neural encoding of speech in noise. Brain Lang. 123, 191–201 (2012).
PubMed PubMed Central Google Scholar
Zendel, B. R. & Alain, C. Concurrent sound segregation is enhanced in musicians. J. Cogn. Neurosci. 21, 1488–1498 (2009).
PubMed Google Scholar
Parbery-Clark, A., Skoe, E. & Kraus, N. Musical experience limits the degradative effects of background noise on the neural processing of sound. J. Neurosci. 29, 14100–14107 (2009).
CAS PubMed PubMed Central Google Scholar
Mankel, K. & Bidelman, G. M. Inherent auditory skills rather than formal music training shape the neural encoding of speech. Proc. Natl. Acad. Sci. U.S.A. https://doi.org/10.1073/pnas.1811793115 (2018).
Article PubMed PubMed Central Google Scholar
Ruggles, D. R., Freyman, R. L. & Oxenham, A. J. Influence of musical training on understanding voiced and whispered speech in noise. PLoS ONE 9, e86980 (2014).
ADS PubMed PubMed Central Google Scholar
Strait, D. L. & Kraus, N. Biological impact of auditory expertise across the life span: musicians as a model of auditory learning. Hear. Res. 308, 109–121 (2014).
PubMed Google Scholar
Kahn, A. S., Ratan, R. & Williams, D. Why we distort in self-report: predictors of self-report errors in video game play. J. Comput. Mediat. Commun. 19, 1010–1023 (2014).
Google Scholar
Room, R. Measuring alcohol consumption in the United States. In Research Advances in Alcohol and Drug Problems 39–80 (Springer, 1990).
Shakeshaft, A. P., Bowman, J. A. & Sanson-Fisher, R. W. A comparison of two retrospective measures of weekly alcohol consumption: diary and quantity/frequency index. Alcohol Alcohol 34, 636–645 (1999).
CAS PubMed Google Scholar
British Society of Audiology. Pure-tone air-conduction and bone-conduction threshold audiometry with and without masking (2011).
Zobay, O. et al. A new software implementation of the Test of Attention in Listening. In BSA Basic Auditory Science meeting (2016).
Etymōtic Research. Bamford–Kowal–Bench Speech-in-Noise Test (2005).

Download references

Acknowledgements

This research was funded by the Cincinnati Children’s Research Foundation and by the National Science Foundation (Extended Learning Network, NSF BCS-1057625). David Moore is supported in part by the NIHR Manchester Biomedical Research Centre.

Author information

Authors and Affiliations

Communication Sciences Research Center, Cincinnati Children’s Hospital Medical Center, Cincinnati, OH, 45229, USA
Hannah J. Stewart, Jasmin L. Martinez, Audrey Perdew & David R. Moore
Division of Psychology and Language Sciences, University College London, London, UK
Hannah J. Stewart
Department of Communication Sciences and Disorders, University of Cincinnati, Cincinnati, OH, USA
Jasmin L. Martinez
Department of Psychology, University of Wisconsin-Madison, Madison, WI, USA
C. Shawn Green
Department of Otolaryngology, University of Cincinnati, Cincinnati, OH, USA
David R. Moore
Manchester Centre for Audiology and Deafness, University of Manchester, Manchester, UK
David R. Moore

Authors

Hannah J. Stewart
View author publications
You can also search for this author in PubMed Google Scholar
Jasmin L. Martinez
View author publications
You can also search for this author in PubMed Google Scholar
Audrey Perdew
View author publications
You can also search for this author in PubMed Google Scholar
C. Shawn Green
View author publications
You can also search for this author in PubMed Google Scholar
David R. Moore
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

This study was designed by H.J.S., A.P., C.S.G. and D.R.M. Data collection and analysis was done by H.J.S., J.L.M., A.P. Manuscript was written and edited by all authors.

Corresponding author

Correspondence to David R. Moore.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Figure 1.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Stewart, H.J., Martinez, J.L., Perdew, A. et al. Auditory cognition and perception of action video game players. Sci Rep 10, 14410 (2020). https://doi.org/10.1038/s41598-020-71235-z

Download citation

Received: 20 December 2019
Accepted: 05 August 2020
Published: 01 September 2020
DOI: https://doi.org/10.1038/s41598-020-71235-z
Springer Nature Limited

This article is cited by

Novel 3-D action video game mechanics reveal differentiable cognitive constructs in young players, but not in old
- Tomihiro Ono
- Takeshi Sakurai
- Toshiya Murai
Scientific Reports (2022)
Multisensory GPS impact on spatial representation in an immersive virtual reality driving game
- Laura Seminati
- Jacob Hadnett-Hunter
- Karin Petrini
Scientific Reports (2022)
Training with an auditory perceptual learning game transfers to speech in competition
- E. Sebastian Lelo de Larrea-Mancera
- Mark A. Philipp
- Aaron R. Seitz
Journal of Cognitive Enhancement (2022)

Auditory cognition and perception of action video game players

Abstract

Similar content being viewed by others

Training with an auditory perceptual learning game transfers to speech in competition

Does the Noise Matter? Exploring Salient Audio Components in Video Prompting Interventions

Action Video-Game Training and Its Effects on Perception and Attentional Control

Introduction

Results

Visual multiple object tracking (MOT)

Test of Attention in Listening (TAiL)

Bamford–Kowal–Bench Speech-in-Noise (BKB-SiN)

Listening in Spatialized Noise-Sentences (LiSN-S)

Listening environments during play

Discussion

Limitations

Conclusion

Methods

Participants

Grouping

Equipment

Stimuli and procedure

Visual Multiple Object Tracking (MOT)17

Test of Attention in Listening (TAiL)33

Bamford–Kowal–Bench Speech-in-Noise (BKB-SiN)35

Listening in Spatialized Noise-Sentences (LiSN-S)36

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary information

Supplementary Figure 1.

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Novel 3-D action video game mechanics reveal differentiable cognitive constructs in young players, but not in old

Multisensory GPS impact on spatial representation in an immersive virtual reality driving game

Training with an auditory perceptual learning game transfers to speech in competition

Search

Navigation

Visual Multiple Object Tracking (MOT)¹⁷

Test of Attention in Listening (TAiL)³³

Bamford–Kowal–Bench Speech-in-Noise (BKB-SiN)³⁵

Listening in Spatialized Noise-Sentences (LiSN-S)³⁶