Pupil-BLAH-metry: Cognitive effort in speech planning reflected by pupil dilation
In reading research, a longstanding question is whether any stages of lexical processing require central attention, and whether such potential demands are frequency-sensitive. In the present study, we examined the allocation of cognitive effort in lexical processing by examining pupil dilations and naming latencies in a modified delayed naming procedure. In this dual-task/change procedure, participants read words and waited for various delays before being signaled to issue a response. On most trials (80%), participants issued a standard naming response. On the remaining trials, they were cued to abandon the original speech plan, saying “blah” instead, thereby equating production across different words. Using feature-matched low- and high-frequency words, we observed the differences in pupil dilations as a function of word frequency. Indeed, frequency-sensitive cognitive demands were seen in word processing, even after naming responses were issued. The results suggest that word perception and/or speech planning requires the frequency-sensitive allocation of cognitive resources.
KeywordsWord recognitionCognitive controlAutomaticityAttention
Word frequency effects are classic and robust in psycholinguistic research, predicting performance across many tasks, including lexical decision (Rubenstein, Garfield, & Millikan, 1970; Stone & Van Orden, 1993), naming (Forster & Chambers, 1973; Waters & Seidenberg, 1985), and perceptual identification (Manelis, 1977). Even in silent reading, word frequency has powerful effects on eye movements and fixation durations (Inhoff & Rayner, 1986; Rayner & Duffy, 1986). Despite the ubiquity of frequency effects, questions remain about their locus in word processing: Are frequency effects restricted to early processes of perception and lexical access, or do they continue into postaccess processes? A related question concerns the automaticity of lexical access: Does word perception require central attention, and might cognitive demands differ across high-frequency (HF) and low-frequency (LF) words?
The word frequency effect is most commonly assessed using response times (RTs); across variations in methods, stimuli, and participants, people are faster to process HF words, relative to LF words. Critically, however, frequency interacts with many other variables, such as context (Becker & Killion, 1977), stimulus quality (Yap, Balota, Tse, & Besner, 2008), word repetition (Scarborough, Cortese, & Scarborough, 1977), and neighborhood density (Andrews, 1992; Grainger & Segui, 1990). Frequency effects also change across tasks; for example, they are approximately three times larger in lexical decision, relative to naming. As suggested by Balota and Chumbley (1984), this interaction suggests that the decision stage (a necessary component for lexical decision, but not for naming) is frequency-sensitive. Thus, frequency appears to affect processes beyond perception in lexical decision. Indeed, Balota and Abrams (1995) later reported that frequency influences response mechanics (e.g., arm movement force) in lexical decision. In contrast, the naming task does not entail a decision stage, and is often assumed to provide a “cleaner” estimate of frequency (and other lexical) effects.
The delayed naming task
In a standard naming task, participants try to correctly name stimulus words as quickly as possible following their presentation (e.g., Forster & Chambers, 1973). Given this procedure, however, an RT difference between LF and HF words might reflect lexical access, or it might reflect uncontrolled differences in articulatory complexity (or other factors) across the words. To address this issue, the delayed naming task was developed. In delayed naming, the participant sees a word and then waits, often for variable intervals, for a “go” signal to initiate the naming response. The logic is simple: Because lexical access is allowed to finish, any remaining frequency (or other) effect should arise from speech planning or execution. As reviewed by Goldinger, Azuma, Abramson, and Jain (1997), frequency effects have been inconsistently reported in delayed naming. Indeed, the procedure was originally used as a control task, to ensure that frequency effects in naming times did not unduly reflect phonetic variations across items (Andrews, 1989; Forster & Chambers, 1973). Conversely, in a well-known study, Balota and Chumbley (1985) had participants view HF and LF words, wait for response cues at various delays, and then quickly speak. Balota and Chumbley observed reliable frequency effects following relatively long delays (650 ms or longer), suggesting that frequency affected postaccess processes. These findings have been replicated and extended (Connine, Mullennix, Shernoff, & Yelen, 1990), but have also been questioned (McRae, Jared, & Seidenberg, 1990). For example, whenever participants must produce different words, there remains a potential for uncontrolled phonetic variations, leading to differences in voice-key activation.
To address the issue of interword phonetic variations, Goldinger et al. (1997) modified the delayed naming task not only to wait out lexical access, but also to equate production, leaving only the speech-planning process free to vary. To accomplish this purpose, they used a dual task/change procedure. Most trials (80%) entailed “standard” delayed naming, in which participants saw words, waited for variable delays, and spoke the words in response to a high-pitched tone. In the remaining, catch trials, a low-pitched tone indicated that participants should abandon the speech plan, quickly saying “blah” instead. Using delays ranging from 150 to 1,400 ms, Goldinger et al. (1997) observed robust frequency effects in both standard and catch trials, with slower responses following LF words. That is, people were slower to produce LF words, as well as slower to switch away from LF words to the “blah” response. The results suggested that word frequency affects speech planning, independent of speech production, with LF words demanding extra attention. Still, other researchers have employed the delayed naming procedure and have not observed any reliable frequency effects (McCann & Besner, 1987; McRae et al., 1990; Savage, Bradley, & Forster, 1990). In general, frequency effects in delayed naming are fragile, appearing and disappearing with changes to the stimuli and methods. As such, it is difficult to conclude that word frequency truly affects lexical processing beyond the early stages of perception.
Given such fragile effects in delayed naming, one goal of the present study was to widen our focus, expanding from relying on individual RTs to examining a more continuous estimate of cognitive effort. Our specific approach was to collect continuous estimates of pupil dilation while participants performed a modified delayed naming task, as in Goldinger et al. (1997). This approach allowed for the assessment of potential frequency effects in delayed naming, independent of the presence or absence of an RT effect. Although the pupils change reflexively in response to general factors, such as emotional arousal, stress, and anxiety, such tonic changes are independent of phasic changes, which are based on the onset of stimuli for cognitive processing (Karatekin, Couperus, & Marcus, 2004). Phasic changes in pupil diameter have been used to infer cognitive effort across many domains, including lexical decision (Kuchinke, Võ, Hofmann, & Jacobs, 2007), attention allocation (Karatekin et al., 2004), working memory load (Granholm, Asarnow, Sarkin, & Dykes, 1996; Van Gerven, Paas, Van Merriënboer, & Schmidt, 2004), face perception (Goldinger, He, & Papesh, 2009), and general cognitive processing (Granholm & Verney, 2004).
The appeal of pupillometry for the investigation of cognition lies in its relative independence: Pupil dilation and constriction occur automatically, so cognitive effort can be gauged without influences from task-specific strategies. Prior research has shown that, when people perform more difficult cognitive operations, their pupils dilate (Porter, Troscianko, & Gilchrist, 2007), which may represent the summed index of the required brain activity (Beatty, 1982). Task-evoked peak pupil diameters have been used to index the cognitive load associated with memory tasks (Kahneman & Beatty, 1966), showing that pupils dilate relatively quickly with the onset of mental effort and constrict relatively quickly after its cessation. Indeed, Kahneman (1973) used the pupillary reflex as his primary index of mental processing load in his theory of attention, citing its sensitivity to variations within or between tasks and its ability to reflect individual differences in cognitive ability. Relevant to the present study, Kuchinke et al. (2007) recorded pupil dilation during a lexical decision task, using HF and LF words of varying emotional valences. Average peak pupil dilation in response to LF words significantly exceeded peak dilation to HF words (approximately 1,200 ms after stimulus onset), suggesting that pupil dilation can indirectly reflect the cognitive effort devoted to word processing. Given their use of lexical decision, however, the role of the decision stage remains unclear. A central aim of the present study was to assess “hidden” frequency effects in lexical processing without requiring lexical decisions.
When using pupillometry, particular caution must be exercised in stimulus generation and development of the procedural details. The pupils dilate reflexively to changes in the luminance, color, and spatial frequencies of visual input. Therefore, care must be taken to equate, as closely as possible, the stimulus characteristics across conditions. Porter and Troscianko (2003) identified several methods to minimize unwanted pupillary reflexes, including the use of relatively low stimulus contrast, avoiding colored stimuli, and using relatively long stimulus exposure durations. To address these considerations in the present investigation, we maintained constant background lighting and a constant screen color and (of greatest importance) equated the visual features of our stimulus words as closely as possible. Although we presented words briefly (500 ms), we examined extended time courses for pupillary reflexes, by utilizing response tone delays up to 2,000 ms and extending trials up to 6,000 ms postresponse. Examining an extended time course is critical, because pupillary reflexes appear several hundred milliseconds after the onset of stimulus processing (see Kuchinke et al., 2007). It is important to note, however, that pupillary responses to cognitive activity occur within approximately 250 ms (Beatty, 1982) and can be used to examine activities (such as reading; Heitz, Schrock, Payne, & Engle, 2008) that extend continuously in time.
In the present research, we adopted the modified, dual-task delayed naming paradigm used by Goldinger et al. (1997) and continuously monitored participants’ pupil diameters to estimate cognitive effort during speech planning. After reading briefly presented words, the participants waited for a response tone to indicate one of two responses, with different tones signaling either a standard naming response or a catch response (i.e., “blah”) .1 Response tones occurred following one of four randomly selected delays, ranging from relatively short (250 ms) to relatively long (2,000 ms). Given the nature of phasic pupillary reflexes, we expected to observe diverging pupil diameters for LF and HF words across longer delays. Specifically, if word frequency influences speech planning, preparing to speak LF words should demand greater cognitive resources, reflected by relatively enlarged pupils, following the offset of the printed word. Furthermore, this result should persist even when participants eventually execute a catch response, in which HF and LF words are phonetically equal. That is, we expected pupil dilation to reflect the greater effort required to abandon the speech plans for LF words. Although results in delayed naming RTs have been equivocal, we expected to observe a physiological manifestation of frequency effects by examining pupil diameters across standard and catch trials. In theoretical terms, such a finding would indicate that speech planning requires central attention, and that its demands vary across words in a frequency-sensitive manner. To foreshadow our results, we observed a frequency effect in standard delayed-naming RTs, but not in catch-trial RTs. However, we did observe clear differences in the allocation of cognitive effort across HF and LF words, which persisted for several hundred milliseconds after naming responses.
A group of 43 native English speakers (18–30 years of age, M = 19.50, SD = 2.2) from Arizona State University participated for partial course credit. All of the participants reported normal or corrected-to-normal vision and no hearing difficulty. Two of the participants were dropped from the analysis because of missing pupil data (greater than 7% of their data).
Summary statistics for the stimulus set
Mean frequency (KF) †*
Mean frequency (HAL)*
Mean frequency (Subtitle) ††*
Mean orthographic N
Mean orthographic Nf*
Mean phonologic N
Mean phonologic Nf
Mean length (letters)
Mean length (phonemes)
Participants were tested individually in a dimly lit, sound-attenuated booth. A chinrest maintained head position and viewing distance at 60 cm. The stimuli were presented in lowercase, 24-pt black Arial font on a constant gray background (RGB 150) at 1,024 × 768 resolution on a 17-in. monitor. Eye movements and pupil diameter were continuously recorded binocularly at 50 Hz, using a Tobii 1750 eyetracker. Naming latencies were recorded by an SR response box with a voice key. The experiment was controlled and data were collected using E-Prime 1.1 software (Psychology Software Tools, 2006).
Participants were first familiarized with the experiment and the eyetracker. The chinrest was adjusted such that eye position was maintained centrally on the horizontal axis and slightly above center on the vertical axis, and then participants were calibrated. For the calibration routine, we randomly presented nine fixation points (indicated by a blue dot) over the range of the display; participants “followed the dot” as it moved to each location. If the software or researcher identified any missing fixations, the calibration routine was repeated. All of the participants were successfully calibrated within two attempts.
Trials with errors (i.e., mispronunciations, pretone responses, and voice key errors) were removed from the analysis.4 Missing pupil data (due to blinks) were filled in by linear interpolation, and peak diameters were averaged during specific trial events (fixation, word presentation, wait, tone, response preparation, response, and postresponse intertrial interval [ITI]). The missing pupil data were not systematically distributed across experimental variables, and none of the 41 participants whose data were retained had more than 6% missing data. Baseline-corrected peak pupil diameters were computed, on a trial-by-trial basis, as the difference between the average of the participant’s pupil diameter during the fixation cross and the event peak of the current trial. This procedure minimized the influence of “carryover” effects from the word frequency or difficulty of the preceding trial. All pupil analyses were conducted on baseline-corrected data from each participant’s right eye. For all analyses, alpha was maintained at .05, and multiple comparisons were corrected with Bonferroni adjustments.
Average naming response times (in milliseconds) in all conditions
Following the omnibus analysis, we conducted separate 2 × 4 (Word Frequency × Delay) RM ANOVAS on the standard trials and the catch trials. In the standard trials, we observed a reliable, 17-ms effect of word frequency, F(1, 35) = 7.48, p < .01, ηp2 = .18. The main effect of delay was marginal, F(3, 105) = 2.46, p = .07, ηp2 = .06, reflecting the small (20-ms) decrease in average naming RTs across delays. The interaction was not reliable (F < 1). In the catch trials, the main effect of word frequency was not reliable (F < 1), but there was a robust effect of delay, F(3, 105) = 67.92, p < .01, ηp2 = .66, reflecting the large (964-ms) decrease in RTs across delays. The interaction was not reliable (F < 1). Taken together, the naming latencies partially replicated the prior results from Goldinger et al. (1997), although two effects (delay in standard trials and frequency in catch trials) were weaker than had been previously reported. We consider these results in the Discussion section, after examining the pupillary results.
Baseline-corrected peak pupil diameters were analyzed in separate 2 (word frequency) × 2 (trial type) × 4 (delay) within-subjects RM ANOVAs for each of the seven trial events. That is, pupil diameters were not examined in veridical “real time,” but in time-invariant sequences of trial events. For example, the “postresponse” trial period in the 250-ms delay condition occurred in real time at approximately 2,100 ms; this was the real-time equivalent of the “wait” trial period in the 2,000-ms delay condition. Because of such timing differences, pupillary responses were analyzed according to trial events. (Note, however, that the same results were observed in a set of real-time analyses.) Complementary analyses were carried out for each delay condition separately, on baseline-corrected, event-locked peak pupil diameters. For ease of presentation, these results are discussed in order of the trial events, as peak millimeter differences from baseline (mmd). The full ANOVA results can be found in Appendix B; only results of theoretical interest are discussed in the text.
When participants correctly issued verbal responses, we again observed several reliable patterns. There was a frequency effect: Issuing an LF response (either standard or catch) demanded greater effort (0.25 mmd), relative to emitting an HF response (0.18 mmd). Although it was present in both standard and catch trials, the frequency effect was stronger in catch trials (ηp2 = .62) than in standard trials (ηp2 = .14). Actually producing the catch response, however, demanded fewer resources (0.16 mmd), relative to standard naming (0.27 mmd). We interpret this finding as a speech–motor benefit for repeated items. On average, participants successfully issued more than 50 “blah” responses per session. Over time, the associated motor program may have become well-practiced, demanding fewer resources.
After participants issued their spoken responses, they waited for variable ITIs (up to 6,000 ms) before the start of the next trial. During the postresponse period, we again observed a reliable main effect of word frequency, suggesting that the physiological reflection of the effort devoted to LF words levels off slowly, relative to the HF words: Whereas pupil diameters following HF words began to quickly constrict during this period (0.45 mmd), diameters following LF words were still more dilated (0.49 mmd). The main effect of delay was again evident, but was only reliable in catch trials.
In the present study, we assessed the relative cognitive demands of naming LF and HF words. By coupling a modified delayed naming procedure with pupillometry, we examined both standard (i.e., naming latencies) and novel (i.e., pupillary reflexes) manifestations of the word frequency effect. In the majority (80%) of experimental trials, participants issued standard delayed naming responses, following the perception of HF or LF words. In the remaining trials, they had to abandon the speech plan, saying “blah” instead, thereby equating speech production across HF and LF words. Our results generally replicated and extended those from Goldinger et al. (1997), although some differences were observed. Considering first the similar findings, we verified that standard naming is faster than switching to the “blah” response. Second, across all delay periods, we observed slower standard naming latencies for LF than for HF words. Although we did not observe frequency effects in the catch trials, the pupillary results support and extend those from Goldinger et al. (1997): Using pupil dilation as a proxy for cognitive effort (Kahneman, 1973), we observed that preparing to speak LF words, relative to HF words, demanded greater cognitive resources, a difference that persisted even after speech programs had been executed. The results suggest that lexical processes can be examined by monitoring pupillary reflexes (Kuchinke et al., 2007), potentially revealing differences in cognitive demands, even in cases with equivalent overt performance (Karatekin et al., 2004).
Although the pupillary results support earlier findings, it is important to note that two results from Goldinger et al. (1997) were not replicated in the present RT data. First, in the standard naming trials, the effect of delay was small. Whereas responses typically become faster with longer naming delays (Balota & Chumbley, 1985; Goldinger et al., 1997; Kawamoto, Liu, Mura, & Sanchez, 2008), we observed only a 20-ms effect. In contrast, the catch trials showed a robust delay effect. Together, these results suggest that participants may have been anticipating switch (“blah”) trials to an unreasonable degree. In a result of greater relevance to the present article, although we observed a reliable (albeit small) frequency effect in the standard delayed naming trials, we found no such effect in the catch trials, and thus failed to replicate this result from Goldinger et al. (1997).
We have two main hypotheses regarding the differences across the studies. Our primary hypothesis is that the present stimulus items were chosen in a manner that maximally preserved visual features across matched LF and HF words, but also minimized phonetic differences. Many pairs contained similar sound patterns (e.g., drawn/drown, mouse/moose, sweet/sweep), and some were fully homophonic (e.g., course/coarse, pain/pane, read/reed). We selected items with strict criteria in order to avoid unwanted pupillary reflexes, and we verified (in a pilot study) that they produced a reliable frequency effect in standard naming. Although the executed motor plan was always the same in catch trials, the initial speech motor plan (i.e., speaking the printed word) was left free to vary, although perhaps not enough to elicit a difference in task-switching time. Once word perception was complete in delayed naming, it may have been that the speech–motor programs were too similar across LF and HF words to reliably elicit a difference.
Our second hypothesis regards the pace of the experiment—in particular, its overall slow RTs. In the present experiment, the average RT in standard delayed naming was approximately 780 ms; analogous RTs in Goldinger et al.’s (1997) study were approximately 600 ms. Once again, this difference may reflect the phonetic composition of the stimulus materials. The present experiment included many items with “soft” initial phonemes, which are well known to affect voice-key responses (Kawamoto et al., 2008; Kessler, Treiman, & Mullennix, 2002). It is also possible that participants responded slowly because of the pace of the experiment. In order to allow pupils adequate time to return to baseline dilation, we introduced long ITIs (from 4 to 6 s) that may have slowed performance. The present experiment lacked the rhythm of most naming experiments, which has been shown to strongly affect RTs (see Kello, 2004). Our procedure also involved a forced choice response (standard/catch) after the response tone, whereas most delayed naming procedures merely involve naming. By introducing an extra task demand, we may have disrupted the normal rhythm of delayed naming, thereby slowing naming latencies and obscuring RT-based frequency effects. Whether either hypothesis is correct, it is important to emphasize that RTs represented only one dependent measure. Pupil dilation, measured in those same trials, revealed effects that conceptually replicated those from Goldinger et al. (1997), suggesting that the lower-frequency words imposed greater cognitive load.5
As noted earlier, word frequency effects are ubiquitous in perceptual identification (Goldiamond & Hawkins, 1958; Manelis, 1977), lexical decision (Whaley, 1978), silent reading (Inhoff & Rayner, 1986; Rayner & Duffy, 1986), recognition memory (Glanzer & Adams, 1985), and other domains. Despite the theoretical importance of word frequency, debate surrounds its locus in the stream of cognitive processing. For example, although word frequency clearly affects perception (O’Malley, Reynolds, Stolz, & Besner, 2008; but see McCann & Besner, 1987; Paap & Johansen, 1994), interactions between word frequency and tasks (e.g., lexical decision vs. naming) suggest that the decision stage is also frequency-sensitive. To examine decision-free frequency effects, researchers may use the delayed naming procedure, wherein naming occurs once lexical access is complete. If word frequency exerts an influence solely during the perception and decision stages, it should disappear in delayed naming; however, if postaccess processes are frequency-sensitive, the differences should persist. We observed the latter result, suggesting that frequency modulates the cognitive effort required for speech planning (Goldinger et al., 1997; see also Herdman, 1992).
With the exception of event-related potential (Dambacher, Kliegle, Hofmann, & Jacobs, 2006; Hauk & Pulvermüller, 2004; Rugg, 1990) and fMRI (Chee, Westphal, Goh, Graham, & Song, 2003; Kronbichler et al., 2004) investigations, frequency effects are typically observed as overt behavioral differences across tasks, such as naming times or fixation durations. Although behavioral measures clearly identify frequency-based differences in lexical processing, they may mask dynamic attentional differences that persist after the offset of a behavioral response. In our study, we examined naming times but also estimated cognitive demands via pupillometry. As noted earlier, researchers have long examined the pupillary reflex to study cognitive processes, including lexical decision (Kuckinke et al., 2007), visual search (Porter et al., 2007), attention and concentration (Bradshaw, 1968), imagery (Paivio & Simpson, 1966), and memory (Papesh & Goldinger, 2011; Papesh, Goldinger, & Hout, 2012). In such studies, the pupils begin to dilate within 200–300 ms following the onset of a stimulus for cognitive processing. Constriction then follows the offset of cognitive demand, on a time course similar to that of dilation. For example, in a short-term memory task in which participants held digits in working memory prior to a “recall” signal, pupils reached maximum dilation during a preresponse pause, and then gradually constricted with each recalled digit (Kahneman & Beatty, 1966).
In the present study, participants’ pupil diameters increased sharply during the response preparation phase (the period directly following the response tone); this increase was consistently higher for LF than for HF words. Because this occurred in both standard and catch trials, it suggests that, regardless of the speech–motor plan, LF words demanded greater cognitive resources during processing. In fact, the frequency effect was numerically stronger during catch trials, wherein participants issued the same speech responses, regardless of the perceived word. Although the motor program for catch trials was repeated often throughout the experiment, thus producing smaller peak differences from baseline, such trials yielded large pupillary frequency effects: Planning to speak LF words, relative to HF words, demanded greater cognitive resources. (Notably, the numerically larger effect for catch trials may have been an artifact of having fewer observations, by design.) Furthermore, even after the speech–motor plan had been executed, LF words continued to exert an influence over the availability of cognitive resources. As can be seen in Figs. 3 and 4, we observed persistent frequency effects in the postresponse phase of each trial. This phase lasted at least 4,000 ms, which is sufficient time to allow the pupils to constrict following a naming response (Kahneman & Beatty, 1966). We thus observed a lasting effect of word frequency, continuing even as participants prepared for the subsequent trial. Indeed, as we noted in note 3, pilot testing for this experiment showed that the word frequency of trial n significantly influenced the starting diameter during trial n + 1, forcing us to use a trial-by-trial baseline correction procedure on our pupil data. (Note that, in the righthand panel of Fig. 4, the uncorrected functions show a frequency-based diameter difference prior to word onset.) LF words demand more cognitive resources during processing, leaving a diminished “pool” of available resources after they are processed. Although this hypothesis requires further testing, it suggests that higher-level cognitive operations, such as reasoning or text comprehension, may suffer persistent lags when a reader encounters LF words with minimal supporting context.
In summary, the present results suggest that word frequency affects postlexical access processes, such that LF words demand greater attention in speech planning and beyond. By monitoring pupil diameters as participants perceived and prepared to speak words of varying frequencies, we observed that the cognitive demand imposed by LF words reliably exceeded that of HF words. This difference in demand occurred over an extended time course, including the postspeech “recovery” time. Current theories of frequency effects in word perception typically focus on discrete stages, from perception through behavioral output. Analogous to “spillover” effects in reading (Rayner & Duffy, 1986; Reichle, Pollatsek, & Rayner, 2006), our results suggest that frequency may continue to affect cognitive processing after perception, lexical access, and naming are complete. The results also suggest that pupillometry offers a reliable method to estimate the cognitive demand imposed by different variables in word recognition.
Note that the present method is agnostic with respect to cascaded versus discrete stages of processing. By allowing perceptual processes to complete and equating production (during catch trials), our method allowed us to examine the influence of word frequency during speech planning.
Although we expected the difficulty of the response tone discrimination to interact with any observed frequency effects, this was not borne out by the data. All of the analyses were collapsed across differences in the response tones.
In pilot tests without a 6,000-ms blank screen, we found that frequency strongly influenced pupil diameters in subsequent trials. Additional pilot testing identified 6,000 ms as the maximum time necessary for the pupils to sufficiently constrict in order to eliminate artifacts from preceding trials.
Participants averaged 17.44 errors each. Of the 715 total errors committed, 502 were voice-key errors (i.e., the participant did not speak loud enough to trigger the voice key), 147 were mispronunciations, 13 were pretone responses, 49 were catch trial errors, and 4 were standard trial errors (i.e., saying “blah” in a standard trial). Generally speaking, all errors were more likely when trials contained LF words, but we refrained from analyzing the data, given the preponderance of voice-key errors.
We thank Michael Hout, Kathleen Powers, Justin Wilson, and Anthony Barnhart for their support on this project. We also thank David Plaut, Keith Rayner, Michael Cortese, and Derek Besner for their valuable comments on previous versions. Support was provided by NIH/NIDCD Grant R01-DC04535-11 to the second author.