Coordinating multiple actions can be challenging compared to concentrating on only one action at a time. A plethora of research has been dedicated to the difficulties associated with simultaneous execution of two actions (vs. only one), which can be observed in terms of slower and more error-prone responses (i.e., dual action costs, e.g., Fagot & Pashler, 1992; Koch et al., 2018; Pashler, 1994). Some studies, however, have also demonstrated difficulties associated with the overt execution of only a single action, namely in situations when this is accompanied by a requirement for simultaneous inhibition of another, highly prepotent action (e.g., Miller, 2006). In these situations, executing both actions turns out to be easier due to the absence of inhibitory demands (dual-action performance benefit). This benefit is often most pronounced in the error data due to frequent false-positive executions of a prepotent second response in single-action trials. The present study investigated how the occurrence of false-positive action executions in single-action trials is affected by stimulus-to-response (S-R) translation ease for the to-be-inhibited response, a factor that up to now has not received systematic empirical attention.

Concurrent executive and inhibitory response demands, giving rise to dual-action benefits, have been studied in variations of the single-dual-switch (SDS) paradigm (see e.g., Huestegge et al., 2023; Huestegge & Strobach, 2021; Strobach & Huestegge, 2021). In the SDS paradigm, participants are required to frequently switch between executing single actions and dual actions, thereby ensuring a high baseline activation of each action type across a block of trials (Lussier et al., 2012; Schumacher et al., 2001). In particular, dual-action benefits are typically observed in a variant of this paradigm involving a single stimulus triggering both single actions and dual actions (see Fagot & Pashler, 1992). Specifically, a cue typically indicates the type of action required (i.e., either a single action [A], the other single action [B], or the combined dual action [A + B]), while a separate target stimulus (or stimulus dimension) indicates the full specification (e.g., direction) of the currently relevant action(s). The requirement to execute only single action [A] can create an (implicit) inhibitory demand (inhibiting the other action [B]) whenever some preconditions are met that affect the urge to execute the currently unwarranted action [B], that is, its action prepotency. The typical finding is a preponderance of false-positive executions of the currently unwarranted, but prepotent response alternative along with the required response in single-action trials (Huestegge & Koch, 2014; Raettig & Huestegge, 2018). Inhibitory failures of this kind have been demonstrated for saccadic eye movements combined with manual key presses (Huestegge & Koch, 2014), and for manual key presses combined with directional vocal responses (Raettig & Huestegge, 2018). This might indicate similar mechanisms underlying involuntary co-execution with different response modality combinations, although this remains to be tested more explicitly in the future. Dual-action trials, in contrast, are usually less error-prone due to the absence of inhibitory demands. Sometimes, even faster dual-action (vs. single-action) reaction times (RTs) have been observed, in particular when presenting cue (action type) and imperative stimulus (action direction) as a single compound (Raettig & Huestegge, 2021, 2023).

The generation of inhibitory failures in the SDS paradigm is hard to explain with some traditional models of multiple action control assuming separate (serial or parallel) selection processes for each individual action without reference to content-based interaction between processing streams (Meyer & Kieras, 1997; Navon & Miller, 2002; Pashler, 1994). In contrast, models incorporating the notion of information crosstalk as a central mechanism can more readily explain the phenomenon of unwanted action co-execution (Huestegge, 2011; Logan & Gordon, 2001; Navon & Miller, 1987). In one such model, action selection is achieved by binding the correct action-relevant representations (codes) in working memory during a unitary central mapping selection stage (Huestegge, 2011; Huestegge & Koch, 2010) in which all currently relevant responses are prepared. For example, the specification of a manual-oculomotor dual action to the left would require binding (at least) a “left” (vs. “right”) spatial code to both a “manual” and a “saccade” modality code. In contrast, a single manual action to the left would require binding the “left” code only to the “manual” (not the “saccade”) code. In this case, activation could easily spread to the saccade code due to its strong a-priori activation (assuming a generally high prepotency for oculomotor responses, see below), thereby erroneously binding this response modality (code) into the action plan. Correct single-manual execution would therefore require some degree of active inhibition or downregulation of the prepotent saccade modality code (or its association with the spatial codes, see Huestegge & Koch, 2014).

Previous research has established that the likelihood of inhibitory failures (e.g., false-positive saccades) indeed depends to a large degree on the a priori activation of the to-be-inhibited response, that is, its prepotency (Ridderinkhof et al., 2014). A number of factors were shown to affect action prepotency. For example, it has been demonstrated that the erroneous execution of an unwarranted response is more likely when this very response was recently required, that is, it is especially difficult to withhold a saccade or manual response in a current trial when saccade or manual response execution was required in the previous trial. Importantly, this phenomenon even occurred irrespective of its particular direction (Kürten et al., 2022, 2023), suggesting that prepotency can be regulated at the level of the whole effector system (e.g., “oculomotor system”), not only at the level of the individual (fully specified) response (e.g., a “saccade of a specific length to the left”). On the other hand, the lack of any requirement to execute the to-be-inhibited response throughout a block of trials renders saccadic inhibition comparatively easy, but, notably, still far from perfect – while manual inhibition almost always succeeds in such a condition (Raettig & Huestegge, 2023). This indicates that some effector systems are inherently more prone to unwarranted execution, such as oculomotor responses compared with manual (or vocal) responses (Kürten et al., 2022). Further, pre-knowledge of the relevant response modality (modalities) allows for pro-active inhibition of unwarranted effector systems given sufficient time, even without pre-knowledge of the required response direction (Kürten et al., 2023; see also Greenhouse et al., 2012). Taken together, these observations show that action prepotency can be dynamically modulated. The present study asked whether a particularly easy (more automatic) S-R translation contributes to action prepotency and whether a reduction of S-R translation ease helps to reduce the frequency of inhibitory failures.

Previous studies on unwarranted response co-execution have exclusively used target stimuli with an inherent association with the required response direction. For example, a salient onset of peripheral (left/right) visual stimuli (e.g., Kürten et al., 2022) or centrally presented directional words indicated that a left/right response should be executed (Raettig & Huestegge, 2018, 2021, 2023). In such a setting, the spatial code is available almost instantly, allowing for a quick activation of an associated, but unwarranted effector system before inhibition is complete. However, it appears plausible to assume that the ease of S-R translation should differ between, for example, peripheral and central (symbolic) stimuli (Luo & Proctor, 2018; Müller & Rabbitt, 1989). This in turn might affect the degree of action prepotency, and thereby the occurrence of inhibitory failures underlying dual-action benefits. In this case, the propensity toward inhibition failures might decrease if S-R translation ease is reduced because of a generally lower action prepotency. Furthermore, one might speculate that inhibitory failures are to a large extent instances of relatively short-latency reflexive actions (such as express saccades, e.g., Findlay & Walker, 1999; Fischer & Ramsperger, 1984; Huestegge et al., 2019) caused by peripheral stimuli instead of more “top-down controlled” actions (e.g., in response to a symbolic central stimulus). Alternatively, action prepotency might depend on dimensional (here: spatial) overlap between the imperative stimulus and the (to-be-inhibited) response. If so, the occurrence of inhibitory failures might completely vanish if there was no spatial information inherent in the stimulus at all (Kornblum et al., 1990). However, these assumptions have not yet been explicitly tested within a single experimental setup.

The present study therefore aimed to determine the impact of S-R translation ease on action prepotency and inhibitory failures in a situation requiring frequent simultaneous action execution and inhibition. To this end, we had participants switch between executing spatial (left vs. right) single manual key presses, single saccades, and manual-oculomotor dual actions in response to a single visual stimulus. Overall, we expected to replicate the strong preponderance of false-positive saccade executions in single manual trials over other types of errors (i.e., false-direction errors, false-negative errors), particularly in comparison to dual-action trials (i.e., we expected to find relative dual-action benefits). Note that false-positive key presses have been observed in such a combination of effector systems (Kürten et al., 2023), but to a much smaller degree than false-positive saccades, which is why we base our interpretations mainly on effects in the latter response modality. Critically, we also manipulated the stimulus mode and thus the ease of S-R translationFootnote 1. A classic way of manipulating S-R translation ease is the introduction of spatial S-R incompatibility: For example, it is more difficult to execute a “right” response based on a stimulus indicating a “left” response (Fitts & Seeger, 1953; Kornblum et al., 1990). However, in the current setting, such a manipulation of S-R translation ease would introduce a spatial conflict between the prepotent but unwarranted response tendency and the actually required response in single-action trials. For instance, in an incompatible single-manual trial a stimulus indicating a “left” response would create a general tendency towards a left response that would have to be overcome only for the manual response but not for the unwarranted oculomotor response. To avoid such a potential confound and to allow for a better comparison with previous SDS studies, we therefore resorted to a procedure typically used in cueing paradigms, namely the utilization of peripheral versus central (symbolic) stimuli (Posner, 1980, 2016; Remington, 1980). Specifically, we used three types of visual stimuli: (1) a salient peripheral (left vs. right) square, (2) a centrally presented arrow pointing to the left or to the right, and (3) a centrally presented shape arbitrarily mapped to left versus right responses. We reasoned that the onset of the peripheral square should be translated into the corresponding spatial response code in a most direct, quasi-automatic manner due to its inherent spatial properties (Lu & Proctor, 1995; Müller & Rabbitt, 1989). With the central arrow being a symbolic stimulus, participants must resort to learned (symbolic) conventions to extract its meaning (Miles & Proctor, 2012). As a consequence (and similar to exogeneous vs. endogenous cues utilized in typical cueing paradigms, see Posner, 1980, 2016), arrows should imply a somewhat reduced S-R translation ease when compared with the peripheral stimuli. Finally, participants should have the strongest difficulties with translating the arbitrary central shape into left (vs. right) response codes due to the lack of any (physical or symbolic) dimensional overlap (Kornblum et al., 1990). On a neurophysiological level, this is reflected, for example, by later lateralized readiness potentials with arbitrary central (color) cues compared to central arrow cues signaling eventual response direction (Eimer, 1995; Eimer et al., 1995). The three levels of S-R translation ease should be reflected by increasing corresponding RT levels in either response modality (RT: peripheral square < central arrow < central shape), which thus serves as a manipulation check. Central to the present study, however, was the effect of the S-R translation ease manipulation on error rates, particularly, on false-positive saccade executions in single manual trials as indications of inhibitory failures.

The following prediction alternatives were made based on competing theoretical assumptions. First, the frequency of inhibitory failures could decrease with decreasing S-R translation ease: Activation from the spatial stimulus should spread more quickly to an unwarranted saccade code the faster this spatial information is available, granting less time for inhibitory control. Specifically, if inhibition of the saccade in single-manual trials proceeds in parallel with (and independent from) S-R translation as is usually assumed in independent race models of inhibitory control (Logan & Cowan, 1984; Matzke et al., 2018), any prolongation of this process should yield better inhibitory control (extended time for inhibitory control account). Taken together, this reasoning would be reflected by a decreasing rate of false-positive saccades with increasing RTs (due to more difficult S-R translation). Second, an increasing rate of false-positive saccades with increasing RTs (due to more difficult S-R translation) could also be expected, for example, based on the assumption of fewer processing resources being available for effector system inhibition due to a more difficult direction specification. This would, of course, require a dependence of effector system inhibition and S-R translation on a common, limited processing capacity (translation-inhibition conflict account, see e.g., Verbruggen & Logan, 2015). Finally, a third possible outcome would be a comparable frequency of false-positive saccades across levels of S-R translation ease. This would, on a theoretical level, indicate that the failure (or success) of effector system inhibition was (at least in part) determined independently from (i.e., before) the specification of the spatial response parameters conveyed by the imperative stimulus (hierarchical action specification account). This prediction appears reasonable based on studies in the domain of manual motor control demonstrating that the selection of the hand (effector) occurs prior to target specification in manual aiming movements (Herbort & Rosenbaum, 2014).

Any potential differences in inhibitory failure rates across conditions could be further affected by two additional mechanisms. First, false-positive errors could largely represent short-latency reflexive actions such as express saccades (Findlay & Walker, 1999; Fischer & Ramsperger, 1984; Meeter et al., 2010) executed independently from the actually required (manual) action. If this is the case, we would expect particularly frequent false-positive saccades with the peripheral square stimulus (reflexive inhibition failures). If, in contrast, false-positive errors represent failures to implement the proper behavior during action selection (i.e., when saccades are co-activated when the time has come to initiate the manual response as posited by the spreading activation model described above), we would not expect exceptionally high rates of false-positive saccades with the peripheral stimulus (Everling & Johnston, 2013). Finally, if spreading activation leading up to false-positive errors was mainly triggered by the stimulus, for example, due to spatial dimensional overlap with the to-be-inhibited response, we should observe false-positive saccades and a potential modulation by S-R translation ease with the peripheral square and the central arrow but not with the central shape stimulus (stimulus-based co-activation, see Kornblum et al., 1990). If inhibitory failures were instead triggered not only by strong a-priori stimulus-response associations but by a strong coupling between both effector systems, we should instead observe frequent false-positive saccades even with the central square stimulus (action-based co-activation).

Methods

Preregistration, raw data, and analysis scripts are available at https://osf.io/9shkf/?view_only=9d93819f46b746449803eabbd579036a.

Participants

Based on a previous study (Kürten et al., 2023), we determined that a sample size of 10 would suffice to demonstrate accuracy-related dual-action benefits in saccade errors (main effect action demand, \({\widehat{\eta }}_{p}^{2}\) = 0.544) with a power of 1-\(\beta\) = 0.95 at a significance level of \(\alpha\) = 0.05. To achieve sufficient power to find a potential modulation of dual-action benefits by stimulus mode, we decided to collect data from forty-eight volunteers (mean age = 25.1 years, SD = 4.4, 76.79% female, 98.21% right-handed) with normal or corrected-to-normal vision (without color blindness). Eight data sets were replaced with the help of new participants due to either missing data (2 data sets), a global error rate of > 30% (6 data sets), and/or due to producing < 10 valid correct RTs in any of the experimental conditions (3 data sets). All participants gave their informed consent and received monetary compensation.

Apparatus, stimuli and procedure

Fig. 1
figure 1

Experimental setup. A. Schematic trial structure. B. Exemplary color-to-action demand mappings (counterbalanced across participants). C. Stimulus modes. For each stimulus mode, 24 practice trials followed by three experimental blocks of 60 trials each (i.e., 20 per action demand, 10 per action demand and direction) were administered. See the online article for the color version of this figure

After giving their informed consent, participants were seated in front of a 21” cathode-ray tube monitor with a spatial resolution of 1024 × 768 pixels and a temporal resolution of 100 Hz at a viewing distance of 67 cm. Latencies and directions of eye movements of the right eye were recorded using an EyeLink 1000 eye-tracking system (SR Research, Ontario, Canada) with a sampling rate of 1000 Hz. Head movements were minimized using a chinrest with forehead support. Manual responses of the left and right index fingers were recorded using two buttons on a Cedrus response pad (Cedrus corporation) placed on the desk instead of the regular computer keyboard (and thus not in the line of sight of the participants). All stimuli were presented against a black background. The experiment comprised three sub-sessions, one for each stimulus mode, consisting of written and verbal instructions followed by a 24-trial training block that could be repeated in case of insufficient task understanding and three experimental blocks with 60 trials each (20 per action demand, 10 per action demand and direction). The order of stimulus-mode sub-sessions was counterbalanced across participants. Figure 1 depicts a schematic trial structure. Each trial started with the presentation of the central fixation cross at the screen center (a plus sign, size 10 pixels, 0.33°VA) and two saccade landing targets (black squares with white borders size 10 pixels, 0.33°VA, border width 1 pixel) at an eccentricity of 8°VA to the left and right of the screen center. After 500 ms, the cross changed color to either red, green or blue to indicate action demand (e.g., red = single saccade, green = single manual, blue = dual, mapping counterbalanced across participants). After 200 ms the colored cross disappeared while the saccade targets remained visible. After another 200 ms, the imperative stimulus was presented for 1000 ms. If no response was recorded within this time window, an error message (“Zu langsam”, German for “Too slow”) was displayed in white Times New Roman font (size 20) for 1000 ms. No other error feedback was given. Three types of stimuli were used. (1) A white square (size 10 pixels, 0.33°VA) that filled either the left or the right saccade landing target, (2) a white arrow (size 10 pixels, 0.33°VA) presented at the screen center, and (3) a white shape (diamond and circle, size 10 pixels, 0.33°VA) presented at the screen center. Trials were separated by a 500 ms blank-screen inter-trial interval (ITI). Participants responded spatially (left vs. right) with a manual button press, with a saccadic eye movement to one of the (left/right) saccade landing targets, or with both responses. Responses to the peripheral stimulus and central arrow were always spatially compatible to the stimulus (i.e., left peripheral square/left-pointing central arrow required a left button press and/or a leftward saccade and vice versa). The mapping of the central shape to response direction was arbitrary (e.g., diamond = left, circle = right) and counterbalanced across participants. The experiment was programmed using Experiment Builder software (version 2.3.527, SR Research).

Data analysis

Reaction times (RTs, ms) and error rates (ERs, %) served as dependent variables. Practice blocks were removed from all analyses. From the remaining nine experimental blocks, we removed premature responses (RTs < 50 ms, 0.34% of all trials). The remaining trials were analyzed for errors, including false-direction errors (i.e., wrong button pressed/saccade toward the wrong target), false-negative errors (e.g., no saccade executed in single saccade/dual-action trials) and, indicating inhibition failure, false-positive errors (e.g., saccade executed in single manual trials). From RT analyses, we removed all error trials (14.14% of all trials) and outliers (RT > 3 SDs away from cell mean, 1.84% of all trials). In total, 15.91% of all trials were removed from RT analyses. RTs and ERs were subjected to repeated measures analyses of variance (ANOVAs) with the within-subject independent variables action demand (single manual, single saccade, dual) and stimulus mode (peripheral, central arrow, central shape). Since errors in both response modalities could occur in every trial, saccades and manual responses were analyzed separately. Note that ERs comprised different types of errors that can occur at different action demands in each response modality (see Table 1). For example, false-positive saccade errors can only occur in single manual trials while false-positive manual errors can only occur in single saccade trials. In RT analyses, the factor action demand was effectively reduced to two levels (single vs. dual) for each response modality since we only included correct responses (i.e., no saccade RTs in single manual trials, no manual RTs in single saccade trials). The significance level was set to \(\alpha\) = 0.05 for main analyses. Greenhouse-Geisser corrections were routinely applied to within-subject factors with more than two levels. Significant main effects of factors with more than two levels and significant interaction effects were followed up by Bonferroni-corrected pairwise comparisons. Data processing and analysis was conducted in R (R core team), the papaja package (Aust & Barth, 2014/2022). was used to write a computationally reproducible methods and results section. Additionally, we employed the TOSTER R package (Lakens & Caldwell, 2023) for equivalence tests on false-positive ERs, specifically saccade errors in single-manual trials and manual errors in single-saccade trials. This procedure was aimed at assessing potential modulations by stimulus mode within the context of a previously observed reduction of false-positive ERs by extending preparation time between cue and imperative stimulus from 100 ms to 400 ms (Kürten et al., 2023). Equivalence bounds were set to 12% for false-positive saccades and to 2% for false-positive manual key presses. Equivalence tests were corrected for multiple comparisons based on a suggestion by Lauzon and Caffo (2009).

Table 1 Possible error types for different action demands and response modalities

Results

Correct RT data

Fig. 2
figure 2

Correct reaction time data. Correct RTs (ms) as a function of action demand (dual, single saccade, single manual) and stimulus mode (peripheral square, central arrow, central shape). Every saccade (manual) response in single manual (single saccade) trials represented a false-positive error and thus produced no correct reaction time. Error bars represent standard error of the mean (SEM)

Complete descriptive summary statistics can be obtained from Table S1 in the supplementary material. Correct RTs of both response modalities as a function of action demand (trial type) and stimulus mode are displayed in Fig. 2.

Saccade RTs. The main effect of action demand was not significant, \(F(1,47)=0.09\), \(p=0.766\), \({\widehat{\eta }}_{p}^{2}=0.002\). The main effect of stimulus mode was significant, \(F(1.61,75.53)=316.90\), \(p<0.001\), \({\widehat{\eta }}_{p}^{2}=0.871\). Saccade responses to the peripheral square (M = 229 ms) were faster than saccade responses to the other two stimulus modes (both ps < 0.001). Saccade responses to the central arrow (M = 314 ms) were faster than saccade responses to the central shape (M = 402 ms, p < 0.001). The interaction of action demand and stimulus mode was not significant, \(F(1.84,86.50)=2.37\), \(p=0.104\), \({\widehat{\eta }}_{p}^{2}=0.048\).

Manual RTs. The main effect of action demand was significant, \(F(1,47)=41.71\), \(p<0.001\), \({\widehat{\eta }}_{p}^{2}=0.470\). Manual responses were, overall, faster in single manual trials (M = 462 ms) than in dual-action trials (M = 505 ms). The main effect of stimulus mode was significant as well, \(F(1.62,76.00)=204.71\), \(p<0.001\), \({\widehat{\eta }}_{p}^{2}=0.813\). Manual responses to the peripheral square (M = 408 ms) were faster than manual responses to the other two stimulus modes (both ps < 0.001). Manual responses to the central arrow (M = 472 ms) were faster than manual responses to the central shape (M = 571 ms, p < 0.001). The interaction of action demand and stimulus mode was significant, \(F(1.98,93.29)=4.60\), \(p=0.013\), \({\widehat{\eta }}_{p}^{2}=0.089\). Dual-action costs were significant for all stimulus modes (all ps < 0.001) but they were largest for the central arrow stimulus (\(\varDelta M=55\)ms).

Error data

Fig. 3
figure 3

Error rates as a function of stimulus mode and action demand. Note. ERs (%) of both response modalities (saccade, manual) as a function of action demand (dual, single saccade, single manual) and stimulus mode (peripheral square, central arrow, central shape). Error bars represent the standard error of the mean (SEM)

Figure 3 displays ERs as a function of action demand and stimulus mode. Saccade ERs. The main effect of action demand was significant, \(F(1.17,54.97)=72.23\), \(p<0.001\), \({\widehat{\eta }}_{p}^{2}=0.606\). Saccade errors were least frequent in dual-action trials (M = 7.03%), not significantly more frequent in single saccade trials (M = 7.96%, p = 0.100) and, crucially, most frequent in single manual trials (M = 18.16%, both ps < 0.001) indicating overall dual-action benefits due to frequent false-positive saccades. The main effect of stimulus mode was significant as well, \(F(1.53,72.10)=30.28\), \(p<0.001\), \({\widehat{\eta }}_{p}^{2}=0.392\). Saccade errors were least frequent with the central arrow stimulus (M = 7.84%) but were not significantly less frequent compared to the peripheral square stimulus (M = 9.07%, p = 0.401). Saccade errors were most frequent with the central shape stimulus (M = 16.24%, both ps < 0.001). The interaction of action demand and stimulus mode was significant, \(F(2.75,129.16)=14.17\), \(p<0.001\), \({\widehat{\eta }}_{p}^{2}=0.232\). Visual inspection of Fig. 3 suggests that this interaction was mainly driven by the high ERs in dual-action and single saccade trials in conditions involving the central shape stimulus and Fig. 4 indicates that the these mainly contained false-direction errors instead of false-negative errors indicating difficulties with correctly classifying the arbitrary central shape. Saccade errors in single manual trials (false-positive saccades) were significantly more frequent than saccade errors in dual-action trials (false-direction & false-negative saccade errors) with all stimulus modes (all ps < 0.003). Crucially, however, false-positive saccade ERs were on a high level across all stimulus modes. Bonferroni-corrected comparisons of false-positive saccade ERs in single manual trials yielded no significant differences (all ps > 0.066). However, based on exploratory equivalence tests, we could establish equivalence between stimulus modes regarding false-positive errors within equivalence bounds of ± 12% points (all ps < 0.001).

Manual ERs. The main effect of action demand was significant, \(F(1.77,83.30)=10.61\), \(p<0.001\), \({\widehat{\eta }}_{p}^{2}=0.184\). Manual ERs did not differ significantly between dual-action trials (M = 4.89%) and single saccade trials (M = 3.31%, p = 0.502). Manual errors were, more frequent in single manual trials (M = 8.16% compared to the other two action demands (both ps < 0.027). The main effect of stimulus mode was significant as well, \(F(1.25,58.58)=58.60\), \(p<0.001\), \({\widehat{\eta }}_{p}^{2}=0.555\). The frequency of manual errors with the peripheral square stimulus (M = 3.35%) did not differ significantly from those with the central arrow stimulus (M = 3.31%, p > 0.999). Manual errors were more frequent with the central shape stimulus (M = 9.70%) compared to the other two stimulus modes (both ps < 0.001). Finally, the interaction of action demand and stimulus mode was significant, \(F(2.74,128.90)=21.45\), \(p<0.001\), \({\widehat{\eta }}_{p}^{2}=0.313\). This was mainly due to the high manual ERs in dual-action trials and single saccade trials in conditions involving the central shape stimulus. False-positive manual ERs in single saccade trials did not differ significantly between conditions involving the peripheral and the central arrow stimuli nor between conditions involving the peripheral and the central shape stimuli (both ps > 0.134), but they did differ between conditions involving the central arrow and the central shape stimuli (p < 0.001). Based on exploratory equivalence tests, we could not reject the equivalence of false-positive manual ERs across stimulus modes within bounds of ± 2% points (all ps > 0.062).

Discussion

The present study investigated the effect of S-R translation ease on inhibitory failures in a setting that sometimes required the execution of one (e.g., manual) response and the simultaneous inhibition of another (e.g., saccade response). The main results can be summarized as follows. Correct RTs in both response modalities steadily increased as a function of stimulus mode (RT: peripheral square < central arrow < central shape), indicating that the manipulation of S-R translation ease indeed worked as intended. While saccades showed a comparable latency in single-action and dual-action trials, we observed dual-action costs for the manual key press across stimulus modes. This could indicate a motor bottleneck, since the key press was usually executed after the initiation and completion of the saccade (Bratzke et al., 2008, 2009) and/or prioritization of the “saccade part” of the oculomotor-manual dual action (Hoffmann et al., 2019; Pieczykolan & Huestegge, 2014). Qualitative differences between the two inherently spatial stimulus modes (peripheral squares and central arrows) and the non-spatial stimulus mode (central shapes) are reflected by a substantially higher frequency of directional and false-negative errors in the latter mode compared to the former modes (for a depiction of error rate as a function of stimulus mode and error type, see Figure S1 in the supplemental material). Thus, as expected, the presence of spatial information within the stimulus substantially improved the ability to select the appropriate response direction. Again, this corroborates the effectiveness of our manipulation of S-R translation ease. The main finding of the present study, however, was the relative independence of the frequency of false-positive saccades in single-manual trials from the manipulation of S-R translation ease.Footnote 2

Stimulus-based and action-based co-activation

False-positive saccades with very low (< 100 ms) RTs occurred more often with the peripheral square stimulus (3.46% of false-positive saccades) compared to the central arrow stimulus (0.86%) and the central shape stimulus (0.53%). However, false-positive saccades were clearly not limited to such express saccades, ruling out a simple reflexive inhibitory failures. This is remarkable given that we employed a blank screen gap between the offset of the central cue and the onset of the (peripheral or central) stimulus, a setting that has been shown to elicit a rather large amount of express saccades in situations without additional manual response demands (e.g., Fischer & Ramsperger, 1984). Further, erroneous saccade co-execution was not dependent on inherently spatial stimuli as there was no sharp drop in false-positive saccade rates in the central shape stimulus condition compared to the other conditions, speaking against a purely stimulus-based co-activation mechanism. Instead, it seems that false-positive saccade generation was tightly linked to the concurrent processing of the manual response, indicating action-based co-activation as an additional mechanism. Most importantly, even though RT levels within the different stimulus modes suggested a clear hierarchy of S-R translation ease, we did not find a corresponding gradual modulation of false-positive saccade frequency. We even found evidence for significant statistical equivalence regarding the false-positive ERs. The results thus quite clearly demonstrate that the overall frequency of false-positive saccades was rather unaffected by stimulus mode. Thus, we found no support for an extended time for inhibitory control account. It seems unlikely that participants simply used the extra time provided by prolonged S-R translation processes for (a) completing the selection of the proper effector system and (b) inhibiting the unwarranted effector system after the imperative stimulus was presented (as false-positive saccade frequency did not gradually decrease with S-R translation ease). On the other hand, the frequency of false-positive saccades also did not gradually increase as S-R translation ease decreased as predicted by a translation-inhibition-conflict account. Inhibitory control did not seem to suffer substantially from any limited capacity being diverted to S-R translation demands. This indicates that success (or failure) of saccade inhibition was, to some extent, already determined before the direction of the action was specified, compatible with a hierarchical action specification account.

This particular assumption is consistent with a model of saccade programming differentiating between a WHEN and WHERE pathway that are processed rather independently and potentially sequentially in our particular paradigm (Findlay & Walker, 1999; Huestegge et al., 2019). In addition, it is also compatible with more general motor programming studies (Rosenbaum, 1985) demonstrating that some action parameters (e.g., the hand used for aiming) are selected independently of other parameters (e.g., the target to aim for, see Herbort & Rosenbaum, 2014). Conversely, inhibitory control to prevent a false-positive saccade would need to act at the level of the effector system per se rather than on an already spatially specified saccade (Kürten et al., 2023).

Generation of a false-positive saccade in single-manual trials can be readily explained within the model of action selection proposed by Huestegge and Koch (2010, see also Huestegge, 2011). An illustration of the proposed mechanisms is depicted in Fig. 4. Proactive inhibition of the prepotent saccade system is initiated by the prior cue (Greenhouse et al., 2012). This is counteracted by activation spreading to the saccade modality code both from the spatial code (triggered by the stimulus, stimulus-based co-activation) as well as from the frequently associated manual modality code (action-based co-activation). Activating the saccade code above a certain threshold results in the false-positive execution along with the required manual response. False-positive saccades are likely preceded by competition between executory and inhibitory processes, given that they tend to have, on average, longer latencies, and smaller amplitudes than correct saccades (see Table S2 and Figure S2 in the supplementary material). This ties in with studies demonstrating a negative impact of increasing mental workload on saccadic latency and amplitude (e.g., Huestegge & Koch, 2009; May et al., 1990; Spering, 2022; Stuyven et al., 2000). It is important to note that the present framework is functional in nature rather than physiological. However, detailed physiologically grounded models of various processes relevant to the current results exist. Particularly, there is a large body of research on saccade initiation and inhibition in the stop-signal and anti-saccade paradigms mediated by fixation neurons in the superior colliculus (e.g., Bompas et al., 2020; Coe & Munoz, 2017). Another field potentially relevant to the current results is eye-hand coordination in reaching/pointing toward common targets controlled by a parieto-frontal network (e.g., Battaglia-Mayer et al., 2006; Mascaro et al., 2003). It should be noted, however, that these models might not directly apply to the current situation given the differences in the paradigms used. While exceeding the scope of the current study, future work may thus be dedicated to generalizing these models to the current setting yielding erroneous saccade co-execution.

Fig. 4
figure 4

Conceptual framework of (false-positive) action selection. A framework of action selection in single manual trials for the three different stimulus modes. The perceptual processing stage contains cue and imperative stimulus. The mapping selection stage (dashed box) contains potentially action-relevant modality codes (hand symbol = manual system, eye symbol = oculomotor system) and spatial codes (left-pointing arrow = “left”, right-pointing arrow = “right”) in working memory. The “fill level” of the modality codes indicate action prepotency (oculomotor system > manual system). Solid lines without arrowheads represent the currently relevant binding between the manual modality code and the “left” spatial code, dashed lines without arrow heads represent erroneously spreading activation due to frequent association. The action execution stage contains all potential task-relevant actions (left keypress, leftward saccade, right keypress, rightward saccade). Filled symbols designate the actually executed actions within a trial. In a single-manual trial, the colored cue activates the manual modality code (and prompts inhibition of the oculomotor modality), the stimuli activate the “left” spatial code at a different rate depending on S-R translation automaticity (indicated by line strength from stimulus to spatial code). Activation spreading from the manual and the “left” code to the prepotent saccade code (action-based co-activation of the other effector system) that counteracts the original inhibitory command based on the cue eventually resulting in execution of a false-positive leftward saccade along with the required left keypress. See the online article for the color version of this figure.

Saccade prepotency versus dual-action prepotency

Of course, most single-manual trials with a false-positive saccade also involved the execution of the required manual response.Footnote 3 The majority of false-positive saccades went into the same direction as the required manual response (see Figure S3 in the supplemental material). An interesting issue is therefore how these “erroneous dual actions” compare to the “correct dual actions” initiated in dual-action trials, as this has repercussions on what behavior actually is prepotent in single-manual trials: executing a saccade, or executing an oculomotor-manual dual action. On the one hand, false-positive saccades might be initiated rather independently from the required manual response. This would be in line with the idea that the saccade per se is prepotent. On the other hand, false-positive saccades could be part of a “Gestalt-like” compound action that is erroneously executed in some of the single-manual trials (of course, a mixture of both could also be assumed). It is very difficult to finally decide between these alternatives based on the current data. However, a preliminary answer could be generated by comparing the correlations of saccade and manual RTs and the inter-response intervals (IRIs) between correct and erroneous dual actions. Specifically, correlations between saccade and manual RTs were, on average, much lower in erroneous dual actions compared to correct dual actions (see Table S3 in the supplementary material). Additionally, IRIs in correct dual actions were distributed in a unimodal manner with a rather small variability indicative of compound responding (cf. Huestegge & Koch, 2009). In contrast, IRIs of erroneous dual actions were much more variable (see Figure S4 in the supplementary material). These exploratory analyses should be treated with caution, of course, given the low and variable (across participants) number of single-manual trials involving an erroneous dual action. However, they tentatively indicate that the saccade per se (and not an oculomotor-manual compound action) was prepotent in single-manual trials. This prepotent saccade was then erroneously triggered by the requirement to produce a frequently associated manual response. Such a mechanism of erroneous co-activation of a currently unwarranted response modality is reminiscent of the concept of response-response priming (Moeller & Frings, 2019a, b; see also Hommel, 2020). In corresponding experiments, it has been shown that the repetition (vs. change) of a response can retrieve (and thus facilitate) a recently associated response (independent from a stimulus repetition or change), even across different effector systems (Moeller & Frings, 2019c).

Conclusion

To conclude, the results of the current study allow us to further specify the mechanisms underlying erroneous (saccade) co-execution in situations requiring only a single (manual) response. Generally, the results emphasize the robustness of dual-action benefits characterized by false-positive responses in single-action contexts. Dual actions were, overall, easier to execute than single (manual) actions due to costly inhibitory demands for the latter. Importantly, the frequency of false-positive saccades in single-manual trials did not meaningfully depend on the ease of the oculomotor S-R translation process, ruling out simple explanations in terms of extended time for inhibitory control, translation-inhibition conflicts, or reflexive false-positive (express) saccades. Instead, they appeared to be tightly linked to the execution of a secondary manual response (action-based co-execution) and to be determined largely at the level of effector system representations, not at the level of a fully specified response (in line with a hierarchical action specification account).

The authors have no competing interests to declare that are relevant to the content of this article.