The abilities to select responses to specific stimuli (i.e., response selection) and to override a prepotent tendency to respond to other stimuli (i.e., response inhibition) have been attributed to core executive action control processes (Norman & Shallice, 1986). These functions are critical in everyday life, because they allow people to flexibly adjust their behavior according to their goals. Indeed, deficits in these cognitive operations contribute to several psychopathological conditions, including obsessive–compulsive disorder (Penades et al., 2007), attention-deficit hyperactivity disorder (Nigg, 2001), and addiction and eating disorders (Crews & Boettiger, 2009; Houben, 2011; Noel, Brevers, & Bechara, 2013). However, despite the similarities between response selection and inhibition—that is, both require the selection or nonselection of a response—the extent to which these processes relate and interact with each other is still unknown.

Response selection is a central decision-making process that maps incoming perceptual information to goal-appropriate motor output (Pashler, 1984; Welford, 1952). One theoretical framework that explicitly illustrates the various steps involved in the decision-making process is sequential-sampling models. According to these models, simple decision-making involves the accumulation of information from the environment until a certain response criterion threshold (i.e., the amount of information needed to select a response) has been reached (Brown & Heathcote, 2008; Ratcliff & Smith, 2004; Smith & Ratcliff, 2004; Usher & McClelland, 2001). The option that reaches the response criterion threshold first is selected and then executed. Whereas the processes at the perceptual and motor stages are thought to operate in parallel, the central response selection stage has been proposed to be capacity-limited—only capable of acting on a single task at any given time. This sensory–motor translation stage is therefore often studied using paradigms such as the psychological refractory period (PRP; Pashler, 1984; Welford, 1952), dual-task (Dux, Ivanoff, Asplund, & Marois, 2006; Dux et al., 2009; Schumacher et al., 2001; Sigman & Dehaene, 2008), and high-response-load single-task tests, because these measures place strong demands on the response selection system. Although conditions differ across these paradigms, they all require participants to encode sensory information and make a decision, and in paradigms with two tasks, participants must make simple response selections for items that occur simultaneously or in close succession.

In contrast, response inhibition is thought to be a heterogeneous construct that can be parsed into at least two different forms: task-relevant and task-irrelevant inhibition (Harnishfeger, 1995; Nigg, 2000). Successful inhibition of distractor stimuli is often measured in tasks where task-irrelevant information needs to be ignored, such as the Eriksen flanker task (Eriksen & Schultz, 1979) and Stroop task (Stroop, 1935). Task-relevant response inhibition, however, is often measured in tasks such as the stop-signal (Lappin & Eriksen, 1966; Verbruggen & Logan, 2008) and go–no-go (Donders, 1969) paradigms, which require the inhibition of task-relevant, prepotent response tendencies on a subset of trials. According to the popular race model account (Logan & Cowan, 1984), successful inhibition in the stop-signal task relies on the outcome of a race between independent “go” and “stop” processes. Inhibitory control succeeds when the stop process finishes the race before the go process, whereas inhibition fails if the go process reaches the response threshold first.

The attentional blink (AB) is another dual-task paradigm worth noting, since it is thought to tap both response selection and inhibition operations. The AB is a perceptual limitation that is reflected in the inability to consciously perceive and report the second of two targets presented in close succession (Jolicœur & Dell’Acqua, 1998; Raymond, Shapiro, & Arnell, 1992). Although most explanations of the AB invoke a consolidation/response-selection bottleneck or cognitive-control mechanism(s) (Dux & Marois, 2009), research suggests that a suppressive mechanism that inhibits the deployment of attentional resources to irrelevant distractor stimuli may also play an important role in target selection (Dux & Harris, 2007; Dux & Marois, 2008; Raymond et al., 1992; Wyble, Bowman, & Nieuwenstein, 2009).

Traditionally, response selection and response inhibition processes have been investigated separately, but increasing evidence is suggesting that these two operations may draw on a common resource (Mostofsky & Simmonds, 2008; van Gaal, Ridderinkhof, Fahrenfort, Scholte, & Lamme, 2008; Verbruggen, McLaren, & Chambers, 2014). For example, neuroimaging and transcranial magnetic stimulation (TMS) studies have shown that mid-dorsolateral and ventrolateral prefrontal areas (Bunge, 2004; Duncan & Owen, 2000) and the presupplementary motor area (pre-SMA; Buch, Mars, Boorman, & Rushworth, 2010) are activated by tasks that require the selection and inhibition of responses. For instance, Verbruggen, Aron, Stevens, and Chambers (2010) demonstrated that TMS of the right inferior frontal gyrus (IFG) impaired performance on both a stop-signal task and a dual task, suggesting that this region supports both response selection and inhibition processes. Taken together, these findings indicate that response inhibition and response selection protocols might tap, at least in part, a common mechanism of action control.

Although response selection and response inhibition processes may partially overlap at the neural level, evidence also suggests that response selection and response inhibition tasks recruit distinct brain regions. Indeed, neuroimaging studies have observed greater activity in the left hemisphere posterior lateral prefrontal cortex (pLPFC) during dual-task as compared to single-task trials, suggesting a role for this area in response selection (Dux et al., 2006; Dux et al., 2009; Jiang & Kanwisher, 2003; Miller & Cohen, 2001). Similarly, Filmer and colleagues demonstrated that transcranial direct current stimulation of the left but not the right pLPFC disrupts response selection and the associated training effects for both single and dual tasks (Filmer, Mattingley, & Dux, 2013; Filmer, Mattingley, Marois, & Dux, 2013). In contrast, right IFG appears to be crucial for the successful inhibition of an inappropriate motor response (Aron, Fletcher, Bullmore, Sahakian, & Robbins, 2003), since it shows greater activity during successful as compared to failed inhibition trials (for a recent review, see Aron, Robbins, & Poldrack, 2014). Moreover, TMS of the right IFG has been found to temporarily increase stop-signal response times (SSRTs)—an index of poorer inhibitory control—but TMS of other cortical regions, including the left IFG, right middle frontal gyrus, and right angular gyrus had no impact on SSRTs (Chambers et al., 2007; Chambers et al., 2006). These studies suggest process-specific functional differences and converge with the idea that response selection and response inhibition produce distinct neural signatures; namely, the left pLPFC is involved in response selection, whereas the right IFG is involved in response inhibition. Although no study has directly assessed whether these two processes can be fully dissociated in the brain, the neural findings do at least suggest that these two processes could reflect distinct forms of action control.

The independence of response selection and response inhibition processes is further suggested by behavioral studies. Whereas dual-task contexts typically result in performance decrements when central response selection processes overlap in time, performance on various inhibitory control tasks indicates that the go process can occur independently from the stop process (activated by the appearance of the stop signal; Logan & Burkell, 1986; Verbruggen & Logan, 2009; but see Verbruggen & Logan, 2015). For example, a recent study by Yamaguchi, Logan, and Bissett (2012) directly examined stop-signal performance in a PRP task, in which participants were required to correctly inhibit one response while carrying out the other response in a dual-task setting. In line with the independence assumption, response inhibition performance was not influenced by dual-task interference, supporting the notion that response inhibition is distinct from response selection demands. In agreement with the aforementioned neural work, these findings further suggest that response selection and response inhibition operations reflect different elements of action control.

While the above-mentioned studies provide valuable insight into the cognitive and neural substrates of action control, they also highlight the lack of clarity about whether a general action control mechanism underpins performance in a wide range of response selection and inhibitory control paradigms, or whether the differences that we observe in these tasks reflect distinct underlying processes. The previous work has been unable to resolve this conflict because the studies have only compared single examples of response selection and inhibition tasks to test for possible overlap or dissociation between the two processes. A drawback of this approach is that it is impossible to infer the specific nature of the process (which is independent of the paradigm used) that leads to any observed overlap or dissociation. This problem can be overcome by employing a latent variable approach to better understand the relations among commonly used measures of response selection and response inhibition (Miyake et al., 2000). Confirmatory factor analysis (CFA) is a form of latent variable analysis that statistically extracts the common variance among tasks that are expected to tap the same putative function. By only analyzing what is common among the tasks, and excluding the systematic variance attributable to non-cognitive-control processes (e.g., sound and color processing), the “purer” latent variable (Miyake et al., 2000) can be extracted so as to examine how different cognitive-control functions relate to one another. Importantly, CFA allows the evaluation of different models on an a priori basis, utilizing knowledge from previous research about hypothesized task demands (i.e., tasks that are thought to share the same underlying construct should load on the same latent factor and are specified before the different models are run).

The application of CFA to exploring the relations among executive functions has been successfully demonstrated in a series of studies by Miyake and colleagues (Friedman & Miyake, 2004; Miyake et al., 2000). Specifically, Miyake et al. (2000) used a battery of tasks thought to tap three core cognitive-control functions—shifting, updating the contents of working memory, and inhibition of automatic responses. Although the CFA results revealed that the three cognitive operations shared some common overlap, a three-factor model provided the best fit to the obtained data, suggesting that all three functions represent distinct cognitive constructs that are qualified by shared variance between the executive-control factors. The diversity of cognitive control is further supported by Friedman and Miyake’s (2004) study that investigated the extent to which various measures of inhibitory control are related to one another and how these functions contribute to other cognitive-control-demanding tasks. The authors focused primarily on three potentially distinct inhibitory control functions: prepotent-response inhibition (the ability to override an automatic response), resistance to distractor interference (the ability to suppress irrelevant information), and resistance to proactive interference (the ability to suppress memory interference from previously relevant task information). CFA revealed that prepotent-response inhibition and resistance to distractor interference were closely related to each other, while resistance to proactive interference was unrelated to response and distractor inhibition.

Here, we employed a latent variable approach similar to that of Miyake et al. (2000) and Friedman and Miyake (2004), to understand the relations among response selection and response inhibition. This approach not only provides insights into how performance varies between individuals, but it can also identify the extent to which indicators of action control relate to each other. If response selection and response inhibition processes reflect two distinct aspects of action control, we would expect response selection tasks to load on one latent factor, and response inhibition tasks to load on a second factor. If, on the other hand, these two processes reflect “two sides of the same coin” (Mostofsky & Simmonds, 2008), we would expect each paradigm to load highly on a single action control factor.

Method

Participants

Eighty-seven participants (mean age = 20 years, range = 17–39 years; 69 females, 18 males) from the University of Queensland participated in this study for course credit. All participants had normal or corrected-to-normal vision. The University of Queensland Human Research Ethics Committee approved the study, and all participants gave informed, written consent. The target sample and participant exclusion criteria were determined before data collection and were based on a 5:1 ratio of sample size to the number of free parameters (Bentler & Chou, 1987). Participants were removed from the final analysis if they did not reach the behavioral cutoff for one or more tasks. In total, data from 21 participants were excluded from the final analysis, due to failure to attend a testing session (four participants) or because of poor performance (more than three standard deviations above the response time or accuracy mean) in one or several of the seven tasks (17 participants).

Apparatus

The experiment was run on an Apple Mac Mini computer and programmed in MATLAB (The MathWorks, Natick, MA) with the Psychophysics Toolbox extension (Brainard, 1997; Pelli, 1997). Stimuli were presented on a 21-in. CRT monitor (100-Hz refresh rate), which participants viewed from a distance of approximately 57 cm during all the tasks in the two sessions.

Procedure

Participants took part in two 2-h sessions that were administered seven days apart from each other. During each session, participants completed the following tasks, with the task order randomized across participants and sessions: stop-signal task (SST), go–no-go task, Stroop task, flanker task, single- vs. dual-response selection paradigm, PRP, and AB. For all seven measures, each trial began with a black fixation cross that was presented in the center of a gray screen (RGB 128 128 128) for a variable interval (200–600 ms). At the end of each block, the mean response time (RT) and accuracy were displayed (with the exception of the AB task, since participants are usually very poor at identifying the second target). Following the tasks in the first session, participants provided demographic information about their age, gender, handedness, years of education, history of neuropsychiatric illness, and current neuropsychiatric medication.

Tasks

Response inhibition tasks

Stop-signal task

Participants completed go-signal and stop-signal trials. On go-signal trials, participants were required to discriminate between two different 2-D abstract shapes (gray with a black outline; see Fig. 1) by pressing the relevant response key (“F” or “J”) as quickly and as accurately as possible. On stop trials (25 % of trials), the primary go stimulus was followed by an auditory stop signal (750-Hz sine-wave tone, 200-ms duration), which instructed participants to inhibit their response. The stop-signal delay (SSD; i.e., the time between the onsets of the go stimulus and stop signal) was initially set at 250 ms and continuously adjusted with an adaptive staircase procedure to obtain a stopping probability of 50 %. If participants failed to inhibit their response in a stop-signal trial, the SSD decreased by 50 ms, but if they succeeded, the SSD increased by 50 ms. The fixation cross was replaced by the go stimulus, which remained on screen for 200 ms. Participants completed one practice block of 24 trials and four test blocks of 36 trials. The dependent variable was the stop-signal RT (SSRT), calculated by subtracting each participant’s mean SSD from the mean go RT (Verbruggen & Logan, 2009). Better inhibitory performance was indicated by lower SSRTs.

Fig. 1
figure 1

Schematic representations of the paradigms and task performance on the seven measures of interest across Sessions 1 and 2. Task performance: (A) Mean go response times (Go-RT), the mean stop-signal delay (SSD), and the mean stop-signal RT (SSRT) in the stop-signal task. (B) Task performance in the go–no-go task: Mean go-RTs and commission errors (failed no-go trials). (C and D) Mean RTs for the congruent, incongruent, and neutral conditions in the Stroop and flanker tasks, respectively. (E) Mean RTs for the single and dual visual stimulus conditions and the single and dual auditory conditions in the single- vs. dual-response selection task. (F) Mean RTs as a function of stimulus onset asynchrony (SOA) for Target 1 (T1) and Target 2 (T2) in the psychological refractory period paradigm. (G) Mean accuracy as a function of lag for Target 2 accuracy, given that Target 1 is correct (T1 | T2), and T1 accuracy in the attentional blink task. The error bars represent the standard errors of the means (SEMs)

Go–no-go task

On each trial of this task, participants were instructed to make a speeded response to the go stimulus (a white, abstract 3-D shape; see Fig. 1 for an example) by pressing the “G” key, but to withhold from responding if the no-go stimulus appeared (another white, abstract 3-D shape; 25 % of trials). The go and no-go stimuli were counterbalanced across participants. On each trial, a fixation cross appeared (200 ms), followed by one of the two target stimuli and an 1,800-ms response window. Participants completed a block of 24 practice trials and four test blocks of 36 trials. The dependent variable of interest was the number of commission errors on no-go trials (i.e., failure to inhibit a response). Better inhibitory control was indicated by fewer commission errors.

Stroop task

In this task, participants had to report the presentation color (red, RGB 237 32 36; green, RGB 10 130 65; yellow, RGB 250 250 0; or blue, RGB 44 71 151) of both color (“red,” “green,” “yellow,” and “blue”) and noncolor (“cup,” “fork,” “spoon,” “saucer”) words as quickly and as accurately as possible via keypress. At the beginning of each trial, the fixation cross was replaced by a word target for 500 ms. The word target was equally likely to be congruent (i.e., the printed color word matched its color on screen; e.g., “blue” printed in blue), incongruent (e.g., “blue” printed in red), or neutral (i.e., a noncolor word printed in any of the four colors; e.g., “cup” printed in green). Each of the four possible presentation colors was mapped onto a corresponding response key, and participants completed a practice block of 24 trials to familiarize themselves with the stimulus–response mappings. The experimental phase consisted of four blocks of 36 trials, and the order of trial types was randomized. The dependent variable was the “Stroop congruency effect,” which was calculated as the difference in RTs between congruent and incongruent trials. A smaller congruency effect reflected better performance.

Flanker task

Participants had to respond to the direction of a central arrow target (> or <) as quickly and as accurately as possible, by pressing the “<” key for leftward-pointing arrows and the “>” key for rightward-pointing arrows. On congruent trials, the target was flanked by two arrows on each side that pointed in the same direction as the target arrow (e.g., >>>>>). On incongruent trials, the target was flanked by two arrows on each side that pointed in the opposite direction from the target arrow (e.g., >><>>). On neutral trials, the flankers were two horizontal lines that appeared on each side (e.g., -- < --). The fixation cross was replaced by the concurrent onset of the target and flankers, which remained on the screen for 200 ms. Participants completed 24 practice trials followed by four blocks of 36 test trials. There were equal numbers of trials per condition, and the order of trial types was randomized. The dependent variable was the “flanker congruency effect,” which was calculated as the difference in RTs between the congruent and incongruent trials. Lower RT difference scores represented better performance.

Response selection tasks

Single- vs. dual-response selection task

Participants first practiced on three blocks of the two different, two-alternative forced choice (AFC) RT tasks. Block 1 was a visual task in which participants discriminated between two different colored circles (red, RGB 237 32 36, and blue, RGB 44 71 151; 12 trials), and Block 2 was an auditory discrimination task (two complex tones, previously used in Filmer, Mattingley, Marois, & Dux, 2013). Each stimulus was mapped to a specific response key and hand (“A” or “S” for left-hand responses, and “K” or “L” for right-hand responses), with the mapping of hand to task counterbalanced across participants. After learning the specific stimulus–response mappings of each task, participants completed a third practice block in which one visual and one auditory stimulus were presented simultaneously on each trial. The experimental phase included four blocks of 36 trials, with each trial type randomly intermixed within blocks. In each test trial, the central fixation cross was followed by a visual stimulus only, an auditory stimulus only, or the simultaneous presentation of both for 200 ms. Participants were instructed to respond as quickly and as accurately as possible via keypress to the stimulus or stimuli. Of key interest was the RT to the single-task trials (averaged across the two tasks), as a pure measure of response selection, and the difference in RTs when the tasks were performed with and without a concurrent task, to measure dual-task executive function. Lower RTs/difference scores indicated better performance.

Psychological refractory period

Participants first trained on three blocks of 20 trials of the two different four-AFC tasks. Each task was assigned a response hand and a specific key for each stimulus (“A,” “S,” “D,” or “F” for left-hand responses, and “J,” “K,” “L,” or “;” for right-hand responses), with the mapping of hand to task counterbalanced across participants. In Block 1, participants trained on the visual task (Task 1), which required a manual keypress response to one of four different symbols (%, &, @, or #). Block 2 was an auditory task (Task 2) that required participants to make a manual keypress response to four different complex tones (previously employed in Filmer, Mattingley, Marois, & Dux, 2013, but different from the tones used in the single- vs. dual-response selection task here). After learning the specific stimulus–response mappings of each task, participants completed a third practice block that was identical to the trials used in the experimental phase. Here, participants were presented with the visual Target 1 (T1) stimulus, which was presented centrally for a variable interval between 200 and 600 ms (randomly selected at the beginning of each trial).Footnote 1 After either a long (1,000 ms) or short (200 ms) stimulus onset asynchrony (SOA), the second, auditory target (T2) was presented for 200 ms. Participants were instructed to respond as quickly and as accurately as possible to the two tasks. The experimental phase consisted of four blocks, each containing 40 trials, with the two SOA conditions being presented equally often across blocks. The dependent variable was the PRP effect, which was calculated by subtracting the T2 RT in the 1,000-ms condition from that in the 200-ms condition (Van Selst, Ruthruff, & Johnston, 1999). Lower differences scores indicated better performance.

Attentional blink

Here, on each trial, participants were presented with a rapid serial visual presentation (RSVP) stream containing two targets (black letters of the alphabet, excluding I, L, O, Q, U, V, and X) and eight distractors (black digits ranging from 2 to 9). The participants’ task was to report the identity of both targets at the end of the stream. There were no time constraints for responding, and participants were encouraged to guess if they were unsure. The fixation cross was replaced by the first stimulus in the RSVP stream. Each item was presented for 100 ms. T1 always appeared at Serial Position 3, and T2 appeared either 200 ms (lag 2), 300 ms (lag 3), 500 ms (lag 5), or 700 ms (lag 7) after T1. Participants completed 24 practice trials and four test blocks of 24 trials. The dependent variable was AB magnitude, which was calculated by subtracting the average T2 | T1 accuracy of lags 2 and 3 from lags 5 and 7 (Kelly & Dux, 2011). A smaller AB magnitude indicated better performance.

Results

Outlier screening was performed for each participant, in each session and for each task separately. Trials greater than three standard deviations above the mean were removed.

Test–retest reliability

To assess the stability of performance for each measure, we first calculated the test–retest reliability between Sessions 1 and 2 for each paradigm. As can be seen in Fig. 2, all seven paradigms were shown to have moderate to strong test–retest reliability (r range = .42–.69), demonstrating that individual task performance remained relatively stable across the seven days. It was therefore appropriate to use these measures in intercorrelation analyses with the other tasks.

Fig. 2
figure 2

Scatterplots of the relationship between an individual’s (A) stop-signal response time (SSRT), (B) go–no-go commission errors, (C) Stroop congruency effects (Stroop Cong Effect), (D) flanker response time (RT) effect, (E) single-task RTs, (F) psychological refractory period magnitude (PRP Magnitude), and (G) attentional blink magnitudes (AB Magnitude) across Sessions 1 and 2

Relationships among the tasks

To assess whether the data contained clusters of interrelated variables that were in line with our a priori predictions, we first explored the relationships between the different tasks. The measures of interest (SST, SSRT; go–no-go, commission errors [%]; Stroop, congruency effect; flanker, congruency effectFootnote 2; dual- vs. single-response selection task, dual-task cost and single-task response selection RT; PRP, PRP magnitude; AB, AB magnitude) were first collapsed across sessions for each paradigm, to obtain an overall score. An attenuation correctionFootnote 3 (Hunter & Schmidt, 1990) was applied to each correlation (r c) to control for differences in reliability due to measurement error (see Table 1 and Supplemental Fig. 1). Out of the response selection measures, PRP correlated positively with other measures of response selection, such as the single-response selection (r c = .53, p < .01) task, and only marginally with the AB (r c = .36, p = .06) measure. Interestingly, the Stroop task, often employed to measure inhibition-related processes, did not significantly correlate with any of the inhibitory control tasks, but instead correlated significantly positively with the PRP (r c = .57, p < .01). The PRP also correlated significantly positively with the SST (r c = .38, p < .05) response inhibition measure. In addition, the SST significantly positively correlated with the go–no-go (r c = .88, p < .01) paradigm. Of interest, the flanker paradigm did not correlate with any of the other measures, and was therefore removed from further analysis.

Table 1 Attenuated Pearson r correlations (Hunter & Schmidt, 1990) between the stop-signal (SST), go–no-go (No-Go), Stroop, flanker, dual vs. single response selection (Single-Task), psychological refractory period (PRP), and attentional blink (AB) tasks

Confirmatory factor analysis

To examine how response inhibition and response selection processes are related, we performed a series of CFAs. Using the well-established tasks that are known to differ in their response selection and inhibition requirements, we first empirically compared a set of different models (see Table 2) and selected the best-fitting model via multiple appropriate fit indices. The fit of each model was evaluated with the chi-square statistic, the standardized root-mean-square residual (SRMR), and Akaike’s information criterion (AIC). The chi-square statistic (nonsignificant values indicating a satisfactory fit) and AIC (lower AIC values indicate better fit) both measure the fit between the observed and predicted covariances. The SRMR represents the square root of the average covariance residuals between the observed and predicted models, with SRMR values less than .08 indicating an acceptable fit, and values less .05 indicating a good fit (Hu & Bentler, 1998). In addition, we examined the models with Bentler’s comparative fit index (CFI), which compares each model to an independent baseline model (i.e., a model in which all covariances are set to zero). Here, values between .95 and 1.00 are indicative of a good model fit (Hu & Bentler, 1998).

Table 2 Fit statistics for the confirmatory factor analysis models

To examine whether one model significantly improved the model fit compared to another, chi-square difference tests were performed on nested, more restrictive models. For these calculations, the chi-square value (as well as the degrees of freedom) of the full model is subtracted from that of the smaller, more restricted model. If the chi-square difference is nonsignificant, then the more restrictive model represents a significantly better fit. The factor models (see Fig. 3) include the standardized factor loadings (standardized regression coefficients), the error variance for each task (including measurement error and idiosyncratic task requirements), and the correlations between the latent variables.

Fig. 3
figure 3

Best-fitting model (Model 1) of the action control functions. Standardized factor loadings are given next to the straight arrows leading from the latent factors to the indicators. The standardized residual variance for each task is listed beside the error term circle. The number between the two latent factors is the correlation between them. Stop-Signal = stop-signal task; Go-Nogo = go–no-go task; PRP = psychological refractory period task; Stroop = Stroop task; Single-Task = single vs. dual-response selection task; AB = attentional blink task

First, we constructed a two-factor model of response selection and response inhibition by selecting the key performance measure for each task as an indicator (e.g., AB represents the AB magnitude). Given that the Stroop task only correlated significantly with response selection measures and not, as anticipated, with any inhibitory control measures, performance on the PRP, Stroop, single-task response selection, and AB served as indicators of the response selection factor. We specifically chose single-task response selection performance instead of dual-task cost as a response selection indicator in order to ensure that the latent response selection variable was not merely driven by dual-task measures (i.e., multitasking). Please note, however, that using the dual-task cost as a response selection indicator still resulted in an acceptable fit (see Table 2, Model 4). The SST and go–no-go task served as indicators of the Response Inhibition factor. The fit between the hypothesized model and the data was acceptable, with a nonsignificant chi-square value [χ 2(8) = 4.32, p = .83; SMRI = .057, CFI = 1.00, and an AIC value of 1,103.92). As is shown in Fig. 3, the correlation between response selection and response inhibition was relatively weak (r = .26).

However, because the coefficient predicting performance on the AB from the Response Selection factor was relatively small, we performed post-hoc model modifications to see whether it was possible to develop a better-fitting model. In our second model, we dropped the AB variable and reestimated the model [χ 2(4) = 3.33, p = .5; SMRI = .06, CFI = 1.00, and an AIC value of 915.68]. Both the chi-square and the AIC indicated a slightly better-fitting model (see Table 2) after performance, but dropping the AB as an indicator did not significantly improve the model fit [∆χ 2(4) = 0.99, p = .91]. Given the nonsignificant improvement in overall fit, the full two-factor model (Model 1) was considered the more parsimonious model.

Next, to test whether response selection and response inhibition processes are reflective of a unitary mechanism, we constructed a general action control one-factor model that collapsed the response selection and response inhibition variables into a single general action control latent variable. As is shown in Table 2, the fit of the one-factor model (Model 3) was poor [χ 2(9) = 15.75, p = .07; SMRI = 0.106, CFI = .77, and an AIC value of 1,113.35]. Moreover, it provided a significantly worse fit than the full two-factor model (Model 1), ∆χ 2(1) = 11.43, p < .001. Therefore, the full two-factor model, in which tasks were parsed in terms of response selection and inhibition requirements (Model 1), was supported over a single, General Action Control factor model.

Discussion

We used an individual differences approach to examine the underlying relationship between two cognitive-control functions—the selection of appropriate responses (i.e., response selection) and the inhibition of automatic and inappropriate responses (i.e., response inhibition)—at the latent variable level. A series of models were tested via CFA to assess the relations between six behavioral tasks that have been hypothesized to tap response selection and inhibition processes. The results from the CFA showed that the full two-factor model (Model 1), in which tasks were assigned in terms of response selection and response inhibition requirements, provided a significantly better fit to the observed data than did the simplest model—a single General Action Control factor. Specifically, we found that variance on the Response Inhibition factor contributed to performance on the stop-signal and go–no-go tasks, whereas variance on the Response Selection factor played an important role in the PRP, Stroop, and single-task paradigms, and to a smaller degree in the AB.

The present findings suggest that response selection and response inhibition processes are separable, nonunitary processes, since these two factors were weakly related (covariance between the two factors = .26). Our findings challenge recent work that has suggested that a common mechanism accounts for both the selection and inhibition of actions (Mostofsky & Simmonds, 2008; Verbruggen et al., 2014). Instead, the dissociation we found reinforces previous CFA studies that have supported the diversity of other executive functions, as well as some overlap (Fisk & Sharp, 2004; Friedman & Miyake, 2004; Miyake et al., 2000). Moreover, the present work fits well with behavioral evidence from Yamaguchi et al. (2012), who found that the decision to stop an inappropriate response is not dependent on the processing capacity at the central response-selection stage. In addition, our results are consistent with neuroimaging and neurostimulation findings that have posited that distinct neural substrates are involved in response selection (Dux et al., 2006; Filmer, Mattingley, Marois, & Dux, 2013) and response inhibition processes (Aron et al., 2014).

Interestingly, we did not find that performance on the flanker task correlated significantly with the Stroop task or any of the other performance measures. Although there is evidence of commonly observed activations in the anterior cingulate cortex and prefrontal cortex when dealing with response conflict in the flanker and Stroop tasks (Fan, Flombaum, McCandliss, Thomas, & Posner, 2003), studies adopting an individual differences paradigm have frequently shown that correlations vary between the two tasks and other cognitive measures such as intelligence (Jensen & Rohwer, 1966; Spilsbury, 1992; Stins, Polderman, Boomsma, & de Geus, 2005). Moreover, factorially combined Stroop and Simon tasks and flanker and Simon tasks have shown that the conflict-elicited control mechanisms for the two tasks seem to operate in a conflict-specific (i.e., context-specific conflict triggers particular cognitive-control processes), rather than a conflict-general manner that would act on any type of task conflict (for a review, see Egner, 2008). These findings therefore suggest that these types of conflict tasks may have different cognitive-control functions that are mediated by independent cognitive-control loops.

The finding that performance on the Stroop task is more dependent on the constraints imposed by response selection, rather than the ability to inhibit distracting automatic responses, contrasts with existing studies that have utilized CFA methods. In particular, Miyake and colleagues found that the Stroop task significantly loaded on the prepotent Response Inhibition factor (Friedman & Miyake, 2004; Miyake et al., 2000). Instead, our finding more closely resembles previous research that has found that incongruent trials (e.g., the word “blue” presented in red) create interference at the response selection level (Cohen, Dunbar, & McClelland, 1990; MacLeod, 1991; MacLeod & MacDonald, 2000; Stafford & Gurney, 2007), such that pressing a key associated with a specific color (e.g., red) competes with the color word (e.g., the word “blue”) as a potential alternative response. Thus, despite the fact that the Stroop task is typically considered to reflect response inhibition, the combination of these prior findings and ours instead implies that the Stroop paradigm actually taps response selection processes similar to those elicited by the PRP and single-response selection task paradigms.

It must be noted that the AB only weakly loaded on the Response Selection factor. Because this protocol was the only paradigm out of the six submitted to CFA that contained distractor items, it may be possible that this paradigm predominantly measures the suppression of distracting and irrelevant stimuli through controlled attention, which may be separate from the controlled inhibition of prepotent motor responses that is required in the stop-signal and go–no-go tasks (Friedman & Miyake, 2004; Nigg, 2000). The weak correlation between the AB, PRP, and single-response selection tasks also suggests that performance in the AB may predominantly rely on another mechanism dissociated from response selection. A similar finding has been noted in two recent studies by Garner and colleagues (Garner, Matthews, Remington, & Dux, 2015; Garner, Tombu, & Dux, 2014), who investigated whether response selection and sensory consolidation rely on the same capacity-limited central mechanism (Jolicœur & Dell’Acqua, 1998). For example, in the Garner et al. (2014) study, participants were randomly assigned to a relevant training group (a speeded forced choice sensory–motor task matching T1 for an AB or PRP task), an irrelevant training group (i.e., a speeded forced choice sensory–motor task not matching T1 for an AB or PRP task), or a control group (no training). The authors showed that although only relevant training attenuated the PRP effect, both relevant and irrelevant training reduced the AB, suggesting that AB performance is at least partly driven by a different underlying mechanism (but see Jolicœur, 1998; Tombu et al., 2011, for evidence of overlap).

While we argue for a response selection–inhibition account of the observed two-factor solution, one alternate interpretation is that these components could reflect alternative cognitive processes, such as attentional monitoring or working memory. Specifically, performance in response inhibition tasks is often thought to rely heavily on attentional monitoring when one is preparing to respond to infrequent stimuli or conflict (Duann, Ide, Luo, & Li, 2009; Erika-Florence, Leech, & Hampshire, 2014; Hampshire, Chamberlain, Monti, Duncan, & Owen, 2010), whereas performance in response selection paradigms may be dependent on working memory maintenance processes that support reasoning and rule-processing demands (Bunge, Hazeltine, Scanlon, Rosen, & Gabrieli, 2002; Jiang & Kanwisher, 2003; Miller & Cohen, 2001). If our two-factor solution did instead reflect attentional monitoring and working memory processes, then one would predict that the conflict monitoring generated by an incongruent flanker stimuli (the flanker task) and the working memory requirements needed for the sensory consolidation of Targets 1 and 2 (the AB task) should also correlate significantly with one of these two factors. Because the flanker paradigm was omitted from our CFA analysis due to its very weak zero-order correlations with the remaining six paradigms, and because the AB only weakly loaded on the Response Selection factor, our findings instead favor the response selection–inhibition account.

In summary, the present findings support the hypothesis that response selection and response inhibition tap two distinct mechanisms of action control. These results reconcile the previous conflicting empirical work into the nature of response selection and inhibition processes by using an individual differences latent variable approach and a range of cognitive tasks to identify the common sources of variance associated with each process. The results have important implications for how we should conceptualize action control and how one might tailor interventions designed to overcome the limitations associated with each domain. In other words, our findings illustrate the value of using multiple response selection and response inhibition paradigms when investigating these two cognitive operations, and they further suggest that interventions need to be specialized for each aspect of action control.