A central issue in language production concerns the selection of words from the mental lexicon: lexical selection. There is consensus that a spreading activation mechanism at the semantic level will lead to the activation of semantically related lexical representations (e.g., Levelt, Roelofs, & Meyer, 1999), but the precise decision mechanism needed to select the correct representation is still under debate. Research on the dynamics of lexical selection often employs the picture-word interference (PWI) paradigm (for an early review, see MacLeod, 1991). In this paradigm, the participant is presented with a picture with a superimposed word (i.e., distractor). The task is to name the picture whilst ignoring the distractor. Within this paradigm, it is possible to vary the relationship between the picture and distractor (e.g., a semantic relationship such as CAT-dog, e.g., Schriefers, Meyer, & Levelt, 1990) and the properties of the picture and word individually (e.g., Miozzo & Caramazza, 2003).
One effect that has recently attracted attention is the distractor frequency effect. In a PWI task, Miozzo and Caramazza (2003, see also Burt, 2002; Catling, Dent, Johnston, & Balding, 2010; Dhooge & Hartsuiker, 2010) presented pictures with low-frequency or high-frequency distractors. Pictures presented with a high-frequency distractor were named faster than pictures presented with a low-frequency distractor. Currently, there is debate regarding the exact origin of the distractor frequency effect. Two models can account for the distractor frequency effect: WEAVER ++ which assumes an early locus (e.g., Roelofs, 2003, 2005) and the response-exclusion hypothesis (e.g., Mahon, Costa, Peterson, Vargas, & Caramazza, 2007), which places the effect at a late, post-lexical level. The goal of this article is to contrast these two accounts.
In WEAVER++ (e.g., Roelofs, 2003, 2005), information is retrieved from a network by means of spreading activation through a number of stages, namely conceptual identification, lemma retrieval, word form encoding, and articulatory processing. Importantly, WEAVER++ assumes input filtering based on attentional processes. When naming a picture with a superimposed distractor, processing of the picture is favored over processing the distractor by ‘blocking’ the latter out of the speech production system. This is done by an attentional modulation of the system when it detects both the picture and the distractor. Thus, distractor blocking is seen as an early process that is needed to distinguish between correct and incorrect input information. Furthermore, the speed with which a distractor can be ‘blocked’ depends on the speed of the availability of the distractor information. As high-frequency words are perceptually available earlier than low-frequency words (e.g., McClelland & Rumelhart, 1981) their information can be blocked earlier on, making the discrimination between relevant and irrelevant information easier. Consequently, naming latencies will be faster for pictures accompanied by high-frequency distractors. Indeed, computer simulations reported in Roelofs (2005) showed that WEAVER++ can simulate the distractor frequency effect.
In contrast, the response-exclusion hypothesis (e.g., Finkbeiner & Caramazza, 2006; Mahon et al., 2007) places the effect at a later level. The basic premise is that some of the effects found in the PWI paradigm are task-specific, not necessarily reflecting lexical selection. The response-exclusion hypothesis makes two assumptions. The first assumption is based on the observation that in spoken language production, only one response at a time can be produced. This leads to the conclusion that word production must involve a single-channel output buffer. The second assumption stated that in a PWI task, participants will unavoidably formulate a covert verbal response to the distractor. This response is available before the response to the picture, because words have a privileged relationship to the articulators (e.g., Roelofs, 2003; cf., McLeod & Posner, 1984; Glaser & Glaser, 1989). This assumption is supported by the absence of semantic interference when both target and distractor are pictures or when both are words (e.g., Glaser & Glaser, 1989). In the first case, no response to the distractor is formed, while the second observation can be explained by assuming that the distractor word will be overwritten upon presentation of the target word. Thus, on this account, distractor words will always elicit a covert verbal response. Because of the privileged relationship of words with the articulators, any response that is already in the buffer upon the presentation of the distractor will be overwritten. Therefore, in the PWI paradigm, the response to the distractor needs to be purged from this buffer before a response to the picture can be given.
Because of this need to purge the distractor, the latency of the response to the picture depends on two factors: how fast the response to the distractor enters the buffer, and how fast it is removed. The distractor frequency effect is accounted for by the first factor. Responses to low-frequency words are assumed to be available later due to, for example, the higher resting activation of high-frequency words (e.g., Dell, 1986). As a result, the response to low-frequency distractors is also excluded later and will interfere more, explaining the distractor frequency effect. Therefore, the effect should vary according to whether or not the response to the distractor is formed and/or is in the response buffer. If no response enters the buffer, there will be no operations involving the response buffer. Consequentially, naming latencies should reflect genuine lexical selection processes. Put differently, the distractor frequency effect has a post-lexical locus.
Summarizing, both models assume a form of distractor blocking. In WEAVER++, this is implemented as early attentional modulation designed to allow a differentiation between relevant and irrelevant information. In the response exclusion hypothesis, distractors are removed from a response buffer in order to make room for the response to the picture.
Some studies provided some tentative evidence for a late locus of the distractor frequency effect. For example, Miozzo and Caramazza (2003, experiment 4) presented pictures with two types of distractors: those that were and those that were not previously read aloud. They reasoned that if the effect of high-frequency words is attributable to the fact that they were repeatedly selected for production, distractor words that were read aloud before should interfere less. In line with the hypothesis, these distractors indeed led to shorter picture naming latencies. However, this result can also be explained by WEAVER++, by assuming that repeated production leads to faster blocking. Dhooge and Hartsuiker (2010) assumed that, if the distractor frequency effect is indeed due to low-frequency distractors being excluded slower than high-frequency distractors, the effect should disappear when no response is formed. They presented pictures with their distractors either visible or masked (cf. Finkbeiner & Caramazza, 2006). The reasoning is that masking should prevent the formulation of a response to the distractor in the buffer (cf. Finkbeiner & Caramazza, 2008), eliminating the need for late, response-exclusion processes. There was a significant distractor frequency effect under visible conditions, but not under masked conditions. Under masked conditions, the presence of semantic facilitation showed that the absence of the distractor frequency effect was not due to not processing the distractor. Furthermore, it indicated that distractors in the masked condition were processed at least up to the level of semantics. The absence of the distractor frequency effect suggests that the effect has its locus at a later level. Dhooge and Hartsuiker (2010) also tested the hypothesis in a different way. They presented low- or high-frequency distractors at various time points before the picture. According to the response exclusion hypothesis, the more time there is between presentation of distractor and picture, the smaller the distractor frequency effect should be. In line with predictions, the distractor frequency effect was only significant when picture and distractor were presented together and when the distractor was presented 100 ms before the picture. When the distractor was presented 200 ms or 300 ms before the picture the effect disappeared. However, WEAVER++ could also account for these data when it is assumed that 100 ms is not enough to block the distractor out to a sufficient extent.
Another and more direct way to distinguish between the early and late locus is to see whether the distractor frequency effect is present when responses are delayed and lexical selection has already finished. To this purpose, we employed a paradigm introduced by Janssen Schirm, Mahon, and Caramazza (2008), in which participants see pictures with superimposed distractors in an immediate and delayed naming condition. In the immediate naming condition, picture and distractor are presented simultaneously. In the delayed naming condition, the picture is presented 1000 ms before the distractor. Depending on the color of the distractor, either the distractor or the picture has to be named. The crucial condition is the delayed naming condition. In this condition, participants are asked to always prepare the picture name, despite the fact that they sometimes have to name the distractor. This ensures that upon distractor presentation, lexical selection of the picture name has already been completed, which predicts no effect of picture frequency. According to the response exclusion hypothesis, presenting the distractor will always lead to the formulation of a response to the distractor. This response will enter the response buffer, overwriting the response to the picture that is presumably located in the buffer. If the task is then to name the picture, the response to the distractor needs to be purged from the buffer, leading to the distractor frequency effect. However, according to WEAVER++, the response is already available and ready for production. As it is clear which response is correct, there is no need for selection anymore, and consequently, for distractor blocking. Thus, there should not be a distractor frequency effect. The immediate naming condition is the standard condition. In this condition, participants cannot have retrieved the picture name upon presentation of the distractor as they are presented simultaneously. Therefore, both WEAVER++ and the response exclusion hypothesis predict an effect of both picture frequency and distractor frequency.