The question of how morphologically complex words are processed during word recognition has prompted many years of research and is still a hotly debated subject. One account that has become increasingly influential during the past decade is that skilled readers rapidly decompose morphologically structured letter strings (e.g., hunter) into their morphemic subunits (e.g., hunt + er) during reading. Importantly, evidence for the morphological segmentation of letter strings has been found in “pseudo-suffixed” words that have the appearance of morphological complexity (e.g., corn + er) but do not share the same meaning as the embedded pseudo-stem (e.g., corn vs. corner). Evidence for this account stems primarily from masked priming studies showing that the recognition of a target word (as assessed by a visual lexical decisions task) is facilitated by the prior presentation of a related suffixed (e.g., hunter-HUNT) or pseudo-suffixed prime (e.g., corner-CORN), but not by a non-suffixed prime (e.g., cashew-CASH). This finding was initially reported by Longtin, Segui, and Hallé (2003) in French and by Rastle, Davis, and New (2004) in English-speaking participants. Since these initial studies, the basic findings have been replicated across a number of different Indo-European languages, such as Dutch (Diependaele, Sandra, & Grainger, 2005; Diependaele, Sandra, & Grainger, 2009) and Russian (Kazanina, 2011; Kazanina, Dukova-Zheleva, Geber, Kharlamov, & Tonciulescu, 2008) and have had a major influence on theorizing in the field (for a review, see Rastle & Davis, 2008).

However, a major criticism to the morpho-orthographic processing literature was put forward by Baayen, Milin, Filipovic Durdevic, Hendrix, and Marelli (2011), who drew attention to one potential significant flaw in previous morphological priming studies. They pointed out that prime words previously tested in the pseudo-morphological priming condition varied in the extent to which the suffix conveyed its regular meaning. For example, as Baayen and colleagues (2011) pointed out, the etymological origin of archer (“someone who wields a bow”) is Latin arcus (bow, arc), which is similar in structure to a denominal formation, such as trucker (“someone who drives a truck”). More precisely, Baayen et al. (2011) suggested that priming in the pseudo-suffixed condition has previously only been observed, “because for the majority of opaque items, the suffix is fully functional in the meaning of the complex word (p. 466).” Thus the pseudo-suffixed word sniper contains the pseudo-suffix er, which carries the typical meaning of this suffix as “someone who does something.” When applied to suffixes with syntactic function, such as the suffix “y” (adjectival marker) in pseudo-suffixed words like “crafty,” then this rule indeed disqualifies many of the items tested by Rastle et al. (2004).Footnote 1

The goal of the present study was to put to test the important claims of Baayen et al. (2011) using Rastle et al.’s (2004) experimental design with more tightly controlled materials. Following Rastle et al., we carried out a masked priming lexical decision experiment comparing priming effects on stem target recognition for truly suffixed (e.g., hunter-HUNT) and pseudo-suffixed prime–target pairs (e.g., corner-CORN) relative to a non-suffixed control (e.g., cashew–CASH). However, in contrast to the pseudo-suffixed words used by Rastle and colleagues, we used a set of revised and more stringent stimulus selection criteria in this condition to ensure that the whole word and the stem never shared any meaning, not even remotely (e.g., butcher was excluded) or archaically (e.g., archer was excluded; see Materials section for more details) and that the pseudo-suffix did not carry its meaning or function as a real suffix. If Baayen and colleagues are correct that priming in the pseudo-suffixed condition has been previously observed only because for the majority of items the suffix was fully functional in the meaning of the complex word, then we expect to observe priming in the truly suffixed condition, but not in the pseudo-suffixed or non-suffixed conditions. If however, a morpho-orthographic segmentation mechanism indeed exists that blindly decomposes any word that has the mere appearance of morphological complexity, we should replicate the standard pattern of morpho-orthographic priming effects. That is, we should find comparable priming in both the truly suffixed and pseudo-suffixed condition and no priming in the non-suffixed control condition.

Method

Participants

Forty students from Macquarie University, all English native speakers, participated for course credit.

Materials

In the first instance, a list of pseudo-suffixed words (e.g., corner) was selected from the CELEX lexical database (Baayen, Piepenbrock, & van Rijn, 1993), using a number of strict selection criteria. Pseudo-suffixed words were selected such that the whole word and the stem never shared any remote meaning (e.g., butcher was excluded) or archaic meaning (e.g., archer was excluded). We excluded words where the suffix conveyed some degree of meaning (e.g., gaffer was excluded, because although gaffer and gaff are not semantically related, a gaffer is “someone who does something”). We excluded words where suffixes conveyed a syntactic content (e.g., crafty was excluded, because adding the derivational suffix y turns a word into an adjective). Moreover, we ensured that the target was always a noun, verb or adjective (e.g., offer-off and wither-with were excluded). Primes and targets that expressed colloquial, archaic or technical terms were excluded (e.g., laden- lad , bastion -bast, lander -land, and liter -lit were excluded). We also excluded low frequency primes and targets (e.g., scullion), inflected targets (e.g., sold), words containing low frequency suffixes (e.g., et as in helmet), words containing ambiguous suffixes (e.g., illion vs. ion in billion), and words containing ambiguous stems (e.g., note vs. not in notion). Finally, primes were never the orthographic or deletion neighbour of a word that was morphologically related to the stem (e.g. career was excluded, because carer is related to the stem care). The application of these criteria resulted in the selection of 22 pseudo-suffixed words, including the suffixes –en, −er, and –ion, which are listed in Table 3. Each pseudo-suffixed prime (e.g., corner) was then paired with a target, which was the word embedded in the whole letter string (e.g., corner-CORN).

In addition to the list of pseudo-suffixed items, we selected a list of 22 truly suffixed and 22 non-suffixed items. The truly suffixed items were chosen such that the meaning of the prime could always be derived from the meaning of its stem-target (e.g., hunter-HUNT). Moreover, since the suffix er can be used in inflectionally (e.g., darker) and derivationally (e.g., speaker), only derivational instances of er were chosen. In the non-suffixed control condition, primes and targets were selected such that the relationship was purely orthographic (e.g., cashew-CASH). The stimuli-matching criteria were adapted from Rastle et al. (2004). All three conditions were matched on orthographic overlap (calculated by dividing prime length by target length), target neighbourhood size, target length, target frequency, prime length, and prime frequency (see Table 1 for characteristics). Word frequencies were extracted from the SUBTLEX database (Van Heuven, Mandera, Keuleers, & Brysbaert, 2014). In addition, semantic relatedness values for each prime–target pair in all three conditions were extracted from Latent Semantic Analysis Web facility (http://lsa.colorado.edu/; Landauer & Dumais, 1997), which revealed that semantic relatedness values in the truly suffixed condition (.43) were significantly higher than those in both the pseudo-suffixed (0.10), t(27) = 6.64, and the non-suffixed (0.12), t(27) = 6.37, conditions, but the pseudo-suffixed and non-suffixed conditions did not differ, t(40) = 0.58.

Table 1 Mean word frequencies were extracted from the SUBTLEX database (Van Heuven et al., 2014) and are given as Zipf values (log10 occurrences per billion)

For each target, an orthographically, morphologically, and semantically unrelated control prime was selected from the CELEX lexical database (e.g., nearly–HUNT, faulty–CORN, seeing–CASH), which was matched to the related primes on frequency and length. Given that each target word was paired with two different types of prime (related and unrelated), two lists were created so that each target only appeared once in each list, but each time in a different priming condition. Participants were allocated to one of the two lists, such that they only ever saw any target once. A full list of stimuli is provided in the Table 3.

In line with Rastle et al. (2004), an additional set of 22 unrelated word pairs was included to further reduce the prime–target relatedness proportion. The related and unrelated prime–target pairs were matched on target length. Finally, for the purpose of the lexical decision task, we included a set of 88 nonword targets (e.g., SARB), which were all orthographically legal and pronounceable and matched on length to the real-word targets. Each nonword target was preceded by a morphologically complex word prime (e.g., novelty–SARB).

Procedure

Stimuli were presented in the centre of a LED computer screen using DMDX software (Forster & Forster, 2003). Participants were tested individually in a quiet room. Each trial consisted of a 500-ms forward mask of hash keys, then a 50-ms prime in lowercase, then the uppercase target. The target remained present until the response was made or until 3 seconds had elapsed. Participants were instructed to respond as quickly and accurately as possible if the visually presented items were real English words or not. Stimuli were presented in randomized order. The experiment lasted for approximately 10 minutes per participant.

Results

Lexical decisions to word targets were analysed as follows. Incorrect responses were removed from the reaction time (RT) analysis (7.8 % of all data). Inverse RTs (1/RT) were calculated for each participant to correct for RT distribution skew and used throughout the analyses. Reaction times smaller than Q1–(2.5 × IQR) or larger than Q3 + (2.5 × IQR), by either participants or items, were excluded from the analyses (Q1 = first quartile, Q3 = third quartile, and IQR = Q3–Q1 = interquartile range). This trimming led to the removal of 4.1 % of the data. RTs and error rates are presented in Table 2 and were analysed for each subject.

Table 2 Mean lexical decision times and error rates for word targets averaged across subjects

We used linear mixed-effect modelling to perform the main analyses (Baayen, 2008; Baayen, Davidson, & Bates, 2008). Fixed and random effects were only included if they significantly improved the model’s fit in a backward stepwise model selection procedure. Models were selected using chi-squared log-likelihood ratio tests with regular maximum likelihood parameter estimation. Trial order was included to control for longitudinal task effects such as fatigue or habituation. Although target words and prime words were matched on logarithmic word frequency (Zipf frequency; Van Heuven et al., 2014) across item types (truly suffixed, pseudo-suffixed, non-suffixed), we included logarithmic target frequency and logarithmic prime frequency as covariates in the analyses.

A linear mixed-effects model as implemented in the lme4 package (Bates, Maechler, Bolker, & Walker, 2014) in the statistical software R (Version 3.0.3; RDevelopmentCoreTeam, 2008) was created with five fixed effects factors (prime type: related, unrelated; item type: truly suffixed, pseudo-suffixed, non-suffixed; trial order; logarithmic target frequency; logarithmic prime frequency), their interactions, and two random effects factors (random intercepts for subjects and items). The lmer default coding for treatment contrasts was used for factor prime type (i.e., reference level “related”). Pairwise comparisons of factor item type were obtained by setting different baseline conditions within the same model (i.e. using lmer function ‘relevel’). P values were determined using the package lmerTest (Kuznetsova, Brockhoff, & Christensen, 2014). The model was refitted after excluding data-points whose standardised residuals were larger than 2.5 in absolute value (1.2 %; see Baayen, 2008). Factors trial order and logarithmic prime frequency were not significant and was therefore excluded from the analyses. The linear mixed-effects model estimated pairwise level-by-level analyses for fixed-effects factor analyses, yielding three prime type by item type comparisons. RT analyses revealed a significant priming effect in the truly suffixed condition (t = 4.91, p < 0.001; less than the Bonferroni corrected value) and in the pseudo-suffixed condition (t = 3.96, p < 0.001; less than the Bonferroni corrected value). No priming was obtained in the non-suffixed condition (t = 1.20, p = 0.229). Moreover, the results revealed that priming was significantly greater in the truly suffixed than in the non-suffixed condition (t = 2.57, p = 0.010) and priming in the pseudo-suffixed condition was significantly greater than priming in the non-suffixed condition (t = 1.98, p = 0.047), but there was no significant difference between the priming in the truly suffixed and pseudo-suffixed conditions (t = 0.53, p = 0.594). In addition, there was a significant effect of logarithmic stem frequency (t = 6.31, p < 0.001). No other effects were significant.

Error analyses followed the same logic as the RT analyses. We applied a binomial variance assumption to the trial-level binary data using the function glmer as part of the R-package lme4. The results revealed that participants made significantly fewer errors in the truly suffixed condition, compared to the pseudo-suffixed and non-suffixed conditions (z = 2.45, p = 0.014; z = 2.08, p = 0.038, respectively), but there was no difference between the pseudo-suffixed and non-suffixed condition (z = 0.38, p = 0.704). No other effects were significant.

Discussion

The goal of the present study was to test the recent claim that previously observed pseudo-suffixed priming effects (see Rastle & Davis, 2008, for a review) have only been obtained due to the large number of partially transparent items in this condition (such as sniper or crafty; Baayen et al., 2011). According to Baayen et al. (2011), the fact that the pseudo-suffixes (−er and -y in these examples) actually carry their true meaning/function in the pseudo-suffixed words (i.e., a sniper is someone who does something, crafty is an adjective) is the reason why pseudo-suffixed priming effects have been observed in prior research. In discussing how their Naive Discriminative Reader (NDR) model could simulate the statistically equivalent opaque and transparent priming effects observed by Rastle et al. (2004), Baayen et al. (2011) state that “…the reason that the transparent and opaque conditions give rise to a similar priming effect…is because orthographic representations for the suffix (unigrams, bigrams) have become associated with the suffix meaning. Crucially, these associations can emerge because for the majority of the opaque items, the suffix is fully functional in the meaning of the complex word (p. 466).”

To test this account of pseudo-suffixed priming, we conducted a masked primed lexical decision experiment comparing truly suffixed (e.g., hunter-HUNT) and pseudo-suffixed prime-target pairs (e.g., corner-CORN) relative to a non-suffixed control condition (e.g., cashew–CASH). In contrast to previously conducted research, we used a set of revised and more stringent stimulus criteria to select our set of pseudo-suffixed prime words. Most importantly, the pseudo-suffixed words were selected such that the pseudo-affix never conveyed any semantic content (e.g., er in sniper, where er means “someone who does something”) or syntactic function (e.g., y in crafty, where y turns the word into an adjective) that matched the meaning or syntactic category of the pseudo-suffixed word. The results of the present study revealed robust and equal magnitudes of priming in the truly suffixed and pseudo-suffixed conditions, while priming was absent in the non-suffixed control condition. Our findings thus unambiguously demonstrate that complex words are decomposed into morphemic subunits during the early stages of visual word recognition, independently of whether they bear a true morphological structure (e.g., hunt + er) or a pseudo-morphological structure (e.g., corn + er).

The present results clearly challenge amorphous theories of visual word recognition, and in particular the NDR model of Baayen and colleagues (2011). The NDR model accounts for “morphological” phenomena observed during the processing of written language on the basis of learned associations between sublexical orthography and semantics, without any role for any kind of morpho-orthographic segmentation mechanism. Thus, in the NDR model, the orthographic representation of an embedded stem in truly suffixed words (e.g., hunter) is associated with the same meaning as the stem when it stands as a separate word (hunt), whereas an embedded pseudo-stem in pseudo-suffixed words (e.g., corner) is associated with a different meaning. Moreover, associations between a suffix and semantics are only implemented if the suffix conveys a regular meaning (for further discussion, see Baayen et al., 2011, pp. 465–467). Thus, for pseudo-suffixed words in which there is no synchronic suffix (as in corner), no suffix meaning is assigned. Hence, the model predicts no priming from truly pseudo-suffixed primes, or at least no difference between the pseudo-suffixed prime condition and the orthographic prime condition, which is inconsistent with the present results.

On the other hand, the equivalent magnitudes of priming in the truly suffixed and pseudo-suffixed conditions are in line with a considerable number of studies suggesting that there must be an automatic, fast-acting morpho-orthographic segmentation mechanism that rapidly decomposes morphologically complex letter strings into morphemic units, independently from semantics (Beyersmann, Castles, & Coltheart, 2011; Diependaele et al., 2005, 2009; Duñabeitia, Perea, & Carreiras, 2007; Rastle & Davis, 2008; Rastle et al., 2004; Taft, 2003; Taft & Ardasinski, 2006; Taft & Nguyen-Hoan, 2010). It has been proposed that morpho-orthographic segmentation is guided by some kind of ‘affix-stripping’ mechanism (firstly introduced by Taft & Forster, 1975), that matches input letter sequences with higher-level affix representations, while the remaining letter string activates the representation of the stem. The notion that affixes play an active role during the reading of morphologically complex words is further supported by evidence from masked affix priming studies (e.g., Crepaldi, Hemsworth, Davis, & Rastle, 2015; Duñabeitia, Perea, & Carreiras, 2008; Giraudo & Grainger, 2003), suggesting that the recognition of suffixed target words is facilitated by the prior presentation of a suffixed prime which shares the same suffix (e.g., baker-WALKER).

Finally, we note that the pattern of results obtained in the present study may be specific to the lexical decision paradigm. Recent evidence suggests that morpho-orthographic segmentation might not be obligatory, as indicated by the fact that morpho-orthographic effects were not found in a masked primed semantic categorisation task (Marelli, Amenta, Morone, & Crepaldi, 2013), nor in a same–different judgment task (Duñabeitia, Kinoshita, Carreiras, & Norris, 2011). That is, semantically blind morphological decomposition appears to be at least partially task-dependent (Amenta, Marelli, & Crepaldi, 2015).

In conclusion, the presented evidence is clearly inconsistent with the claim that previously reported morpho-orthographic priming effects observed with the lexical decision task were only carried by the large proportion of transparent suffixes in the pseudo-morphological priming condition. Hence, our findings speak against amorphous theories of morphological processing that predict that there should be no priming from pseudo-suffixed prime stimuli.