Introduction

Word knowledge is updated not only by the acquisition of novel words, but also by the learning of new meanings for words with established meanings. For example, one learns “skate” is a kind of fish well after they know its more common meaning related to ice or roller skating. Compared to learning novel words, learning new meanings for known words involves stronger interactions between new and prior knowledge. How these interactions affect the learning of new meanings and the processing of prior word knowledge remains unclear.

Previous studies have found that prior word knowledge affects the acquisition of new meanings for known words. For example, familiar word forms allow more cognitive resources available for meaning learning and therefore facilitate the learning of associations between known words and their new meanings (Storkel & Maekawa, 2005; Storkel, Maekawa, & Aschenbrenner, 2013). Meanwhile, some studies show that the interference from original meanings of familiar words impedes the learning of new meanings (Casenhiser, 2005; Doherty, 2004; Mazzocco, 1997). A recent study reconciled the inconsistency by finding that the facilitation from familiar word forms wanes as differences in form familiarity decrease with exposure, while interference from the original meaning emerges over learning trials especially for highly familiar words (Fang, Perfetti, & Stafura, 2017).

In contrast to contextually bound short-lived associations, robust word learning entails long-term retention and future accessibility. However, the long-term effects of prior word knowledge on the retention of new meanings of known words have been even less studied (although see Rodd et al., 2012; Storkel & Maekawa, 2005). If interference continues to play a dominant role after learning, as suggested by Rodd et al. (2012), new meanings would be forgotten faster over time when the competition between new and original meanings is stronger. However, the co-activation hypothesis of word learning predicts that strong co-activation between new and prior word knowledge benefits the integration of new knowledge (Fang et al., 2017). Learners have more accessible and richer knowledge about more familiar words. This leads to a stronger interaction between new information (i.e., a new meaning) and existing knowledge for more familiar words during learning and subsequent memory processes. The result is stronger connections between new and prior knowledge that support better long-term retention for new meanings of more familiar words.

According to the complementary learning systems (CLS) model, newly learned word knowledge is initially an episodic memory, assumed to be hippocampus-based before being gradually transformed to neocortex-based semantic memory through memory replay during offline consolidation (Davis & Gaskell, 2009; Mcclelland, Mcnaughton, & Oreilly, 1995). Most of the previous studies examined the integration of new knowledge by using an indicator of interaction between new and existing knowledge. The indicator is often lexical competition between novel words and existing words whose orthographic or phonological form is similar to the newly learned words (e.g., Bakker, Takashima, van Hell, Janzen, & McQueen, 2014; Bowers, Davis, & Hanley, 2005; Gaskell & Dumay, 2003; Qiao & Forster, 2013; Tamminen & Gaskell, 2008; Wang et al., 2016); in other studies, the indicator has been meaning-based effects observed in semantic priming or interference paradigms (Coutanche & Thompson-Schill, 2014; Geukes, Gaskell, & Zwitserlood, 2015; Tamminen & Gaskell, 2013; Tham, Lindsay, & Gaskell, 2015). For example, in Tamminen and Gaskell (2013), participants learned novel words with meanings, and then made lexical decisions on familiar words that were primed by newly learned words that were either semantically related or unrelated. Semantic priming from novel words was observable after a delay of one week, but not immediately after learning. The finding seems to support a crucial role of offline consolidation in the integration of new word knowledge, on the assumption that semantic priming reflects automatic meaning processing.

In contrast to previous studies that focus on the new connections between newly learned words and known words at either word form or meaning level, a recent event-related potential (ERP) study investigated how existing form-meaning connections can be affected by recent learning experience (Fang & Perfetti, 2017). In the study, participants learned new meanings for novel words (e.g., “having a tight schedule” for “tasdite”) and known words (e.g., “causing fever” for “plenty”). Following the learning phase, participants performed a one-back task that did not require explicit meaning processing on a word list consisting of word pairs that included the trained words followed by words that probed either their new or original meanings. Consistent with Tamminen and Gaskell (2013), there was no reduced N400 for a word related to the new meaning of a previously known word (e.g., “germ” following “plenty”) or a novel word (e.g., “busy” following “tasdite”) that preceded it, a finding suggesting the absence of automatic meaning activation on the day of learning. However, a word that was related to the original meaning of a preceding known word (e.g., “enough” following “plenty”) showed a larger negativity at the Cz cluster especially within 500–700 ms when the known word had been paired with a new meaning than when it only had been exposed without a new meaning. This enhanced negativity was interpreted as an indicator of knowledge perturbation; i.e. the existing connection between the word form and its original meaning had become less accessible, at least momentarily, following the learning of an arbitrary new meaning. Thus, the general picture is that, before the integration of the new meaning has occurred through offline consolidation, existing form-meaning connections have been affected by new learning, at least when the new meaning is unrelated to the original meaning.

Although the perturbation effect has been observed on the day of learning in an implicit task in which meaning selection is not required (Fang & Perfetti, 2017), its nature and function are not clear. The perturbation effect may reflect the reactivation and subsequent suppression of existing form-meaning connections during the learning of new meanings. These processes may produce the changes in existing connections that seem necessary for the integration of new unrelated meanings (Dudai, Karni, & Born, 2015; Fernandez, Bavassi, Forcato, & Pedreira, 2016). If this is the case, the perturbation is more likely to occur when new meanings are learned for more familiar words, whose meanings are more strongly reactivated, thus requiring more suppression. We assume that the perturbation effect is a reduction in access to the original meaning; however, it is difficult to rule out the possibility that the effect is episodic rather than semantic, the result of having experienced the word in the context of a new meaning during the learning task. Neither is it clear whether the perturbation effect is strictly transient or more long-lasting. We return to both of these questions in the discussion.

Our study aimed to expose the role of knowledge interaction between original and newly learned form-meaning connections through the manipulation of word frequency. Participants learned new meanings for high and low frequency words and were exposed to high and low frequency words without new meanings to provide control for mere exposure (i.e., exposure controls). The retention of new meanings and the processing of the original meanings were then tested in three sessions over one week. The strength of existing connections between word forms and original meanings is stronger for high frequency words than for low frequency words (Perfetti & Hart, 2002), leading to stronger co-activation between new and original meanings in the learning phase. Thus, a frequency manipulation allows us to test the influence of prior knowledge (familiarity) on long-term retention of new meanings and on the perturbation of existing connections. A better long-term retention for high frequency words than low frequency words would suggest a beneficial effect of strong connections between word forms and original meanings in the long run, while the opposite pattern would indicate sustained interference from original meanings even during post-learning memory processes.

To study the influence of learning on existing form-meaning connections, we had participants perform a semantic relatedness judgment task based on the original meanings of trained words in each session. The comparison between words with new meanings and exposure controls provides direct evidence about perturbation of old knowledge. Specifically, a perturbation effect is indicated by longer decision times for words with new meanings compared to exposure controls. Because the learning of new meanings for high frequency words involves more reactivation and suppression of original meanings, we expected a more robust perturbation effect for high frequency words than for low frequency words. If perturbation is persistent, we would see perturbation after a delay of one day and one week.

Experiment 1

Participants

Thirty college students (16 females) participated in Experiment 1. All were native English speakers with normal or corrected-to-normal vision, right-handed and aged from 18 to 23 years (mean = 18.59). None had been diagnosed with a learning disability. Six additional participants who did not attend all three sessions were excluded from data analysis. Participants provided written informed consent before the experiment and received credits for their time to fulfill one of the course requirements. All experimental procedures were carried out with the approval of the local institutional review board.

Stimuli

Trained words and meaning probes

Forty high frequency words (above 30 per million) and 40 low frequency words (below 1 per million) were selected from the SUBTL database by Brysbaert and New (2009; see Table 1 for examples). Twenty-three undergraduates who did not participate in any other part of the study and were from the same subject pool rated their familiarity towards each word from 1 (unfamiliar) to 6 (familiar). The results showed higher ratings for high frequency words than for low frequency words (p < .001). In addition, high frequency words tended to be acquired earlier than low frequency words (Kuperman, Stadthagen-Gonzalez, & Brysbaert, 2012). Each of the selected words had only one meaning but may have more than one sense according to the Wordsmyth English Dictionary–Thesaurus (Parks, Ray, & Bland, 1998). High and low frequency words were matched on concreteness (Brysbaert, Warriner, & Kuperman, 2013) and number of senses (Parks et al., 1998), in addition to number of syllables, word length, and bigram frequency (Balota et al., 2007). Lexical and sublexical characteristics of the selected words are presented in Table 2. For each participant, half of the high and half of the low frequency words were paired with new meanings, and the other half were presented as exposure controls (see Appendix for the full list). The assignment of words to each of the two learning conditions was counterbalanced between participants.

Table 1 Examples of stimuli
Table 2 Lexical and sublexical characteristics of trained words

For each trained word, three probes that were semantically related to its original meaning were created and one was used in each of the three semantic relatedness judgment tasks on each session. Another 17 native English speakers who did not participate in any other parts of the study rated semantic relatedness between trained words and their probes on a scale of 1 (unrelated) to 6 (related). The collected rating data showed that semantic relatedness between trained words and probes were comparable between high and low frequency words on each of 3 days (overall, high frequency words = 4.802 ± .441, low frequency words = 4.768 ± .518; all ps > .37). Additionally, meaning probes of high and low frequency words were matched at number of letters (ps > .42) and number of syllables (ps > .20), while probes of high frequency words had a higher word frequency compared to those of low frequency words (ps < .08; see Table S1 for mean and SD).

New meanings

Forty new meanings were taken from a previous study (Fang et al., 2017). These new definitions were created to allow realistic conceptual mappings but with no overlap with existing words. The pairing between trained words and definitions was counterbalanced across participants such that each definition was paired with a high frequency word for half of participants and with a low frequency word for the other half of participants (see Appendix for all the pairings). To assess any inadvertent relation of the new meaning of a word to its actual meaning, we carried out a term-to-document Latent Semantic Analysis (LSA, http://lsa.colorado.edu), calculating the cosine of the angle between the resultant semantic spaces (Landauer & Dumais, 1997). The results showed very low LSA cosine values for both high and low frequency words with a mean of 0.003 (SD = 0.047) and 0.006 (SD = 0 .054), respectively, indicating that new meanings were not related to the original meanings.

Procedure

As shown in Fig. 1, participants learned new meanings for high and low frequency words in an associative learning paradigm on Day 1, and were tested on both the new and original meanings on each of the three days.

Fig. 1
figure 1

Schematic overview of tasks on each day

Learning phase

Participants had six learning trials for each word in a self-paced learning paradigm. Each trial began with a fixation for 500 ms, immediately followed by a visual presentation of a to-be-learned (trained) word. When participants were ready to learn the meaning, they pressed the space bar, which caused the appearance of either a definition (for words with new meanings) or a string of asterisks (for exposure controls). Participants pressed the space bar when they were ready for the next word. Each word was presented exactly once within a complete cycle of 80 individual trials, before the next cycle began. To facilitate learning, after the second (two learning trials for each word) and fourth cycles (four learning trials for each word), participants were tested on the new meanings of half the words from each condition by a meaning generation test (described below).

Tests on new meaning: meaning generation tests

Participants were presented with a trained word and asked to type its new meaning or type “n” (for “none”) if the word did not have a new meaning (i.e., exposure controls). To promote learning, immediate correct meaning feedback was presented in the two tests during the learning phase; all the post-learning tests including the last test on Day 1 and the tests on subsequent days occurred without feedback. Participants’ responses were rated from 0 (no response or no any related information is provided) to 5 (the exact meaning is provided) based on how close they were to the correct answers by two trained research assistants who were blind to the conditions. Averaged scores from two raters were assigned as final scores; however, when differences of ratings from two raters were larger than 1, inconsistencies were resolved through discussion before final scores were assigned. Only data from the post-learning tests are reported here.

Tests on new meaning: multiple-choice tests

In each trial, participants were presented with a trained word for 500 ms, followed by four options that included three new meanings (two that had been paired with some other word if the word had been paired with a new meaning) as the first three options and a string of asterisks as the fourth option. Participants’ task was to choose the correct meaning from four options by pressing one of the four number keys (from 1 to 4). Feedback about correctness and correct answer were always presented after participants’ response. Participants could choose to study the words again and pressed the space bar to continue to the next word when they were ready. Accuracy was recorded.

Testing on original meanings: semantic relatedness judgment tasks

Participants were shown pairs of words with only one word on the screen each time: The first word was a trained word; the second or probe word was either related or unrelated to the original meaning of the trained word. The first word was presented for 500 ms, followed immediately (ISI=0) by the second word. Participants’ task was to judge whether or not two words were related based on the original meaning of the first word by pressing keys using the right or left index finger. The next trial began immediately following participants’ response. There were 80 related word pairs and 80 unrelated word pairs in each task. The unrelated word pairs were created by shuffling the related word pairs. Participants familiarized themselves with the task through a short practice, in which they made judgments on ten word pairs that did not consist of any trained words or probes. Participants performed the task as the first task on both Day 2 and Day 8 to minimize the influence of recent retrieval of new meanings.

Statistical analysis

Data from each task were analyzed with mixed-effects models (Baayen, Davidson, & Bates, 2008) using lme4 package in R. Accuracy data were analyzed with logistic regression after log transformation, others with linear regression. Random effect terms included intercepts of subject and item (training word); a by-subject or by-item slope was added if model comparisons showed a significant contribution and models converged. Final models are reported.

For the meaning generation and multiple-choice tests, only responses to trained words with new meanings are of interest (data for exposure controls are presented in Fig. S1); fixed effects in the models included Frequency, Day, and their interaction. The fixed effects were treatment-coded: High frequency words provided the reference level of Frequency; the reference level of Day was Day 1 and both Day 2 and Day 8 were compared to Day 1. Thus, the intercepts in the models represented performance for high frequency words on Day 1. These coding procedures allowed us to compare the effect of Frequency on Day 1 with our previous findings (Fang & Perfetti, 2017; Fang et al., 2017) and to examine the change of patterns over one day and one week with the smallest number of models. Following significant interactions, contrast analyses were conducted to reveal the Frequency effect on Day 2 or Day 8. To better characterize the decay of the memories of new meanings following, we defined the retention rate over one week as participants’ performance on Day 8 relative to their performance on Day 1. The retention rate was calculated for each test for each participant and compared across high and low frequency words using a paired t test.

For the semantic relatedness judgment tasks, decision times from related and unrelated trials were analyzed separately, because the related conditions were more comparable with the contrast regarding perturbation effect in our previous study (Fang & Perfetti, 2017). Because participants made the “related” responses with their dominant hand and “unrelated” responses with the non-dominant hand, any relatedness effects would not be easily interpreted. For completeness, the results of modeling including both related and unrelated conditions are reported in Supplementary Tables S3 and S4. In addition to incorrect trials, trials with decision times beyond 2.5 standard deviations from the mean or shorter than 200 ms were excluded before modeling (affecting 3.1% of the remaining data). Fixed effects included TrainingType, Frequency, Day, and their full interactions. The coding of Frequency and Day was the same as in the models for tests on new meanings. The reference level of TrainingType was exposure control. Following significant interactions involving TrainingType, contrast analyses were conducted to reveal TrainingType effect on high or low frequency words on each day. Accuracy data of the task were also analyzed but presented in Table S2, as they were of less interest.

Results and discussion

Retention of new meanings

Results of meaning generation and multiple-choice tests are shown in Fig. 2 and Table 3. In the meaning generation tests, participants learned the new meanings of low frequency words better than high frequency words on Day 1 (t = 2.234, p = .026). However, over one day and one week, the relevant memories about new meanings of low frequency words decayed faster than those of high frequency words (ps < .05 for both interactions between Day and Frequency); high and low frequency words did not differ on Day 2 or Day 8 (Day 2: t = .161, p = .872; Day 8: t = -1.61, p = .238). The retention rate for high frequency words (64.7 ± 21.4%) was higher than for low frequency words (52.1 ± 18.6%; t(29) = 4.300, p < .001).

Fig. 2
figure 2

Performance on trained words with new meanings in the meaning generation (left) and multiple-choice (right) tests in Experiment 1. Error bars represent ±1 SEM. *: p < .05; ~: p < .10

Table 3 Fixed effect estimates for mixed effects models of learning performance in Experiment 1

Very similar patterns were found in the multiple-choice tests: Participants were better at recognizing the new meanings of low frequency words on Day 1 (z = 2.818, p = .005), but their memory for new meanings of high frequency words showed faster decay over one day and one week (ps < .01 for both interactions). The low frequency advantage was gone by Day 2 (p > .90), and reversed, at marginal significance, by Day 8 (z = -1.728, p = .084). Again, the retention rate was higher for high frequency words (87.3 ± 16.8%) than for low frequency words (77.8 ± 16.4%; t(33) = 3.413, p = .002).

The influence of learning on existing meanings

As shown in Fig. 3 and Table 4, Day 1 showed no perturbation effect; neither TrainingType nor its interaction with Frequency were significant in either related or unrelated trials (ps > .45). Neither was there evidence for perturbation on Day 2 or Day 8, as indicated by the insignificant two-way interactions between Day and TrainingType in either related or unrelated trials (ps > .27) or the three-way interactions between Day, TrainingType, and Frequency (ps > .10). Instead, Day 8 decision times for low frequency words with new meanings were marginally faster than exposure controls in related trials (t = 1.731, p = .083).

Fig. 3
figure 3

Response time (RT) difference between words with new meanings and exposure controls in related (left) and unrelated (right) trials in Experiment 1 (SOA = 500 ms). Error bars represent ±1 SEM

Table 4 Fixed effect estimates for mixed effects of semantic relatedness judgment task in Experiment 1 (SOA = 500 ms)

Generally, the perturbation effect observed in Fang and Perfetti (2017) using ERPs during an implicit task was absent in behavioral results when participants were instructed to make semantic judgments on the original meanings. Instead, we observed a trend of facilitation for learning new meanings for low frequency words one week following learning.

It is possible that perturbation is observable only during a narrow window of word processing when selective access to original meaning is required. The SOA of 500 ms used in Experiment 1 may have exceeded this window, providing sufficient time to stabilize the representations of the original meaning or to suppress the activation of the newly learned meaning. The non-significant difference between high and low frequency exposure controls also indicated that the SOA was too long, especially when participants had multiple exposures to both high and low frequency words prior to the task. If this is the case, then a much shorter SOA should expose a perturbation effect on the word’s connection to its original meaning. Accordingly, Experiment 2 used a shorter SOA of 200 ms, a time window during which word processing shows wide spread activation prior to specific meaning selection (Kintsch & Mross, 1985; Van Petten & Kutas, 1987). Experiment 2 also aimed to replicate the advantage for high frequency words in the long-term retention of new meanings.

Experiment 2

Participants

Thirty-four college students (25 females, age 18–20 years, mean age = 18.43) were recruited under the same standards described in Experiment 1. None had participated in any other part of the study. Seven additional participants were excluded because they did not attend all three sessions (n = 5), misunderstood the instruction of semantic relatedness judgment task (n = 1), or had overall accuracy lower than chance level in the multiple-choice test on Day 1 (n = 1).

Stimuli

The stimuli were the same as in Experiment 1.

Procedure

The procedure was the same as in Experiment 1, except that the SOA in the semantic relatedness judgment tasks was shortened to 200 ms (150 ms for the exposure duration of word 1 plus 50 ms ISI).

Statistical analysis

The data were analyzed in the same way as described in Experiment 1. In the semantic relatedness judgment tasks, data from two participants with the overall accuracy lower than 70% were excluded from analysis,Footnote 1 and outliers on decision times were excluded under the same standard described in Experiment 1 (affecting 2.9% of the remaining data).

Results and discussion

Retention of new meanings

As shown in Fig. 4 and Table 5, on Day 1 the performance on high and low frequency words was comparable, non-significantly higher for low frequency words (meaning generation: t = .404, p = .686; multiple-choice: z = 1.419, p = .156). In addition, memories of low frequency words decayed faster than high frequency when tested after one week (meaning generation: t = -2.460, p = .014; multiple-choice: z = -3.206, p < .001), but not when tested after one day (ps > .50 for both interactions). On Day 8, performance for low frequency words was marginally better than that of high frequency words in the multiple-choice test (z = -1.824, p = .068), and the same pattern was found in the meaning generation test (t = -1.259, p = .208). As observed in Experiment 1, the retention rate was higher for high frequency words than for low frequency words, in both meaning generation (65.7 ± 19.6% vs. 58.8 ± 21.4%, t(33) = 2.62, p = .013) and multiple-choice tests (89.7 ± 18.2% vs. 78.9 ± 15.1%, t(33) = 3.410, p = .002).

Fig. 4
figure 4

Performance on trained words with new meanings in the meaning generation (left) and multiple-choice (right) tests in Experiment 2. Error bars represent ±1 SEM. ~: p < .10

Table 5 Fixed effect estimates for mixed effects models of learning performance in Experiment 2

The influence of learning on original meanings

Results are shown in Fig. 5 and Table 6. In the related conditions, high frequency words on Day 1 showed a perturbation effect, as indicated by longer decision times for words with new meanings than exposure controls (t = 2.727, p = .006). A significant interaction of TrainingType with Frequency on Day 1 indicates stronger perturbation for high frequency words than for low frequency words (t = -2.672, p = .002). A contrast analysis showed that low frequency words did not show an effect of TrainingType on Day 1 (t = -0.902, p = .367). The perturbation effect observed on high frequency words was significantly weaker after a delay of one day or one week, as indicated by the significant interactions between Day and TrainingType (both ps < .05). In fact, the effect was absent after the first day when the effect of TrainingType was examined on each day (Day 2: t = -1.227, p = .220; Day 8: t = -.925, p = .355). For low frequency words, decision times for words with new meanings were faster than for exposure controls when tested one week later (t = -2.246, p = .025) but not one day later (t = -.908, p = .364). In the unrelated conditions, we did not see any effect of TrainingType on high or low frequency words on any of the 3 days (ps > .32).

Fig. 5
figure 5

Response time (RT) difference between words with new meanings and exposure controls in Experiment 2 (SOA = 200 ms). Error bars show ±1 SEM

Table 6 Fixed effect estimates for mixed effects of semantic relatedness judgment task in Experiment 2 (SOA = 200 ms)

To summarize, with a shorter SOA of 200 ms, learning new meanings led to slower processing of original meanings of high frequency words immediately following learning, but not after a delay of one day or one week. In contrast, low frequency words showed a benefit of learning new meanings on retrieving the original meanings, an effect that emerged one week after learning, consistent with Experiment 1. Additionally, Experiment 2 replicated the Experiment 1 finding that memories for new meanings of low frequency words decayed faster than those of high frequency words.

General discussion

The experiments addressed both the fate of newly learned meanings and the effects of learning these new meanings on the existing form-meaning connections. Both experiments showed a slower decay of newly-learned meanings of high frequency words compared to those of low frequency words. In semantic relatedness judgments targeting the original meanings, we found longer decision times for words that had been paired with new meanings than for exposure controls, indicating perturbation of existing connections. A perturbation effect was found on high frequency words on the day of learning when the SOA was short (200 ms, Experiment 2), although not when it was 500 ms (Experiment 1). In contrast, not only did low frequency words not show a perturbation effect, accessing their original meanings showed a facilitative effect one week following learning, especially when the SOA was short.

Long-term retention of new meanings

The slower decay of memories of new meanings of high frequency words compared with those of low frequency words over one week is consistent with the co-activation hypothesis of word learning (Fang et al., 2017). When prior knowledge is more available and more elaborated, co-activation of new and prior knowledge is stronger. Such co-activation may involve competition or interference between meanings, especially during selective meaning access, and does not necessarily benefit the initial formation of new associations. However, it does benefit long-term retention, a crucial part of word learning, even though one week is still relatively short. Consistent with this, a recent study by van Kesteren, Krabbendam, and Meeter (2017) showed that the reactivation of prior word knowledge benefits memory integration. In their study, participants first learned AB associations between novel words and their picture referents, and then were cued to recall AB associations before they continued to learn AC associations between novel words and definitions. The results showed that when participants were more successfully reactivating AB memories they were better inferring indirect BC associations between picture referents and definitions, which indicates better memory integration, regardless of congruency between B and C.

The interaction between word frequency and test time can be an example for that desirable difficulty benefits long-term retention (Bjork, 1994; Bjork & Kroll, 2015). Here, the desirable difficulty is mainly a result of interference from the original meanings. Those new meanings that survive stronger interference are more resistant to decay over time. Alternatively, participants had a higher chance to encounter high frequency words (e.g., “plenty”) in their daily life between sessions compared to low frequency words. Therefore, they are more likely to be cued incidentally about their new meanings, leading to post-learning retrieval practice that contributes to better long-term retention (Roediger & Butler, 2011).

We predicted better learning of new meanings for low frequency words than for high frequency words on Day 1 and we found such effect in Experiment 1. Experiment 2 showed the same pattern, but with a low level of reliability. As proposed by Fang et al. (2017), word familiarity can have a biphasic effect on the learning of new meanings, serving to support stimulus (word form) encoding but also create stronger response interference (word meaning), with the first phase showing more encoding facilitation and later phases showing more meaning interference. The low reliability of frequency effect on Day 1 in Experiment 2 could be a mixture of both effects when participants were tested.

Perturbation of existing knowledge

In the semantic relatedness judgment tasks, even though the original meanings had been learned long before the new meanings, we found longer decision times for high frequency words with new meanings than for exposure controls when SOA was short. In contrast to Fang and Perfetti (2017), in which participants performed a one-back task that did not require explicit meaning processing, participants in the current study were explicitly instructed to focus on the original meanings when making the judgments. Thus, the perturbation effect can be detected both in an implicit task using ERPs and in a behavioral task requiring selective meaning access – provided processing time is limited. Findings from these studies provide convergent evidence for the perturbation of existing connections between word forms and original meanings immediately following learning. Such a perturbation effect seems to result from a transient interaction of new and existing knowledge during learning, rather than a continuing offline process.

The perturbation effect was observed on high frequency words but not on low frequency words, consistent with our hypothesis that perturbation reflects the strength of the connection between the word form and its original meaning. The original meaning of high frequency words is more accessible for reactivation and interaction with new meanings than the original meaning of low frequency words. The interaction may include the suppression of original meanings in order to learn new meanings; the need for suppression is higher for high frequency words, because their meanings are more accessible. The perturbation may also reflect a labile status of the previously encoded memory (i.e., word form-original meaning associations) following its reactivation during the learning phase (Dudai et al., 2015; Fernandez et al., 2016). Such a labile status can be especially important for later incorporation of new unrelated meanings and may contribute to better long-term retention of new meanings for high frequency words.

Similarly, an immediate change of inhibitory connections between familiar words has also been reported (Kapnoula & McMurray, 2016). In the study, two groups of participants performed a series of tasks in which either phonologically similar (e.g., neck and net) or different (e.g., neck and park; net and bait) words were presented together and therefore distinguishing the similar words was either needed or not. Immediately following the tasks, lexical competition among phonologically similar words was tested using a visual world paradigm where participants were presented with four pictures each time and asked to fix their eyes on the picture that an auditory word referred to. The activation of competitors was temporarily boosted by presenting auditory words (e.g., neckt) created by cross splicing the onset of a competitor (e.g., neck) and the ending of the target word (e.g., net). When cross-spliced words rather than target words were presented, only participants who had distinguished the phonologically similar words repeatedly were able to recover from the interference and showed a comparable ratio of fixations on the target pictures, suggesting the strengthening of inhibitory connections. While the complementary learning systems model emphasizes the importance of offline consolidation for the existing knowledge to be affected by learning experience (Davis & Gaskell, 2009), these findings suggest that immediate changes to prior word knowledge can occur without offline consolidation.

While the perturbation effect may be related to episodic retrieval of new meanings and the resulting meaning competition, it is not wholly a result of episodic retrieval of new meanings. The behavioral tests on new meanings showed that new meanings of high frequency words were not learned better than those of low frequency words in either experiment. Furthermore, Fang et al. (2017) found faster access to the new meanings of low frequency words than those of high frequency words on the day of learning, even though performance in tests on recognition and cued-recall of new meanings was comparable between high and low frequency words. If the perturbation resulted merely from the reactivation of the new meanings before or during decision-making, we would predict a result opposite to what we found here – stronger perturbation for low frequency words than for high frequency words. Furthermore, according to the complementary learning systems model, new meanings are not automatically activated when they are represented in the format of episodic memory before overnight consolidation occurs. This assumption has been supported by our previous study finding no semantic priming from new meanings of known words on the day of learning (Fang & Perfetti, 2017). However, it is impossible to fully exclude the influence of such episodic retrieval when participants are making semantic judgments on the original meanings.

If new meanings are integrated after offline consolidation and can be automatically accessed at the presence of words, meaning learning will lead the change of a word from an unambiguous word to an ambiguous word (homonym in particular). Previous studies have shown that multiple meanings of an ambiguous word can be simultaneously activated within 200 ms (Kintsch & Mross, 1985; Van Petten & Kutas, 1987), and meaning competition would lead to a disadvantage in processing in a semantic task (e.g., Hino, Lupker, & Pexman, 2002; Piercey & Joordens, 2000). One explanation for the absence of such an effect is that, retrieving the original and dominant meanings on Day 1, as required by the semantic judgment task, may make meaning selection easier when participants performed the same task on the second or third session so that no perturbation is detected with the same SOA. Indeed, recent studies have shown that recent exposure of a particular meaning can change the relative meaning dominance of an ambiguous word (Rodd et al., 2016; Rodd, Lopez Cutrin, Kirsch, Millar, & Davis, 2013). One potential mechanism is that encountering one meaning in the same task (s) helps establish context nodes that allow task- or context-dependent meaning selection, as has been argued to explain the increased efficiency in meaning selection for ambiguous words (Armstrong & Plaut, 2008). However, it is also possible that the amount of new learning was not sufficient to produce longer-term perturbation effects on the existing form-meaning connections. Additional training over days would allow a test of this explanation.

Whereas learning new meanings slowed down the access to original meanings of high frequency words immediately after learning, it sped up the processing on low frequency words one week later. When participants encounter less familiar words, learners may try to recollect or verify the original meanings during attempts to learn the new meanings. The “re-learning” of original meanings did not lead to faster reaction times until one week later, suggesting reconsolidation of memory of word form-original meanings (Forcato, Fernandez, & Pedreira, 2014). We did not see such a pattern for high frequency words; it may be because the meaning connections of high frequency words are already too strong for further strengthening from reconsolidation. However, given that the three-way interaction was not significant (see Tables 4 and 6), further research is required to explain the observed effects.

An interesting pattern emerged concerning the long-term fate of original meanings and new meanings. For low frequency words, new meanings were remembered less well over time, compared with high frequency words. However, knowledge about the original meanings was boosted for low frequency words. It is possible that enhancing prior knowledge is easier than learning new information and the improvement of prior knowledge will benefit the learning of new meanings in the long run. Although more research is needed to reveal the mechanism underlying the trade-off, it seems unlikely that participants apply different strategies in learning new meanings for high and low frequency words, because high and low frequency words were intermixed in each task.

Finally, we draw attention to a recent study that broadens conclusions about the fate of originally learned meanings. Maciejewski, Rodd, Mon-Williams, and Klepousniotou (2018) manipulated the semantic relatedness between new and original meanings, providing training for 30 min per day over 4 days. Participants made semantic relatedness judgments on original meanings prior to training on Day 1 and again on Day 5. In an experiment that required semantic judgments on pairs of words with a short SOA of 200 ms, judgments became slower after words had been paired with new meanings (compared to untrained words); furthermore, this effect was greater when learning involved unrelated meanings than when it involved related meanings. There are a number of differences between our study and that of Maciejewski et al. (2018), especially in training paradigms and stimuli, and these differences may influence the learning results for new meanings and for access to original meanings. Nevertheless, both studies found that slower access to original meanings occurred on the first test following learning. The finding from Maciejewski et al. (2018) that access to original meanings is affected by their relatedness to the new meanings is congruent with the perturbation hypothesis: Suppression is more necessary when the new meaning is unrelated to the original meaning. When the meanings are related, co-activation may have a mix of facilitative and inhibitory effects.

Conclusion

In the learning of new meanings for known words, strong interactions between new information and prior word knowledge benefit the long-term retention of new meanings. Such a facilitation effect occurs along with a perturbation effect, in which the original meaning of a familiar word is made momentarily less accessible immediately after learning. This effect is observable in both explicit and implicit meaning tasks.