Memory Retrieval in Online Sentence Parsing: Empirical Evidence, Computational Modelling, and Simulations

Fujita, Hiroki

doi:10.1007/s42113-024-00206-8

Memory Retrieval in Online Sentence Parsing: Empirical Evidence, Computational Modelling, and Simulations

Research
Open access
Published: 01 July 2024

(2024)
Cite this article

Download PDF

You have full access to this open access article

Computational Brain & Behavior Aims and scope Submit manuscript

Memory Retrieval in Online Sentence Parsing: Empirical Evidence, Computational Modelling, and Simulations

Download PDF

Hiroki Fujita ORCID: orcid.org/0000-0001-7649-9707¹

190 Accesses
Explore all metrics

Abstract

This paper reports two experiments (Experiments 1 and 2) and computational simulations designed to investigate and model memory retrieval processes during real-time sentence processing. Central to this study is the hypothesis that linguistic information serves as a cue to retrieve target representations from memory during dependency formation. The basis for this cue-based memory retrieval stems from research showing that non-target representations that match a set of retrieval cues interfere with target retrieval. The susceptibility to this similarity-based interference has been debated in the sentence processing literature, and various hypotheses and models have been formulated and developed. This issue is addressed empirically in Experiments 1 and 2, which investigated similarity-based interference in sentences with a floating quantifier. Bayesian linear mixed models and Bayes factor analyses suggested similarity-based interference. However, the patterns of interference were not consistent with existing theories and models. To reconcile these findings within the framework of cue-based memory retrieval, this paper implements the Revision Integrated Cue-Based (RICB) model based on the ACT–R architecture. This model assumes that structural information is heavily weighted and incorporates the notions of initial retrieval and revision. The results of the simulations indicate that the RICB model successfully predicts the observed data, highlighting the central role of structural information and revision in memory retrieval during real-time sentence processing.

Does semantic knowledge influence event segmentation and recall of text?

Article 26 March 2019

Does source reliability moderate the survival processing effect? The role of linguistic markers as reliability cues

Article Open access 10 June 2024

Inferential revision in narrative texts: An ERP study

Article 06 June 2015

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

When we read a sentence, we often need to recover information about previously encountered words and link it to the information provided by the current word in order to understand the sentence. Our brains do this easily and quickly in real time when we simply scan the sentence. Unravelling the mechanisms that underlie this real-time language comprehension is a major challenge in cognitive science.

A standard assumption in language comprehension research is that sentences have underlying syntactic structures (Chomsky, 1957; Crocker, 1996; L. Frazier, 1979; Gibson, 1991; Kimball, 1973; Phillips, 1996; Sturt, 1997) from which meaning is derived, and that these syntactic structures are assigned incrementally during real-time processing. Under this assumption, relations are established between elements within the syntactic structures rather than between words in the sentence. These relations are often called dependency relations or simply dependencies (e.g., Chomsky, 1956; Fodor, 1978). Specifically, a dependency relation R on a set of elements E is defined as a subset of E × E. Within the syntactic structures of a sentence, two elements are in a dependency relation if one of the elements requires the presence of the other in the syntactic structures, or the properties of one of the elements vary due to the presence of the other. For example, consider the following sentences where the subject noun phrase (NP) and the auxiliary verb are in a dependency relation.

(1a)
[_NP1 The sisters of [_NP2 the girls]] [_T were] walking back home.
(1b)
[_NP1 The sister of [_NP2 the girls]] [_T were] walking back home.

In (1a/b), the square brackets, labelled as NP1, NP2 and T, represent part of the syntactic structures underlying these sentences.^{Footnote 1} NP1 is the sentence subject NP, and “were” is an auxiliary verb. According to the definition of dependency relations, the elements derived from “The sisters…” and “were” belong to a dependency relation, given their covariance in form.

Recent studies argue that the formation of dependency relations during real-time sentence processing requires memory retrieval. One phenomenon that may shed light on the characteristics of this memory retrieval process is similarity-based interference. Assuming that the properties of words, such as number and gender, are represented as features, e.g., “the sisters” as NP_[+plural] and “the sister” as NP_[+singular], similarity-based interference refers to interference due to feature similarity. For example, the sentence in (1b) is ungrammatical because of the mismatch in number between NP1 and the auxiliary verb (“The sister…were”). It is known that processing difficulties arise when the parser recognises the violation of such a linguistic agreement (mismatch effects; e.g., Cunnings & Felser, 2013; M. Frazier et al., 2015; Fujita, 2024; Fujita & Yoshida, 2024; Giskes & Kush, 2021; Jäger et al., 2020; Kazanina et al., 2007; Kim et al., 2020; Pearlmutter et al., 1999; Sturt, 2003; Sturt & Kwon, 2024; Vasishth et al., 2008; Wagers et al., 2009). Importantly, the sentence in (1b) contains another NP (“the girls”), which is unrelated to the subject-verb dependency relation and matches the auxiliary verb in number. Previous work suggests that this feature-matching plural distractor reduces the size of mismatch effects at the auxiliary verb compared to when the distractor is singular and thus does not match the verb in number, as in (2) below (e.g., Dillon et al., 2013; Fujita & Cunnings, 2023; Kim et al., 2020; Tucker et al., 2015; Wagers et al., 2009). These findings have been interpreted as evidence that at the second dependency element (“were”), the parser uses its lexical features (e.g., [+plural]) as retrieval cues to retrieve the first dependency element (NP1) from memory, a process known as cue-based memory retrieval.

(2)
[_NP1 The sister of [_NP2 the girl]] [_T were] walking back home.

Similarity-based interference has been widely investigated in the context of cue-based memory retrieval. Despite this extensive research, several issues remain unresolved, in particular the conditions under which similarity-based interference occurs. For example, some studies argue that interference occurs in both grammatical and ungrammatical sentences (e.g., Vasishth & Engelmann, 2021), while others propose that it is restricted to ungrammatical sentences (e.g., Wagers et al., 2009). To date, a variety of cue-based hypotheses and models have been proposed and developed (e.g., Dillon, 2014; Dillon et al., 2013; Kush, 2013; Parker, 2019a; Parker & Phillips, 2017; Van Dyke & McElree, 2011; Vasishth et al., 2019; Vasishth & Engelmann, 2021; Wagers et al., 2009). Previous research has also suggested that distractor position might influence cue-based memory retrieval (e.g., Arnett & Wagers, 2017; Parker & An, 2018) and that similarity-based interference might be specific to certain types of dependencies, indicating potential variations in memory retrieval processes across different dependency types (e.g., Dillon et al., 2013; Orth et al., 2021).

Importantly, these unresolved issues are compounded by the claim that detecting similarity-based interference requires a large sample size, particularly in grammatical sentences, and that previous research has lacked sufficient statistical power. Experiments 1 and 2 were designed to address these issues. To overcome the statistical power limitation, each experiment included a substantial sample size as determined by a power analysis (640 participants and 24 item sets per experiment). Experiments 1 and 2 investigated similarity-based interference and the influence of distractor position by examining the processing of floating quantifiers (e.g., Koopman & Sportiche, 1991; Sportiche, 1988) and using relative clauses (e.g., Chomsky, 1965, 1977; de Vries, 2002; Kayne, 1994; Ross, 1967; Salzmann, 2019). Quantifier float, as exemplified in the sentence “John said that the girls recently all walked back home”, involves the separation of a quantifier (e.g., “all”) from its associate (e.g., “the girls”). According to the definition of dependency relations given earlier, these elements are in a dependency relation. The motivation for studying quantifier float stems from the lack of attention given to this dependency in the existing literature, even though its distributional properties provide an interesting test case for cue-based memory retrieval. Furthermore, there is currently conflicting evidence regarding the susceptibility to similarity-based interference in the quantifier float construction (Fujita & Cunnings, 2023). Therefore, the study of quantifier float potentially contributes to our understanding of the universality of similarity-based interference.

A brief overview of the results: Experiments 1 and 2 revealed the presence of mismatch effects, suggesting that memory retrieval during the real-time processing of floating quantifiers adheres to structural constraints. Similarity-based interference was observed in ungrammatical sentences, irrespective of the distractor position. However, there was no clear evidence for interference in grammatical sentences. To reconcile these findings within the framework of cue-based memory retrieval, a computational model was proposed and evaluated through simulations. The results of these simulations demonstrated that the proposed model successfully predicted the observed data.

Theories of Cue-Based Memory Retrieval and Computational Models

In language, some words seem to be associated with other words in certain ways, and these associations seem to be determined or influenced by the position of the words in the sentence. Any theory and model of memory retrieval during sentence processing must integrate this distributional aspect of language to identify what serves as the target representation. This paper assumes that each word projects hierarchical structures, as shown in Fig. 1, which illustrates part of the hierarchical structures underlying the sentence in (1a). Dependency relations are determined on the basis of these hierarchical structures. In the grammar adopted in this paper (Chomsky, 1981), an auxiliary verb basically forms a dependency with an NP in the specifier of a TP. In Fig. 1, NP1, but not NP2, occupies this position. Consequently, in (1a), there is a dependency relation between NP1 and the auxiliary verb. Throughout the paper, underlying syntactic structures are often represented by labelled square brackets, as in (1a/b).^{Footnote 2}

Assuming that dependency relations are determined or influenced by the positions of the elements in the syntactic structures, the assignment of grammatical structures is a prerequisite for the formation of grammatical dependencies during real-time sentence processing. A large body of research has suggested that real-time language comprehension involves the assignment of grammatical syntactic structures (e.g., Aoshima et al., 2004; M. Frazier et al., 2015; Hall & Yoshida, 2021; Kazanina et al., 2007; Kush et al., 2017; Omaki & Schulz, 2011; Phillips, 2006; Schneider & Phillips, 2001; Yoshida et al., 2013, 2014). However, as discussed in the Introduction, previous studies have suggested that online dependency formation does not strictly adhere to structural constraints. Some of these studies claim to be consistent with theories of cue-based memory retrieval, which argue that the lexical features of the second dependency element (e.g., [+plural] from “were”) serve as retrieval cues (e.g., the number cue) to retrieve the first dependency element (e.g., NP1) from memory. These theories must postulate the existence of retrieval cues derived from the underlying syntactic structures (see Alcocer & Phillips, 2012; Kush, 2013).^{Footnote 3}

In the language comprehension literature, there are several models that incorporate cue-based memory retrieval. The activation-based model (Lewis & Vasishth, 2005; Vasishth & Engelmann, 2021), implemented on the basis of the Adaptive Control of Thought–Rational (ACT–R; Anderson, 2007; Anderson et al., 2004; Anderson & Lebiere, 1998) architecture, is one such model. In this model, each element is assumed to have an activation value. During memory retrieval, the element with the highest activation that reaches a retrieval threshold is retrieved. The higher the activation value, the faster the memory retrieval. In essence, the total activation (T) of a given element i is calculated by summing the values of the base-level activation (B), the spreading activation (A), and the stochastic noise (N). The calculation of these activation values is summarised in Table 1.

Table 1 Some crucial components of the activation-based model. i = element, j = cue

Full size table

The base-level activation represents previous retrievals and time-dependent decay. Most important for the present study is the spreading activation. In the activation-based model, cue weights are assumed to be uniformly distributed across cues (i.e., 1/|Cue|). The associative strength is calculated as follows: S_ji = m + ln (P(i| j)), where m is a constant and refers to the maximum strength of association that i can have. The rightmost term is a conditional probability that calculates the probability of i when j is present. A standard assumption in sentence processing research is that all i that match j are equally likely when j is present. Therefore, the conditional probability is simply the reciprocal of the number of i in memory that match j. If multiple i match j, the associative strength between each i and j decreases (the fan effect; Anderson, 1974). Lower activation leads to longer retrieval times because the retrieval time (RT) of i is calculated by the product of F and an exponential function with a negative exponent determined by the total activation.

Increased retrieval times due to the fan effect are referred to as inhibitory interference (Dillon et al., 2013). For example, the activation-based model predicts inhibitory interference in “(1a) [_NP1 The sisters of [_NP2 the girls]] were…” because NP1 and NP2 match the number cue, leading to a reduction in spreading activation to NP1 from this cue (S_num, NP1 ≈ .31; feature match = 1, m = 1, num = number). This contrasts with sentences such as the one in (3) below, where NP2 does not match the number cue (“the girl…were”), and consequently, there is no reduction in spreading activation to NP1 from the number cue (S_{num, NP1} = 1; feature mismatch = 0).

(3)
[_NP1 The sisters of [_NP2 the girl]] were walking back home.

As mentioned in the Introduction, a large body of research has demonstrated that the size of mismatch effects is attenuated in ungrammatical sentences, such as “(1b) [_NP1 The sister of [_NP2 the girls]] were…”, when the distractor (NP2) matches the auxiliary verb in number, compared to when it mismatches it, as in “(2) [_NP1 The sister of [_NP2 the girl]] were…”. In the activation-based model, this effect, called facilitatory interference, is predicted in the following scenario. In (1b), focusing on the structure-based and number-based cues, NP1 only matches the structure cue (A_NP1 = .5; W_j = .5, S_ji = 1), and NP2 only matches the number cue (A_NP2 = .5). Since there is no fan effect (S_{str, NP1} = m + ln(1), S_{num, NP2} = m + ln(1); str = structure), NP1 and NP2 receive the maximum activation from each cue. In (2), NP1 receives full activation from the structure-based cue (as in (1b)). However, NP2 receives no activation from either cue (A_NP2 = 0) because it does not match them. Activation values fluctuate from trial to trial due to stochastic noise. Consequently, in (1b), either NP1 or NP2, if the retrieval threshold is reached, is retrieved with a probability of about 0.5, depending on the noise fluctuations, but the retrieved element consistently has the higher activation. In (2), NP1 is always retrieved (again if the retrieval threshold is reached), even when its activation is relatively low due to stochastic noise. As a result, over multiple trials, the activation-based model predicts shorter retrieval times in (1b) than in (2).

As shown in Table 1, the activation-based model assumes an additive combination of cues (i.e., \(\sum_j{W}_j{S}_{ji}\)). In contrast to the activation-based model, some studies have proposed that retrieval cues operate through a multiplicative combination (see Dillon et al., 2013; Kush, 2013; Parker, 2019a). In sentence processing research, this cue combination scheme (Clark & Gronlund, 1996; Gillund & Shiffrin, 1984; Raaijmakers & Shiffrin, 1981) is primarily motivated by the proposition that memory retrieval resists similarity-based interference during online language comprehension. To illustrate, consider the following equation, which calculates the retrieval probability (RP) of i based on its match with j: \({RP}_i=\frac{\prod_j{S}_{ji}^{W_j}}{\sum_i\prod_j{S}_{ji}^{W_j}}\). Assuming that feature match = 0.99 and feature mismatch = 0.01, this equation indicates that only partially matching elements make a minimal contribution relative to fully matching elements due to the interdependence of retrieval cues. For example, in sentence (1a) “[_NP1 The sisters of [_NP2 the girls]] were…”, if we assume W = 1, RP_NP1 is .990. If the distractor does not match the auxiliary verb in number , as in “(3) [_NP1 The sisters of [_NP2 the girl]] were…”, it is .999. Thus, RP_NP1 ≈ 1, irrespective of whether the distractor matches in number or not.

In the case of ungrammatical sentences, a similar retrieval probability is obtained for NP1 when the distractor does not match in number, as in (2) “[_NP1 The sister of [_NP2 the girl]] were…”. However, when the distractor matches in number, as in (1b) “[_NP1 The sister of [_NP2 the girls]] were…”, the denominator becomes twice the value of the numerator, resulting in a retrieval probability of 0.5. These probabilities provide a basis for assessing similarity-based interference in online language comprehension. The calculation of retrieval times can take a variety of forms, depending on assumptions about the cognitive processes involved in memory retrieval. One assumption, computationally consistent with proponents of multiplicative cue combination in language comprehension research, is that the parser searches for another element whenever no grammatical dependency is formed (Yadav et al., 2023).^{Footnote 4}

This search process will be referred to as revision.^{Footnote 5} The need for revision is defined in terms of dependency relations. Let D = {R₁, R₂, …R_n} be a set of dependency relations, and let x and y represent temporal points during sentence processing, where x < y. Then, revision is considered necessary at y if D_x ⊈ D_y. According to this definition, we can assume that revision becomes necessary at the second dependency element i_b when a feature-mismatching element i_a is retrieved because, during the processing of i_b, 〈i_a, i_b〉 ∈ D at x, but 〈i_a, i_b〉 ∉ D at y due to the violation of linguistic agreement. Since revision is an additional cognitive process, it takes some time to complete (see Cunnings & Fujita, 2021; Ferreira & Clifton, 1986; Ferreira & Henderson, 1991; Fodor & Ferreira, 1998; Frazier, 1979; Frazier & Rayner, 1982; Fujita, 2021b; Fujita, 2023; Fujita & Cunnings, 2020, 2021a, 2021b; Gibson, 1991; Pritchett, 1992; Sturt, 1997; Sturt et al., 1999). In other words, assuming that retrieval times follow a normal distribution, RT_i~Normal(μ, σ), retrieval times with revision can be expressed as: RT_i~Normal(μ + α, σ).

In grammatical sentences, revision occurs with a probability of 1 – RP_NP1. As explained earlier, the retrieval probability of NP1 in grammatical sentences remains close to 1 regardless of the distractor features. Therefore, retrieval times almost always follow the Normal(μ, σ) distribution, resulting in minimal interference effects. In ungrammatical sentences, revision always occurs because no element fully matches the retrieval cues. Therefore, although the retrieval probability of NP1 varies depending on the presence or absence of a feature-matching distractor, retrieval times always follow Normal(μ + α, σ). As a result, the multiplicative cue-based model predicts processing costs due to mismatch effects, but no interference in both grammatical and ungrammatical sentences.

Conditions on Similarity-Based Interference

There is an ongoing debate about the factors that may influence memory retrieval. One area of discussion focuses on a so-called grammatical asymmetry, where interference effects are observed in ungrammatical sentences but not in grammatical sentences. Specifically, a large body of previous research has demonstrated that online memory retrieval in English is susceptible to facilitatory interference (e.g., Cunnings & Sturt, 2018; Fujita & Cunnings, 2022, 2023; Fujita & Yoshida, 2024; González Alonso et al., 2021; Jäger et al., 2020; Kim et al., 2019, 2020; Lago et al., 2015; Wagers et al., 2009). In contrast, the evidence regarding inhibitory interference remains inconclusive. While many studies have found no inhibitory interference (e.g., Cunnings & Fujita, 2023; Dillon et al., 2013; Fujita & Cunnings, 2022, 2023; Fujita & Yoshida, 2024; González Alonso et al., 2021; Jäger et al., 2020; Lago et al., 2015; Wagers et al., 2009), some have reported its presence (Patil et al., 2016; Van Dyke, 2007; Van Dyke & McElree, 2011) or observed interference effects against it (Cunnings & Felser, 2013; Sturt, 2003). The presence of inhibitory and facilitatory interference is consistent with the activation-based model, whereas the absence of inhibitory interference is consistent with the multiplicative cue-based model.

The grammatical asymmetry has led to the formulation of several hypotheses about memory retrieval. For example, Wagers et al. (2009) argue that cue-based memory retrieval occurs only when a prediction conflicts with the actual input. This retrieval process leads to similarity-based interference only in ungrammatical sentences.

To illustrate Wagers et al.’s argument, consider the subject-verb agreement in (1a) “[_NP1 The sisters of [_NP2 the girls]] [_T were]….”. When encountering NP1, the parser predicts a verb marked with a plural feature (e.g., “were”) and maintains information about it (Gibson, 1998; Kim et al., 2020). In this context, it is possible to predict the presence of a verb because it is a component of the obligatory structure; the sentence becomes ungrammatical in its absence (see Abney, 1989; Aoshima et al., 2004; L. Frazier & Fodor, 1978; Fujita, 2023; Gibson, 1991, 1998; Gorrell, 1995; Pritchett, 1992; Weinberg, 1999; Yoshida et al., 2013). As the auxiliary verb “were” materialises, it corresponds to the predicted representation in terms of number and grammatical categories. Consequently, the parser integrates it into the current structure and forms the subject-verb dependency without cue-based memory retrieval being triggered.

In the case of the ungrammatical sentence (1b) “[_NP1 The sister of [_NP2 the girls]] [_T were]….”, the parser predicts a singular verb (e.g., “was”). However, the actual verb “were” is inconsistent with this prediction in terms of number. According to Wagers et al., mismatch effects occur in this case, and the parser then attempts to check the properties of NP1 by retrieving its information from memory, using the features of the verb as retrieval cues. This process sometimes leads to misretrieval of the distractor, resulting in facilitatory interference.

Fujita and Yoshida (2024) also argue for the grammatical asymmetry, but suggest that inhibitory interference does not occur because similarity-based interference is a phenomenon of last resort. Specifically, they claim that in ungrammatical sentences, sentence processing is susceptible to facilitatory interference because, after mismatch effects occur, the parser searches for an element that matches the retrieval cues in terms of lexical features for interpretive reasons. In grammatical sentences, the parser does not need to form such a structurally impermissible dependency, because there is always a grammatical dependency relation.

Although, as discussed above, many studies have observed facilitatory interference, there are a few that have suggested its absence. For example, Dillon et al. (2013) observed facilitatory interference in subject-verb agreement, but not in reflexive resolution, as illustrated in (4a/b) below.

(4a)
[_TP [_NP1 The bodybuilder who saw [_NP2 the trainers]] injured themselves last night].
(4b)
[_TP [_NP1 The bodybuilder who saw [_NP2 the trainer]] injured themselves last night].

The sentences in (4a/b) contain the reflexive pronoun “themselves”, which is referentially dependent on another NP, its antecedent. This reference relation is subject to structural constraints. Informally and approximately, a reflexive takes a locally c-commanding NP as its antecedent (Chomsky, 1981). C-command refers to a structural relation between nodes in a tree as in Fig. 1 and is defined as follows: node a c-commands node b if (i) node c is a sister of a and (ii) b = c or c dominates b (Reinhart, 1976).^{Footnote 6} In (4a/b), NP1 c-commands the reflexive, but NP2 does not. Thus, NP1 is a structurally accessible antecedent and NP2 is a distractor. A reflexive must agree with its antecedent NP in number. In (4a), only NP2 matches the reflexive in number, and in (4b), both NP1 and NP2 do not match. The activation-based model predicts facilitatory interference in (4a) compared to (4b), in contrast to the findings of Dillon et al. On the other hand, the multiplicative cue-based model predicts no facilitatory interference, in line with Dillon et al. The results of Dillon et al. could indicate that different cue combination mechanisms are used for different types of dependencies. However, recent studies with large sample sizes have reported facilitatory interference in reflexive resolution, suggesting that online reflexive resolution in ungrammatical sentences is susceptible to interference effects, at least under certain circumstances (Fujita & Yoshida, 2024; Jäger et al., 2020).

There is also an ongoing debate about whether distractor position affects memory retrieval (Arnett, 2016; Arnett & Wagers, 2017; Engelmann et al., 2019; Fujita & Yoshida, 2024; Parker & An, 2018; Van Dyke & McElree, 2011). Parker and An (2018) reported that memory retrieval is insusceptible to similarity-based interference in subject-verb dependencies when the distractor is in a subject or object position, as in [_NP1 The boys who saw [_NP2 the girls]] were walking back home. In this sentence, NP2, the distractor, is a direct object of “saw”. In this paper, the term subject position refers to the specifier of a tensed TP, and the term object position refers to the complement of a verb. According to Parker and An, subjects and objects are the primary content of sentences and are therefore encoded distinctively; as a result, the parser can easily identify a distractor in a subject or object position as unrelated to the subject-verb dependency, leading to no interference (but see Fujita & Yoshida, 2024).

Some studies suggest different roles for subject and object positions (Arnett, 2016; Arnett & Wagers, 2017; Engelmann et al., 2019). Engelmann et al. (2019) argue that a distractor in a subject position is to some extent prominently encoded, leading to increased interference. Similarly, Arnett and Wagers (2017) argue for the role of subject positions in interference, but their argument is rooted in linguistic theory. As noted earlier, a finite verb basically forms a dependency relation with an NP in the specifier of a tensed TP. Arnett and Wagers suggest, on the basis of their empirical findings, that the parser uses this information as a retrieval cue in subject-verb dependencies. Therefore, according to Arnett and Wagers, (increased) interference effects are expected from a distractor in the specifier of a finite TP when memory retrieval targets such a position.

The Present Study

This paper first reports two experiments (Experiments 1 and 2) designed to investigate similarity-based interference. A computational model and simulations are presented in the General Discussion. Experiments 1 and 2 tested sentences with a floating quantifier, as shown in (5) below.

(5)
[_TP [_NP1 The boys who [_NP2 the girls]] saw all walked back home from school].

In English, universal quantifiers such as “all” usually appear immediately before their associates like “all the boys”. In (5), however, the quantifier “all” appears without an immediate NP. One possible analysis of this floating quantifier phenomenon is that the quantifier and its associate form a constituent, but at a particular stage of the derivation, only the associate moves to a higher position, leaving the quantifier stranded in a lower position (e.g., Al Khalaf, 2019; Giusti, 1990; McCloskey, 2000; Sportiche, 1988; Zyman, 2018). Figure 2 provides a visual illustration of this analysis. In (5), although the quantifier is preceded by two plural nominals (NP1 and NP2), it can only form a dependency relation with NP1. This can be explained by the observation that in English, floating quantifiers are locally c-commanded by their associates, and NP1, but not NP2, c-commands the floating quantifier (see Fig. 2). Also, a floating quantifier must modify a plural nominal. For example, if NP1 in (5) is singular, the sentence becomes ungrammatical (“The boy who the girls saw all walked…”). These constraints provide a test case for similarity-based interference.

In the language comprehension literature, there is only one study that has examined similarity-based interference in the quantifier float construction, and the results are controversial with respect to online sentence processing (Fujita & Cunnings, 2023). Specifically, Fujita and Cunnings (2023) observed facilitatory interference in one reading experiment, but found no interference effects in another reading experiment, when the distractor was in the subject position of a relative clause, as in (5). Therefore, investigating quantifier float has theoretical importance in terms of understanding the universality of similarity-based interference and cue combination mechanisms (Dillon, 2014; Dillon et al., 2013; Orth et al., 2021; Parker, 2019a). In addition, quantifier float provides an intriguing test case for exploring cue-based memory retrieval for several reasons. First, the parser is unlikely to predict the presence of a floating quantifier in sentences such as (5). This is because the quantifier is not obligatory in the sentence structure, and the sentence lacks a contextual bias towards its presence (Fujita, 2023; Gibson, 1991; Gorrell, 1995; Weinberg, 1999). Consequently, the appearance of a floating quantifier should be unexpected, which may trigger cue-based memory retrieval to search for its associate, leading to inhibitory interference (Wagers et al., 2009). Second, the associate of a floating quantifier is not restricted to a subject position; it can also occupy an object position, as observed in sentences such as John likes them all. Given this distributional property, it is possible that the retrieval of the associate of a floating quantifier does not exclusively target subject positions. Thus, if we adopt the perspective suggested by Arnett and Wagers (2017), similar interference effects would be expected between subject and object positions. However, if we follow Engelmann et al. (2019), who argue that any element in a subject position is to some extent prominently encoded, we can expect increased interference effects from subject positions relative to object positions. It is also conceivable that real-time sentence processing is impervious to similarity-based interference when the distractor occupies a subject or object position (Parker & An, 2018). By investigating online dependency formation between floating quantifiers and their associates, the present study aims to provide insight into these theoretical issues. The influence of distractor position is assessed by placing a distractor in either a subject position (Experiment 1) or an object position (Experiment 2).

An important consideration when investigating similarity-based interference is that its size may be small, particularly for inhibitory interference. For example, Nicenboim et al. (2018) reported an effect size of approximately 9 ms for inhibitory interference, with a 95% range of [0, 18] ms, based on pooled data from their two experiments. Assuming that this estimate approximates the true effect size, a simple power analysis suggests that, with a reasonable standard deviation (e.g., Parker, 2019a), 640 participants would be required to achieve an 80% probability of detecting inhibitory interference (see Fig. 3).

To ensure robust and reliable evidence of similarity-based interference, the present study included 640 participants and 24 sets of experimental sentences in the data analysis for each experiment.

Experiment 1

Experiment 1 investigated similarity-based interference in the quantifier float construction with a distractor in the subject position. The experimental sentences, as shown in (6a–d) below, were taken from Fujita and Cunnings (2023).

(6a)
Grammatical, Distractor match

The boys who saw the girls quite recently, all walked back home from school.
(6b)
Grammatical, Distractor match

The boys who saw the girl quite recently, all walked back home from school.
(6c)
Ungrammatical, Distractor match

The boy who saw the girls quite recently, all walked back home from school.
(6d)
Ungrammatical, Distractor mismatch

The boy who saw the girl quite recently, all walked back home from school.

In (6a/b), the floating quantifier “all” and its associate “The boy(s)…” match in number, whereas in (6c/d), they do not. In these sentences, the distractor “the girl(s)” occupies the subject position of a relative clause. This distractor matches the floating quantifier in number in (6a/c), but does not match it in (6b/d). If memory retrieval for floating quantifiers is a structure-dependent process, reading times at the quantifier should be longer in the ungrammatical conditions (6a/b) than in the grammatical conditions (6c/d) due to mismatch effects (Fujita & Cunnings, 2023; Wagers et al., 2009).

With respect to similarity-based interference, the activation-based model predicts longer reading times at the quantifier in (6a) than in (6b) due to inhibitory interference and shorter reading times at the quantifier in (6c) than in (6d) due to facilitatory interference (Lewis & Vasishth, 2005; Vasishth & Engelmann, 2021). In contrast, the multiplicative cue-based model predicts no similarity-based interference. The absence of similarity-based interference is also predicted by Parker and An (2018), who argue that similarity-based interference does not occur when a distractor is in a subject or object position.

Participants

A total of 640 native English speakers were included in the data analysis. These participants were recruited via Prolific (https://prolific.co/). They had a university degree, were British citizens, and had lived in the UK for most of their lives before the age of 18.

Materials

Experiment 1 included 24 sets of experimental sentences, as exemplified in (6a–d). In addition, the experiment included 72 filler sentences with a variety of syntactic structures. The critical region in all experimental sentences contained the quantifier “all”.

Procedure

Reading times for each word were measured using a lexicality maze task implemented in PCIbex Farm (Zehr & Schwarz, 2018). In this task, participants were presented with each sentence word by word, along with a pseudoword, and had to press the button corresponding to the correct word that continued the sentence. If a pseudoword was selected, the trial was immediately terminated, accompanied by the display of a warning message, and the next trial began. The experimental file was generated using code available online (Fujita, 2021a). The experiment began with four practice trials, followed by 24 experimental sentences and 72 fillers presented in a pseudo-random order.

Data Analysis

Reading times at the critical region (“all”) and the spillover region (“walked”) were analysed in Bayesian linear mixed models (Gelman et al., 2013; Lee & Wagenmakers, 2014; Nicenboim et al., 2023; Vasishth, 2023; Vasishth et al., 2023) with a log-normal likelihood, using the brms package (Bürkner, 2017) in R (R Core Team, 2022). In Bayesian statistics, the prior uncertainty is computed as a probability distribution, and the observed data update this uncertainty to compute the posterior distribution. The parameter values of research interest were sampled using Markov Chain Monte Carlo techniques.

Before data analysis, reading times below 100 milliseconds or above 10,000 milliseconds were excluded from the data, as these data points are likely to indicate lapses in attention. This procedure follows Fujita and Cunnings (2023). The models included the fixed effects of grammaticality (categorised as grammatical or ungrammatical), distractor (categorised as distractor match or distractor mismatch) and their interaction. The two levels of grammaticality and distractor were coded as 0.5/–0.5. The models included varying intercepts and slopes for participants and materials.

Regularising weakly informative priors were specified for the prior uncertainty. Specifically, Normal(6.5, 1) was specified for the intercept, and Normal(0, 1) for the other parameters. The prior distributions for the random effect standard deviations for participants and materials were truncated at 0. Each Bayesian linear mixed model involved four chains, each running for 4,000 iterations, with the first 2,000 iterations discarded as warm-up samples. Convergence was assessed using both R-hat diagnostics and visual inspection. All models reported in this paper converged.

In the results section, the estimates of the effects of theoretical interest are presented with their uncertainties, represented by the 2.5th percentile for the lower bound and the 97.5th percentile for the upper bound of the posterior distributions. This credible interval is denoted as 95% Crl [2.5th percentile, 97.5th percentile]. The posterior distributions are presented on the millisecond scale rather than on the logarithmic scale.

In addition to the 95% credible intervals, Bayes factor analyses were conducted for the interaction effects and similarity-based interference effects. A Bayes factor quantifies the relative evidence between two models by comparing their marginal likelihoods, which represent the probability of observing the data given the model specifications. Specifically, a Bayes factor (BF₁₀) is calculated as follows: \({BF}_{10}=\frac{P\left( Data|{Model}_1\right)}{P\left( Data|{Model}_0\right)}\), where Model₀ represents the null hypothesis, e.g., the interaction effect is zero (the model does not contain the interaction term), and Model₁ represents an alternative hypothesis, e.g., the interaction effect is not zero (the model contains the interaction term). For example, BF₁₀ = 3 suggests that the observed data are three times more likely to have occurred under Model₁ than under Model₀, providing evidence for Model₁ and against Model₀. In other words, \({BF}_{01}=\frac{1}{3}\) (i.e., \({BF}_{10}={\frac{1}{BF}}_{01}\)).

The strength of evidence was interpreted according to the guideline proposed by Jeffreys (1961), as cited in Lee & Wagenmakers, 2014). This guideline, presented in Table 2 (Lee & Wagenmakers, 2014, p. 105), was considered as an approximate description of the strength, in line with the recommendation of Lee and Wagenmakers (2014).

Table 2 The interpretation of the Bayes factor. This guideline is proposed by Jeffreys (1961), as cited in Lee & Wagenmakers, 2014)

Full size table

Bayes factors are sensitive to the prior distribution of parameters. For example, the weakly informative prior Normal(0, 1) assumed for the log scale fixed effects in this study gives a strong bias towards the null hypothesis. Therefore, more informative priors, namely, Normal(0, 0.025) and Normal(0, 0.05), were used for the Bayes factor analyses.

To calculate Bayes factors, models with varying intercepts for participants and materials were fitted using four chains and 22,000 iterations. The first 2,000 iterations of each chain were discarded and marginal likelihoods were calculated using the bridgesampling package (Gronau et al., 2020).

Results

The log-transformed reading times at the critical and spillover regions are illustrated in Fig. 4. Figure 5 displays the estimates of grammaticality, distractor, and their interaction, accompanied by 95% credible intervals. Figure 6 presents the results of the Bayes factor analyses. In cases where there was a sign of the interaction, a follow-up analysis was performed by fitting a nested model with a sum-coded fixed effect of distractor within each level of grammaticality.

Critical Region

The effects of theoretical interest are grammaticality and its interaction with distractor. At the critical region, both effects show a positive sign. The 95% Crl for the grammaticality effect is [11, 24] ms, indicating longer reading times in the ungrammatical conditions than in the grammatical conditions (i.e., mismatch effects). The 95% Crl for the interaction effect is [0, 32] ms, and, as shown in Fig. 6, the Bayes factor analysis provides strong evidence for this effect.

For ungrammatical sentences, the nested model shows a negative sign of the distractor effect, with the 95% Crl [–23, –5] ms, indicating shorter reading times in the distractor match condition than in the distractor mismatch condition. This negative sign is consistent with facilitatory interference and is supported by the Bayes factor analysis, which provides very strong evidence for its presence. For grammatical sentences, the 95% Crl for the distractor effect is centred around zero ([–8, 12] ms), and the Bayes factor analysis provides moderate evidence against its presence, indicating no inhibitory interference.

Spillover Region

The results at the spillover region closely mirror those at the critical region. The 95% Crl for the grammaticality effect is [12, 29] ms, indicating mismatch effects. There is also moderate to strong evidence for the interaction effect (the 95% Crl [1, 33] ms).

For ungrammatical sentences, the nested model shows shorter reading times in the distractor match condition than in the distractor mismatch condition (95% Crl [–31, –13] ms), a pattern consistent with facilitatory interference. The Bayes factor analysis provides extremely strong evidence for this effect. For grammatical sentences, there is a negative sign, with the effect being centred around zero, a pattern inconsistent with inhibitory interference (95% Crl [–15, 5] ms). The Bayes factor analysis suggests that the data are more likely to have been generated under the null hypothesis.

Discussion

The mismatch effects observed at the critical and spillover regions indicate that when the floating quantifier appears, the parser attempts to retrieve its associate in a structurally permissible position. This finding suggests that memory retrieval during online language comprehension adheres to structural constraints. Importantly, the results provided evidence of facilitatory interference. However, there was no evidence of inhibitory interference. This pattern of interference is inconsistent with both the activation-based model (Vasishth & Engelmann, 2021) and the multiplicative cue-based model, but consistent with previous research observing the grammatical asymmetry. The results are also inconsistent with Parker and An (2018), who argue that similarity-based interference does not arise from a distractor in the subject position.

Experiment 2 further investigated similarity-based interference in the quantifier float construction, but with the distractor in an object position.

Experiment 2

Experiment 2 used experimental sentences similar to those used in Experiment 1, but with the distractor in an object position, as shown in (7a–d) below.

(7a)
Grammatical, Distractor match

The boys who saw the girls quite recently, all walked back home from school.
(7b)
Grammatical, Distractor match

The boys who saw the girl quite recently, all walked back home from school.
(7c)
Ungrammatical, Distractor match

The boy who saw the girls quite recently, all walked back home from school.
(7d)
Ungrammatical, Distractor mismatch

The boy who saw the girl quite recently, all walked back home from school.

The predictions for (7a–d) are analogous to those for (6a–d). First, if memory retrieval is a structure-dependent process, number mismatch effects should be observed at the quantifier, with longer reading times in the ungrammatical conditions (7c/d) than in the grammatical conditions (7a/b). Second, the activation-based model predicts inhibitory interference in (7a) compared to (7b) and facilitatory interference in (7c) compared to (7d). In contrast, the multiplicative cue-based model predicts no interference effects. Third, according to Parker and An (2018), interference effects should be absent in both (7a) and (7c) because the distractor is in the object position.

Regarding the potentially different roles of subject and object positions in memory retrieval, Engelmann et al. (2019) suggest that larger interference effects arise from subject positions relative to object positions. In contrast, Arnett and Wagers (2017) argue that such effects are limited to cases where memory retrieval targets subject positions. Given that the associates of floating quantifiers are not restricted to subject positions, these studies predict either that (7a–d) should elicit smaller interference effects than (6a–d), or that similar interference effects should occur between (7a–d) and (6a–d).