Readers rely on multiple sources of linguistic information in order to arrive at the correct interpretation of a sentence. A central goal of psycholinguistics is to understand how readers use the information available to them to facilitate processing and whether some linguistic cues might be relied on more heavily than others. In particular, this article examines how semantic properties of the noun phrases (NPs) of a sentence might influence the processing of syntactically complex sentences. Studies of syntactic complexity often draw a comparison between subject-extracted and object-extracted relative clauses (RCs). In a subject-extracted RC (SRC), as in (1a), the head NP serves as the subject of the action being described in the RC, as well as the subject of the action described in the main clause. In contrast, the head NP of an object-extracted RC (ORC), as in (1b), serves as the object of the action being described in the RC.

  1. (1a)

    The journalist that accused the senator caused a scandal after the election.

  2. (1b)

    The journalist that the senator accused caused a scandal after the election.

Although these two sentences contain the exact same words and differ only in word order, ORCs are more difficult to process than SRCs—a standard finding that has been reported using many different paradigms (e.g., Caplan, Alpert, & Waters, 1998; Caramazza & Zurif, 1976; Ford, 1983; Holmes & O’Regan, 1981; Just, Carpenter, Keller, Eddy, & Thulborn, 1996; Wanner & Maratsos, 1978).

Recent work has identified a variety of lexical, semantic, and pragmatic factors that moderate the ORC–SRC asymmetry (e.g., Gennari & MacDonald, 2008; Gordon, Hendrick, & Johnson, 2001, 2004; Gordon, Hendrick, Johnson, & Lee, 2006; Johnson, Lowder, & Gordon, 2011; King & Just, 1991; Traxler, Morris, & Seely, 2002; Traxler, Williams, Blozis, & Morris, 2005; Warren & Gibson, 2002), with these findings being used in the evaluation of competing theoretical explanations of complex-sentence processing (see Gordon & Lowder, 2012, for a review). One class of these explanations has focused on semantic and/or pragmatic factors that influence the relationships between key aspects of RC sentences. For example, King and Just showed that the ORC–SRC difference was substantially reduced when arbitrary noun–verb pairings (e.g., The robber that the fireman detested watched the program) were replaced by verbs that established meaningful sentential relationships (e.g., The robber that the fireman rescued stole the jewelry). This manipulation aids in ORC processing because the meaning of the sentence is consistent with what we know about the world (e.g., The robber stole; The fireman rescued), whereas a reverse pairing of NPs and verbs (e.g., The robber rescued; The fireman stole) would be inconsistent with world knowledge. In contrast, an ORC with arbitrary NP–verb pairings (e.g., The robber watched; The fireman detested) is more difficult to process because the NP–verb relationships are pragmatically reversible, making it impossible for readers to rely on their world knowledge to facilitate processing.

Other semantic/pragmatic approaches to RC processing have examined how the ORC–SRC asymmetry is moderated by broader semantic classes, such as NP animacy, in cases where world knowledge does not provide specific information about meaningful NP–verb links (Gennari & MacDonald, 2008, 2009; Mak, Vonk, & Schriefers, 2002, 2006; Traxler et al., 2002; Traxler et al., 2005). For example, Tralxer et al. (2005) conducted an eye-tracking-while-reading experiment using sentences like those in (2). Results showed that the ORC–SRC difference was larger when the head NP (NP1) was animate and the embedded NP (NP2) was inanimate (2b vs. 2a) than when the animacy configuration was reversed (2d vs. 2c). Traxler et al. (2005) proposed that this effect occurred because readers adopt an active-filler strategy (e.g., Clifton & Frazier, 1989; Frazier & Clifton, 1989) during RC processing, such that all of the sentences in (2) are initially parsed as SRCs at the complementizer that. This strategy leads to an incorrect initial parse for ORCs (e.g., 2b and 2d), which must be revised to yield the correct interpretation. Traxler et al. (2005) proposed that reanalysis of NP1 as the object of the RC verb was more difficult when it was animate (2b) than when it was inanimate (2d), thereby yielding a larger ORC–SRC difference when NP1 was animate than when it was inanimate.

  1. (2a)

    The senator that skimmed the article was forgotten after the election.

  2. (2b)

    The senator that the article accused was forgotten after the election.

  3. (2c)

    The article that accused the senator was forgotten after the election.

  4. (2d)

    The article that the senator skimmed was forgotten after the election.

As in the King and Just (1991) example, manipulations of animacy raise issues related to the reversibility of NP–verb pairings. For example, the ORCs studied by Traxler et al. (2005; 2b and 2d) each contain one animate NP, one inanimate NP, and an embedded action verb, which allows for only one semantically acceptable interpretation of each sentence (e.g., The senator skimmed the article is acceptable, whereas The article skimmed the senator is anomalous).

In Lowder and Gordon (2012) we challenged the notion that the animacy of NPs plays a direct role in the difficulty of processing ORCs. Comparing ORCs that differed only in the animacy of NP2, as in (3), we showed that the greater difficulty associated with having an inanimate, as compared with an animate, NP2 (3b vs. 3a) emerged entirely at the embedded verb. This pattern suggests that local processing difficulty emerges when an inanimate subject is integrated with an action verb (e.g., article accused) but that this difficulty does not contribute to broader-level RC effects, such as differences at the matrix verb.

  1. (3a)

    The senator that the journalist accused was forgotten after the election.

  2. (3b)

    The senator that the article accused was forgotten after the election.

A similar difficulty emerged when the inanimate subject and action verb appeared together in the main clause of the sentence (e.g., The article accused . . .) but was reduced when the two constituents appeared in separate clauses (e.g., The article that accused . . .). This pattern is consistent with other recent findings (Lowder & Gordon, 2013, 2014) showing that readers use sentence structure as a cue for determining the relative importance of different semantic relationships to the overall interpretation of a sentence, with important relationships being processed more deeply than less important ones. The semantic relationship between article and accused has reduced relevance and is processed less deeply when integration of the critical constituents occurs across a clause boundary, as compared with when these two constituents appear together in the same clause.

Our experiment (Lowder & Gordon, 2012) demonstrating that NP animacy affects local processes of subject–verb integration—but not broader processing of the complex relations in an ORC—manipulated the animacy of NP2 while holding NP1 constant (e.g., 3a vs. 3b). As such, its results only pertain to the animacy of the embedded NP, the processing of which was the focus of Traxler et al.’s. (2002; Traxler et al., 2005) explanation of how animacy influences processing of RC sentences. However, Gennari and MacDonald (2008, 2009) have argued that the effect of animacy on the difficulty of ORCs (e.g., 2b vs. 2d) is due, instead, to differences in the animacy of NP1. They conducted a self-paced reading experiment using ORCs similar to the ones in (2b) and (2d) and found that ORCs with an animate NP1 (2b) were more difficult than ORCs with an inanimate NP1 (2d), an overall pattern that is consistent with previous findings, but in addition found that the difference first emerged at NP2 (i.e., before any verbs had been encountered). Gennari and MacDonald explained these results using an expectation-based framework that they called semantic indeterminacy (see also Hale, 2001; Levy, 2008; MacDonald & Christiansen, 2002). According to this account, comprehension difficulty at each point in the sentence is a function of how statistically likely that continuation is, on the basis of what has come before it. Thus, Gennari and MacDonald argued that when readers encounter an animate head followed by a complementizer (e.g., The senator that . . .), the probability that this stem will continue with an inanimate NP (e.g., The senator that the article . . .) is quite low. In contrast, an inanimate head followed by a complementizer (e.g., The article that . . .) is relatively more likely to continue with an animate NP (e.g., The article that the senator . . .). This, they argued, explains the processing differences at NP2 for (2b) versus (2d).

While Gennari and MacDonald’s (2008, 2009) findings raise a variety of interesting questions, their interpretive focus on NP1 faces the same limitation as Traxler et al.’s (2005) interpretive focus on NP2; the source of the greater difficulty for (2b) versus (2d) could involve the animacy of NP1, the animacy of NP2, or both. This comparison is additionally complicated by the use of two different embedded verbs. Lowder and Gordon (2012) controlled for this by manipulating NP2 while keeping the rest of the sentence constant (see 3). The present study adopts a similar strategy in order to investigate the effects of NP1 animacy on RC processing and to provide evidence about the levels of processing at which animacy and world knowledge contribute to sentence processing.

Experiment 1

This experiment used eye tracking during reading to determine how the processing of ORCs is affected by the animacy of NP1. The materials were based on those used in previous studies (Gennari & MacDonald, 2008; Traxler et al., 2005) but were adapted such that only animacy of NP1 varied between the critical ORC sentences (see 4b and 4c); SRC items using animate NP1s were also tested (see 4a). Thus, the comparison of (4a) and (4b) provides evidence about how ORC–SRC differences affect performance in sentences that are otherwise matched, whereas comparison of (4b) and (4c) provides evidence about how the animacy of NP1 in ORC sentences affects performance in sentences that are otherwise matched.

  1. (4a)

    The senator that criticized the journalist accused the governor of embezzling millions of dollars. (SRC)

  2. (4b)

    The senator that the journalist criticized accused the governor of embezzling millions of dollars. (ORC–Animate)

  3. (4c)

    The article that the journalist criticized accused the governor of embezzling millions of dollars. (ORC–Inanimate)

Gennari and MacDonald’s (2008) self-paced reading experiment provided evidence that having an inanimate NP1 facilitated ORC processing early in sentence comprehension during the reading of NP2. In contrast, Traxler et al.’s (2005) report of their eye-tracking data does not allow the exact locus of the difficulty to be pinpointed, because they reported reading times for the RC region as a whole, but not for individual words. The use of eye-tracking methodology in the present experiment allows us to assess whether manipulation of the animacy of NP1 influences ORC processing in early stages, later stages, or both. If NP1 animacy influences the early stages of ORC processing, as reported by Gennari and MacDonald (2008), we should observe longer reading times at NP2 for the ORC–animate condition, as compared with the ORC–inanimate condition, before the reader has fixated any of the verbs in the sentence. Although such a pattern would be consistent with the findings reported by Gennari and MacDonald (2008), there are at least two possible explanations. Gennari and MacDonald (2008; see also Staub, 2010) explained this effect in terms of semantic indeterminacy, proposing that the differences in processing times between these two conditions reflect differences in readers’ expectations. However, we have argued previously that these early effects may, instead, be consistent with a memory-based framework of RC processing (see, e.g., Gordon et al., 2001, 2004; Gordon, Hendrick, & Levine, 2002; Lewis & Vasishth, 2005; Lewis, Vasishth, & Van Dyke, 2006). This perspective conceptualizes difficulty with ORCs in terms of the cognitive mechanisms necessary to encode and store NPs in memory and then, later retrieve them when they must be integrated with the verbs of the sentence. Thus, the early effect of animacy at NP2 may reflect greater effort at encoding, triggered by the readers’ detection of a memory load that will have to be retrieved at a later point in the sentence (Gordon & Lowder, 2012; Johnson et al., 2011).

In contrast, effects of NP1 animacy that emerge in relatively later measures (i.e., after the reader has begun processing the verbs) are most readily explained in terms of differences in how semantic reversibility affects retrieval of information from earlier in the sentence. Note that the ORC–animate condition always contained two animate NPs, whereas the ORC–inanimate condition contained one inanimate NP and one animate NP. Thus, the actions in the ORC–animate condition were always semantically reversible, whereas the actions in the ORC–inanimate condition often were not. For example, in the ORC The movie that the actress hated, readers can rely on semantic knowledge to assign the thematic role of agent to “actress” and patient to “movie.” In contrast, in the ORC The director that the actress hated, readers cannot rely purely on semantic knowledge to assign thematic roles, since both “director” and “actress” could easily serve as either agent or patient. Thus, longer reading times for the ORC–animate condition, as compared with the ORC–inanimate condition, that emerge after readers have fixated the verbs most likely capture differences in difficulty associated with memory retrieval, which leads to more rereading of the sentence in the ORC–animate condition than in the ORC–inanimate condition.

Method

Participants

Thirty-six students at the University of North Carolina at Chapel Hill participated in this experiment in exchange for course credit. They were all native English speakers and had normal or corrected-to-normal vision.

Materials

Each participant was presented with 36 experimental sentences and 64 filler sentences. The experimental sentences (see 4) were partly adapted from previous work (Gennari & MacDonald, 2008; Traxler et al., 2005), but new sentences were also created. The two ORC conditions (4b and 4c) were identical to one another, except for the head noun. Animate and inanimate head nouns did not differ in length or log frequency, ts < 1 (SUBTLEXus database; Brysbaert & New, 2009). The embedded noun was always animate. The SRC condition was created by reversing the position of the embedded NP and embedded verb in the ORC–animate condition (see Appendix 1 for the full set of experimental stimuli).

The experimental sentences were counterbalanced across three lists so that each participant saw only one version of each item and so that each participant saw the same number of sentences from each of the three conditions. A true/false comprehension question followed every sentence. For the experimental sentences, two thirds of the questions asked about the action being described in the RC, whereas the other third asked about the action being described in the main clause (King & Just, 1991). For half of the questions the correct answer was true, and for the other half it was false.

Procedure

Participants’ eye movements were recorded with an EyeLink 1000 system (SR Research) at a sampling rate of 1000 Hz, which was calibrated at the beginning of each session. A chinrest was used to minimize head movement. Participants were instructed to read at a natural pace. At the start of each trial, a fixation point was presented near the left edge of the monitor, marking the location where the first word of the sentence would appear. When the participant’s gaze was steady on this point, the experimenter initiated presentation of the sentence. After reading the sentence, the participant pressed a button, which caused the sentence to disappear and a true/false comprehension question to appear in its place. Participants pressed one button to answer “true” and another button to answer “false.” After the participant answered the comprehension question, the fixation point for the next trial appeared.

Participants were first presented with 4 of the filler sentences. After this warm-up block, the remaining 96 sentences were presented in a different random order for each participant. The experimenter monitored gaze location throughout the experimental session and recalibrated the tracker as necessary.

Analysis

Analysis of the eye-tracking data focused on three standard measures (see Clifton, Staub, & Rayner, 2007, for a discussion). First-pass reading time is the sum of all initial fixations on a region, given that the eyes have not yet gone beyond the region; it begins when the region is first fixated and ends when gaze is directed away from the region, either to the left or to the right. First-pass reading time for a single word is referred to as gaze duration. We therefore use the term first-pass reading time to describe early processing effects on a multiword region and the term gaze duration to describe early processing effects on a single word or a single word and its preceding determiner. Regression path duration (also called go-past time) includes all fixation durations from the first fixation in a region until gaze is directed away from the region to the right. Thus, regression path duration includes time spent rereading earlier parts of the sentence before the reader is ready to proceed with the rest of the sentence. Total time is the sum of all fixation durations on a word or region.

Differences in the processing of ORCs versus SRCs were assessed by analyzing reading times for two regions of interest: the RC and the matrix verb. The RC was analyzed as a single region to control for the different word orders of ORCs and SRCs. This region consisted of the three words between the complementizer and the matrix verb. Differences in the processing of ORC–animate and ORC–inanimate sentences were further assessed by analyzing reading times separately for NP2 and the embedded verb, which consisted of the same words in the two conditions.

An automatic procedure in the EyeLink software combined fixations that were shorter than 80 ms and within one character of another fixation into one fixation. Additional fixations shorter than 80 ms and longer than 1,000 ms were removed. In addition, maximum cutoff values for the three reading time measures were set at 3,000 ms.

Results

Comprehension question accuracy

Mean comprehension question accuracies for each condition were as follows: SRC (85 %), ORC–animate (80 %), ORC–inanimate (89 %). A one-way repeated measures ANOVA showed that accuracy differed by condition, F 1(2, 70) = 5.31, MSE = 139, p < .01; F 2(2, 70) = 5.90, MSE = 125, p < .01. Follow-up comparisons showed that performance was significantly worse on questions following ORC–animates, as compared with questions following ORC–inanimates, F 1(1, 35) = 7.83, MSE = 187, p < .01; F 2(1, 35) = 10.51, MSE = 140, p < .01. In addition, performance was marginally worse on questions following ORC–animates than on questions following SRCs, F 1(1, 35) = 3.17, MSE = 147, p < .09; F 2(1, 35) = 3.21, MSE = 145, p < .09, and performance was marginally worse on questions following SRCs than on questions following ORC–inanimates, F 1(1, 35) = 3.39, MSE = 82, p < .08; F 2(1, 35) = 3.09, MSE = 90, p < .09.

RC region

Mean reading times for the RC region and matrix verb are presented in Table 1. Analysis of first-pass reading time on the RC region revealed no significant differences among the three conditions. In contrast, there were significant differences in regression path duration, F 1(2, 70) = 17.00, MSE = 15,546, p < .001; F 2(2, 70) = 12.45, MSE = 20,607, p < .001, and total time, F 1(2, 70) = 18.06, MSE = 25,914, p < .001; F 2(2, 70) = 6.37, MSE = 63,019, p < .005. Follow-up comparisons showed a robust ORC–SRC processing difference, with longer reading times for ORC–animates than for SRCs on regression path duration, F 1(1, 35) = 32.08, MSE = 16,448, p < .001; F 2(1, 35) = 23.72, MSE = 21,610, p < .001, and total time, F 1(1, 35) = 28.14, MSE = 20,069, p < .001; F 2(1, 35) = 13.48, MSE = 42,004, p < .005. Similarly, ORC–inanimates were more difficult than SRCs in regression path duration, F 1(1, 35) = 9.12, MSE = 12,264, p < .01; F 2(1, 35) = 7.23, MSE = 15,799, p < .02, but not in total time. Finally, ORC–animates were more difficult than ORC–inanimates in both regression path duration, F 1(1, 35) = 8.57, MSE = 17,925, p < .01; F 2(1, 35) = 5.85, MSE = 24,413, p < .05, and total time, F 1(1, 35) = 23.47, MSE = 34,785, p < .001; F 2(1, 35) = 8.84, MSE = 72,033, p < .01. The locus of this animacy difference is examined in greater detail below.

Table 1 Eye-tracking results of relative-clause effects in Experiment 1 (mean reading times are presented, with standard errors in parentheses)

Matrix verb

Analysis of gaze duration on the matrix verb revealed no significant differences among the three conditions. In contrast, there were significant differences in regression path duration (although only in the subject analysis), F 1(2, 70) = 3.30, MSE = 13,923, p < .05; F 2(2, 70) = 1.59, MSE = 20,863, p > .10, and total time, F 1(2, 70) = 3.22, MSE = 17,862, p < .05; F 2(2, 70) = 4.59, MSE = 13,736, p < .02. Follow-up comparisons revealed longer reading times for ORC–animates than for SRCs in regression path duration (marginal in the item analysis), F 1(1, 35) = 5.33, MSE = 16,694, p < .05; F 2(1, 35) = 3.56, MSE = 18,250, p < .07. In addition, SRCs were more difficult than ORC–inanimates in total time, F 1(1, 35) = 6.47, MSE = 17,147, p < .02; F 2(1, 35) = 6.72, MSE = 18,006, p < .02. No other pairwise comparisons were significant.

Regions within the ORCs

The differences obtained between the ORC–animate and ORC–inanimate conditions at the RC region were examined further by comparing these two conditions at both NP2 and the embedded verb (note that these words are identical for these two conditions). Mean reading times for these two regions are presented in Table 2. There were no significant differences in gaze duration at either NP2 or the embedded verb. In contrast, there were significantly longer reading times on NP2 for ORC–animates, as compared with ORC–inanimates, in measures of both regression path duration, F 1(1, 35) = 6.93, MSE = 8,973, p < .02; F 2(1, 35) = 4.32, MSE = 13,311, p < .05, and total time, F 1(1, 35) = 10.79, MSE = 25,711, p < .005; F 2(1, 35) = 8.34, MSE = 29,775, p < .01. Analysis of regression path duration at the embedded verb revealed no significant differences; however, analysis of total time at this region revealed longer times for the ORC–animate condition than for the ORC–inanimate condition (marginal in the item analysis), F 1(1, 35) = 5.66, MSE = 12,783, p < .05; F 2(1, 35) = 3.39, MSE = 19,423, p < .08.

Table 2 Eye-tracking results of animacy effects in Experiment 1 (mean reading times are presented, with standard errors in parentheses)

Discussion

The results of Experiment 1 showed that ORCs were more difficult to process than SRCs, as indicated by worse performance on comprehension questions, longer regression path durations and total times at the RC region and longer regression path durations at the matrix verb. This pattern replicates previous work that has examined ORC–SRC differences using eye-tracking methodology (e.g., Gordon et al., 2006; Johnson et al., 2011; Lowder & Gordon, 2012; Traxler et al., 2002; Traxler et al., 2005). In addition, these results are consistent with previous claims that ORCs are easier to process when NP1 is inanimate and NP2 is animate (Gennari & MacDonald, 2008; Traxler et al., 2002; Traxler et al., 2005). Previous work has compared these structures to ORCs with the reverse animacy configuration (i.e., animate NP1, inanimate NP2) and with different embedded verbs. Because the present experiment manipulated only NP1 while holding NP2 and the embedded verb constant, its findings establish the role of NP1 animacy more conclusively than did previous experiments that varied the animacy of NP1 and NP2 together and used different embedded verbs across conditions. The results showed that the ORC–animate condition was more difficult than the ORC–inanimate condition as early as regression path duration on NP2 (i.e., before any verbs had been encountered). This pattern in the eye-tracking data is consistent with the self-paced reading results of Gennari and MacDonald (2008), who also found an animacy effect on NP2. Gennari and MacDonald (2008) argued that this effect stems from differences in readers’ expectations regarding how a sentence with an animate NP1 versus an inanimate NP1 is most likely to unfold.

In addition to the early effect of animacy at NP2, differences between the ORC–animate and ORC–inanimate conditions were also observed in comprehension question accuracies and in total time at NP2 and the embedded verb. In contrast to the expectation-based explanation for the early animacy effect, these differences most likely reflect differences in the reversibility of NP–verb pairings between the ORC–animate and ORC–inanimate conditions. Whereas the inclusion of one animate and one inanimate NP in the ORC–inanimate condition can facilitate readers’ assignment of thematic roles, the inclusion of two animate NPs in the ORC–animate condition requires readers to focus more closely on the syntactic structure of the sentence in order to understand who did what to whom. This difference explains why participants spent more time rereading NP2 and the embedded verb in the ORC–animate than in the ORC–inanimate, condition and why comprehension was worse.

Differences in reversibility also explain why comprehension question accuracy was lower for SRCs than for ORC–inanimates and why total time on the matrix verb was longer for SRCs than for ORC–inanimates. Even though the syntactic structure of the SRCs was simpler than that of the ORC–inanimate condition, the inclusion of two animate NPs in the SRC condition made the relationships between NPs and verbs more easily confusable than in the ORC–inanimate condition.

Experiment 2

Experiment 1 produced results that are consistent with Gennari and MacDonald’s (2008) finding that the animacy of NP1 can have an early effect on ORC processing. Gennari and MacDonald (2008) argued that their pattern of results can be explained in terms of semantic indeterminacy, where readers use the animacy of NP1 to quickly generate expectations about how the sentence is most likely to unfold, and processing difficulty thus reflects violations of those expectations. In the present experiment, we assess whether the influence of NP1 animacy on ORC processing depends further on differences in the degree of semantic relatedness between NP1 and NP2. This research question was driven in part by the observation that some of Gennari and MacDonald’s (2008) critical NPs shared a tight semantic relationship, in the sense that the referents of the two NPs were likely to participate in the same event (e.g., The movie that the director . . .; The trial that the lawyer . . .), whereas others seemed to have been paired in an arbitrary manner (e.g., The accident that the musician . . .; The meal that the minister . . .). In this experiment, we tested ORC sentences like those in (5), where the only word that differed across conditions was the head noun. Whereas the head nouns in (5a) and (5b) share a close semantic relationship with NP2, the head nouns in (5c) and (5d) were chosen arbitrarily such that the referents of the two NPs were unlikely to participate in the same event. In addition, the head noun could be animate (5a and 5c) or inanimate (5b and 5d).

  1. (5a)

    The mayor that the senator criticized received more publicity than anyone expected. (animate–related)

  2. (5b)

    The bills that the senator criticized received more publicity than anyone expected. (inanimate–related)

  3. (5c)

    The waitress that the senator criticized received more publicity than anyone expected. (animate–arbitrary)

  4. (5d)

    The recipe that the senator criticized received more publicity than anyone expected. (inanimate–arbitrary)

We assume that during ORC processing, readers attempt to establish a meaningful relation between NP1 and NP2 as soon as they can. The comparisons outlined in (5) illustrate that readers have multiple sources of semantic information available to them during processing. In the related condition, a tight semantic relationship exists between NP1 and NP2, and we predicted that readers would use this relationship to establish a meaningful association between the two referents before encountering any verbs. In contrast, there is no preexisting semantic link between NP1 and NP2 in the arbitrary condition. When readers can no longer use semantic relatedness to associate the two NPs, we predicted that they would rely on the animacy of the NPs to begin establishing how the two NPs might relate to one another. Thus, if early processing in the related condition is driven more by the preexisting semantic link between NP1 and NP2 than by animacy information, the early effect of NP1 animacy (i.e., regression path duration at NP2) should be smaller in the related condition than in the arbitrary condition.

Method

Participants

Forty-four students at the University of North Carolina at Chapel Hill participated in this experiment in exchange for course credit. They were all native English speakers and had normal or corrected-to-normal vision. None had participated in Experiment 1.

Materials, procedure, and analysis

Each participant was presented with 36 experimental sentences and 88 filler sentences. The experimental sentences (see 5) were all ORCs that were adapted from the materials used in Experiment 1. The words across all four conditions were identical, except for the head noun. Across conditions, head nouns did not differ in log frequency, F(3, 140) < 1, p > .48 (SUBTLEXus database; Brysbaert & New, 2009), or length, F(3, 140) = 1.15, p > .33. As in Experiment 1, the embedded noun was always animate. Closeness of the semantic relationship between N1 and N2 was assessed using latent semantic analysis (LSA; Landauer & Dumais, 1997), which derives semantic similarity from patterns of word co-occurrence across a large sample of text. The mean LSA scores between N1 and N2 in the related condition (.38) were significantly higher than the mean LSA scores between N1 and N2 in the arbitrary condition (.05), F(1, 142) = 149, p < .001 (see Appendix 2 for the full set of experimental stimuli).

The experimental sentences were counterbalanced across four lists so that each participant saw only one version of each item and so that each participant saw the same number of sentences from each of the four conditions. All other aspects of the procedure were identical to the procedure described in Experiment 1. As in Experiment 1, analysis of eye-tracking data in Experiment 2 focused on gaze duration, regression path duration, and total time. Finally, the same data exclusion criteria as those described in Experiment 1 were also employed here.

Results

Comprehension question accuracy

Mean comprehension question accuracies for each condition were as follows: animate–related (83 %), inanimate–related (89 %), animate–arbitrary (82 %), and inanimate–arbitrary (90 %). There was a significant main effect of animacy, such that performance was worse on questions following ORCs with animate heads, as compared with questions following ORCs with inanimate heads, F 1(1, 43) = 11.71, MSE = 195, p < .005; F 2(1, 35) = 7.73, MSE = 241, p < .01. Neither the main effect of semantic relatedness nor the interaction between animacy and relatedness was significant, Fs < 1, ps > .64.

NP2

Reading times are presented in Table 3. An analysis of gaze duration at NP2 revealed no significant main effects or interactions. In contrast, analysis of regression path duration revealed a significant main effect of animacy (marginal in the item analysis), such that there were longer reading times for the animate condition than for the inanimate condition, F 1(1, 43) = 4.18, MSE = 10,124, p < .05; F 2(1, 35) = 3.45, MSE = 11,833, p < .08. A similar pattern also emerged in the analysis of total time, F 1(1, 43) = 9.47, MSE = 22,595, p < .005; F 2(1, 35) = 6.10, MSE = 29,192, p < .02. In addition, there were significant main effects of relatedness for both regression path duration, F 1(1, 43) = 5.00, MSE = 12,240, p < .05; F 2(1, 35) = 4.22, MSE = 13,853, p < .05, and total time, F 1(1, 43) = 6.71, MSE = 24,660, p < .02; F 2(1, 35) = 8.33, MSE = 17,262, p < .01, with longer reading times in the arbitrary condition than in the related condition.

Table 3 Eye-tracking results of Experiment 2 (mean reading times are presented, with standard errors in parentheses)

Crucially, there was a significant animacy × relatedness interaction in the analysis of regression path duration, F 1(1, 43) = 6.00, MSE = 16,271, p < .02; F 2(1, 35) = 5.08, MSE = 13,406, p < .05. Planned contrasts showed that the arbitrary–animate condition was more difficult than the arbitrary–inanimate condition, t 1(43) = 3.37, p < .005; t 2(35) = 2.64, p < .02, whereas there was no difference between the related–animate and related–inanimate conditions, ts < 1. In addition, the arbitrary–animate condition was more difficult than the related–animate condition, t 1(43) = 3.61, p < .005; t 2(35) = 3.41, p < .005, whereas there was no difference between the arbitrary–inanimate and related–inanimate conditions, ts < 1.

Embedded verb

Main effects of animacy emerged in both gaze duration (marginal in the item analysis), F 1(1, 43) = 4.19, MSE = 3,215, p < .05; F 2(1, 35) = 3.32, MSE = 2,487, p < .08, and in total time (significant only in the subject analysis), F 1(1, 43) = 6.65, MSE = 9,872, p < .02; F 2(1, 35) = 2.71, MSE = 22,533, p > .10. Whereas reading times were longer for the inanimate condition than for the animate condition in gaze duration, the opposite pattern emerged in total time.

Main effects of relatedness at the embedded verb emerged in all three reading time measures, with longer times in the arbitrary condition than in the related condition. These effects were significant in the subject analysis for gaze duration (marginal in the item analysis), F 1(1, 43) = 6.40, MSE = 2,458, p < .02; F 2(1, 35) = 3.55, MSE = 3,773, p < .07, and total time, F 1(1, 43) = 4.49, MSE = 11,321, p < .05; F 2(1, 35) = 2.83, MSE = 14,741, p > .10, but were fully significant in regression path duration, F 1(1, 43) = 12.50, MSE = 7,106, p < .005; F 2(1, 35) = 5.64, MSE = 10,941, p < .05. The animacy × relatedness interaction was not significant in any measure.

Matrix verb

Analysis of regression path duration revealed a significant main effect of animacy at the matrix verb, such that times were longer in the animate condition than in the inanimate condition, F 1(1, 43) = 13.66, MSE = 19,112, p < .005; F 2(1, 35) = 10.05, MSE = 21,147, p < .005. A similar pattern emerged in analysis of total time (significant only in the subject analysis), F 1(1, 43) = 4.15, MSE = 9,446, p < .05; F 2(1, 35) = 2.49, MSE = 15,752, p > .10. There were no other significant main effects or interactions.

Discussion

The results of Experiment 2 showed that ORCs with animate, as compared with inanimate, heads elicited early processing difficulty only when there was no straightforward semantic relationship between the critical NPs. In contrast, when a meaningful relationship existed between NP1 and NP2, no animacy differences were observed until readers had begun processing the verbs of the sentence. As in Experiment 1, these later animacy differences likely reflect the differences in reversibility between the animate and inanimate conditions; whereas the thematic roles of NP1 and NP2 in the animate condition were always reversible, this was usually not the case in the inanimate condition. In addition to these animacy effects, main effects of relatedness also emerged (i.e., total time at NP2, as well as gaze duration, regression path duration, and total time at the embedded verb). These patterns reflect readers’ overall greater ease of processing ORCs when the critical NPs are entities that are typically associated with one another, as compared with when the NPs are paired arbitrarily.

Early difficulty with ORC processing seems to arise from at least two different sources. As readers begin processing the first two NPs of an ORC, they attempt to relate them to one another even before any verbs have been encountered. If these two NPs are semantically related to one another (e.g., The mayor that the senator . . .; The bills that the senator . . .), readers will proceed to the verbs of the sentence without experiencing any early difficulty at NP2. In contrast, if there is no obvious semantic connection between the two NPs (e.g., The waitress that the senator . . .; The recipe that the senator . . .), readers may then rely on the animacy of NP1 to establish an initial meaningful relation between the two NPs. This strategy leads to easier processing when NP1 is inanimate than when it is animate, since readers can more easily understand that the head NP is the object of the RC when it is inanimate than when it is animate. In contrast, later difficulty with ORC processing likely stems from memory retrieval processes related to the reversibility of NP1 and NP2. When both of the critical NPs are animate, readers experience difficulty remembering which NPs should receive which thematic roles and are more likely to reread earlier parts of the sentence, as compared with when thematic roles can be assigned on the basis of animacy differences.

General discussion

The two experiments reported in this article demonstrate that the ease of understanding ORCs is influenced by both the animacy of NP1 and the degree of semantic relatedness between NP1 and NP2. Experiment 1 showed that ORCs are more difficult to process when NP1 is animate than when it is inanimate, even when all other words in the sentence are held constant across conditions. This effect of animacy emerged relatively early in processing, before any verbs had been fixated. Experiment 2 replicated the early effect of animacy that was observed in Experiment 1, but only when there was no straightforward semantic link between NP1 and NP2. In contrast, when NP1 and NP2 were semantically related, the early effect of NP1 animacy disappeared. Later effects of NP1 animacy emerged in both experiments.

Previous studies that have investigated the effects of NP animacy on RC processing (Gennari & MacDonald, 2008, 2009; Mak et al., 2002, 2006; Traxler et al., 2002, 2005) have found that ORCs are more difficult when NP1 is animate and NP2 is inanimate, as compared with the reverse animacy configuration. As described above, these experiments have typically manipulated the animacy of NP1 and NP2 simultaneously, which leads to difficulty in identifying the true source of the reported effects. In Lowder and Gordon (2012), we manipulated the animacy of NP2 while holding the rest of the sentence constant. ORCs with an inanimate, as compared with an animate, NP2 were more difficult to process, but we demonstrated that this effect stemmed from local difficulty associated with integrating an inanimate NP with a verb (e.g., article accused), rather than global difficulty associated with the complex syntax of an ORC. In the present study, we manipulated the animacy of NP1 while holding the rest of the sentence constant. ORCs with an animate, as compared with an inanimate, NP1 were more difficult to process at NP2. Although this pattern is consistent with previous findings using self-paced reading (Gennari & MacDonald, 2008), the explanation for this early effect is less clear. Gennari and MacDonald (2008) proposed that difficulty associated with processing ORCs can be explained largely as a function of violation of readers’ expectations. According to this account, an animate NP1 followed immediately by an animate NP2 is less expected than an inanimate NP1 followed by an animate NP2, which explains the reading time differences at NP2. Although the present study cannot rule out an expectation-based explanation of this phenomenon, another possibility is that inflated reading times at NP2 reflect processes that operate under a memory-based framework of RC processing (e.g., Gordon et al., 2001, 2002, 2004, 2006; Lewis & Vasishth, 2005; Lewis et al., 2006). Accounts of this sort conceptualize ORC difficulty as stemming from processes associated with memory encoding, storage, and retrieval. Thus, the early NP2 effects observed in the present study could reflect an encoding-based strategy in which readers anticipate that they will experience more difficulty remembering two animate NPs than one inanimate and one animate NP and so they adjust their encoding times accordingly (see also Gordon & Lowder, 2012; Johnson et al., 2011). Importantly, these two explanations do not necessarily provide mutually exclusive accounts of early animacy effects in RC processing; in principle, both could explain some aspects of the observed difficulty. Furthermore, it is possible that the accounts may differ with respect to how well they explain animacy × RC-type interactions obtained in different languages, such as Dutch (Mak et al., 2002, 2006), Korean (Kwon, Lee, Gordon, Kluender, & Polinsky, 2010; Lee, Lee, & Gordon, 2007), and English (see Gordon & Lowder, 2012; Lee, Lee, Gordon, & Hendrick, 2010).

Importantly, Experiment 2 showed that the early animacy effect at NP2 disappeared completely when NP1 and NP2 were semantically related. This pattern suggests that previous demonstrations of an early animacy effect (Experiment 1; Gennari & MacDonald, 2008) may have been driven by a subset of experimental materials in which NP1 and NP2 were paired arbitrarily. We propose that as readers begin processing an ORC, they attempt to establish meaningful relations between NP1 and NP2 before encountering the verbs of the sentence. When NP1 and NP2 share a close semantic link, readers can use this information to form an initial representation of these two referents participating in the same event, regardless of whether the NPs are both animate (e.g., The mayor that the senator . . .) or the first is inanimate (e.g., The bills that the senator . . .). Readers then easily proceed to the verbs of the sentence to understand more specifically how the two referents relate to one another in the event. In contrast, when there is no straightforward semantic link between NP1 and NP2, readers can make use of information about the animacy of the two NPs to help form an understanding of how these two referents may be interacting in an event. When NP1 is inanimate and NP2 is animate (e.g., The recipe that the senator . . .), readers can easily understand that the inanimate NP1 is the object of the action being described in the RC, and so they proceed to the verbs of the sentence. However, when there is no semantic link between the two NPs and they do not differ in animacy (e.g., The waitress that the senator . . .), readers have difficulty establishing an initial meaningful relation between them, and so slowdown begins at NP2.

Both experiments produced evidence of animacy effects in later reading time measures and in comprehension question accuracies. In contrast to the early effects, these later effects likely reflect differences in ease of memory retrieval. When NP1 was inanimate and NP2 was animate, the actions described in the sentence were rarely semantically reversible (e.g., The movie that the actress hated . . .), and so readers could rely on their knowledge of the world to assign thematic roles appropriately. In contrast, when both NPs were animate, the actions described in the sentence were always semantically reversible (e.g., The director that the actress hated . . .), and so readers could not rely on world knowledge but had to attend to the syntactic structure of the sentence in order to assign thematic roles appropriately. In cases where readers did not carefully encode the order of the two NPs, they may have experienced difficulty when the NPs later had to be retrieved from memory and integrated with the verbs of the sentence. Thus, participants were more likely to go back and reread the sentence when both NPs were animate, so that they knew which NP was the agent and which was the patient.

Conclusion

Processing of complex syntactic structures is influenced by a wide variety of lexical, semantic, and pragmatic factors. The experiments reported here examined how two semantic factors—animacy and NP relatedness—influence both early and later stages of complex-sentence processing. The results demonstrate that object-relative clauses with an inanimate, as compared with an animate, NP1 can facilitate early processing, but only when there is no straightforward semantic relationship between NP1 and NP2. When the two critical NPs do share a strong semantic link, readers will use this information rather than animacy information to form an initial representation of the two referents participating in the same event. In addition, differences in NP animacy influence later stages of complex-sentence processing, regardless of the relationship between NP1 and NP2. We believe that these later effects emerge as a result of memory retrieval processes that operate during the assignment of thematic roles.