How do children deal with inconsistencies in text? An eye fixation and self-paced reading study in good and poor reading comprehenders

van der Schoot, Menno; Reijntjes, Albert; van Lieshout, Ernest C. D. M.

doi:10.1007/s11145-011-9337-4

How do children deal with inconsistencies in text? An eye fixation and self-paced reading study in good and poor reading comprehenders

Open access
Published: 18 August 2011

Volume 25, pages 1665–1690, (2012)
Cite this article

Download PDF

You have full access to this open access article

Reading and Writing Aims and scope Submit manuscript

How do children deal with inconsistencies in text? An eye fixation and self-paced reading study in good and poor reading comprehenders

Download PDF

Menno van der Schoot¹,
Albert Reijntjes² &
Ernest C. D. M. van Lieshout¹

4339 Accesses
61 Citations
Explore all metrics

Abstract

In two experiments, we investigated comprehension monitoring in 10–12 years old children differing in reading comprehension skill. The children’s self-paced reading times (Experiment 1) and eye fixations and regressions (Experiment 2) were measured as they read narrative texts in which an action of the protagonist was consistent or inconsistent with a description of the protagonist’s character given earlier. The character description and action were adjacent (local condition) or separated by a long filler paragraph (global condition). The self-paced reading data (Experiment 1), the initial reading and rereading data (Experiment 2), together with the comprehension question data (both experiments), are discussed within the situation model framework and suggest that poor comprehenders find difficulty in constructing a richly elaborated situation model. Poor comprehenders presumably fail to represent character information in the model as a consequence of which they are not able to detect inconsistencies in the global condition (in which the character information is lost from working memory). The patterns of results rule out an explanation in terms of impaired situation model updating ability.

Word processing difficulty and executive control interactively shape comprehension monitoring in a second language: an eye-tracking study

Article Open access 04 March 2022

The relations between children's comprehension monitoring and their reading comprehension and vocabulary knowledge: an eye-movement study

Article 20 July 2019

Eye movements and reading comprehension performance: examining the relationships among test format, working memory capacity and reading comprehension

Article 18 March 2023

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

An important component skill of reading comprehension in children is comprehension monitoring (e.g., Cain, Oakhill, & Bryant, 2004; Eme, Puustinen, & Coutelet, 2006; Rubman & Waters, 2000; Zinar, 2000). Comprehension monitoring refers to the processes by which readers evaluate their understanding of a text. Skilled readers who evaluate their comprehension constantly ask themselves if what they are reading makes sense. If it does not, they apply repair strategies to restore comprehension. In practice, comprehension monitoring often comes down to detecting and, if possible, resolving inconsistencies such as contradictory sentences, or statements conflicting with world knowledge. Measures of comprehension monitoring therefore usually involve the analysis of verbal and nonverbal responses immediately following a consistency violation (Skarakis-Doyle, 2002) or answers to questions following a text containing inconsistent or conflicting information (Zinar, 2000).

Although some studies have shown that children with comprehension difficulties have problems with dealing with internal inconsistencies in a text (e.g., Ehrlich, 1996; Ehrlich, Remond, & Tardieu, 1999; Vosniadou, Pearson, & Rogers, 1988; Yuill & Oakhill, 1991; Zabrucky & Moore, 1989), there is no full agreement yet on the locus of this effect. One hypothesis is that poor comprehenders have difficulty in detecting the inconsistencies because they are impaired in the ability to construct a richly elaborated and coherent situation model from the text (e.g. Albrecht & O’Brien, 1993; Myers, O’Brien, Albrecht, & Mason, 1994; Rinck, Hahnel, & Becker, 2001; Rubman & Waters, 2000). Supposedly, this prevents them from (accurately) interpreting later text information in the light of earlier text information. Another hypothesis is that they are especially poor at resolving the inconsistencies. In this account, poor comprehenders are capable of detecting inconsistencies, but instead do not engage in, or lack knowledge of, appropriate comprehension-repair strategies (e.g. Gersten, Fuchs, Williams, & Baker, 2001; Hyönä, Lorch, & Rinck, 2003; Rinck, Gamez, Diaz, & de Vega, 2003; Wagoner, 1983). The main aim of this study was to replicate the finding of comprehension monitoring difficulties in children, and then to decide between the two above hypotheses. In addition, the role of verbal working memory capacity in comprehension monitoring was investigated. For these purposes, we adopted the situation model approach (Zwaan & Radvansky, 1998) to serve as our framework in which we set up the experiment and interpreted the data.

The situation model framework

It is generally assumed that reading comprehension involves the reader’s ability to construct a mental representation of the situation described in a (narrative) text, rather than a representation of the text itself (e.g. Zwaan & Radvansky, 1998). Successful comprehenders build such a representation, called a situation model, by monitoring a number of key situational dimensions such as time, space, causation, and the protagonists’ characteristics, goals and emotions (Zwaan, Langston, & Graesser, 1995a, Zwaan, Magliano, & Graesser, 1995b). By integrating information from these dimensions, readers gradually update their representation and build a coherent and richly-connected situation model. Within the situation model framework, updating thus refers to the process of incorporating a new sentence or clause into the evolving model. For present purposes, it is important to recognize that inconsistencies interfere with the updating of the situation model as new incoming information is more difficult to integrate into the evolving model if it is inconsistent, or less consistent, with the information in the current state of the model.

The inconsistency detection task

To measure construction and updating of situation models, we used the inconsistency detection task described earlier by, among others, Albrecht and O’Brien (1993), Huitema, Dopkins, Klin and Myers (1993), Long and Chong (2001), O’Brien, Rizella, Albrecht and Halleran (1998) and Poynor and Morris (2003). In this task, participants typically read a text in which the action of a protagonist (e.g. eating a hamburger at McDonald’s) is either consistent or inconsistent with the description of the protagonist’s character (fast-food addict vs. vegetarian) or goals (He wanted to have a quick bite before going home vs. He wanted to go wining and dining in a five-star restaurant) presented earlier in the text. The character/goal description and the performed action can be adjacent (local condition) or can be separated by a substantial amount of intervening text (global condition). The purpose of the long filler paragraph in the global condition is to ensure that the character/goal description is eliminated from working memory by the time readers encounter the target action (Albrecht & O’Brien, 1993; Long & Chong, 2001). This means that if readers want to keep the character/goal information accessible for further processing they have to represent it in the situation model which is assumed to be developed and stored in long-term memory or long-term working memory (Ericsson & Kintsch, 1995). In the local condition, on the other hand, the character/goal elaboration is assumed to be still active in (short-term) working memory when participants read the target action.

Importantly, the inconsistency detection task can be used to explore the different types of information incorporated into situation models. The main argument behind this has been recently formulated by Hyönä, Lorch and Rinck (2003, p. 324): “If readers represent a certain type of information in the situation model, they should exhibit comprehension difficulties upon encountering a sentence that is inconsistent with regard to this information”. Obviously, the reverse is true as well. That is, if readers do not build some piece of information into their situation model and a later sentence contradicts this information, the problem arises not so much in solving the inconsistency (updating the situation model and, if possible, restoring its coherence) as it does in detecting it (an inconsistency will simply go unnoticed if the information that is contradicted has been lost from memory). By applying these lines of argument, studies using (different versions of) the inconsistency detection task revealed not only the representation of character information (Albrecht & O’Brien, 1993; de Vega, Diaz, & Leon, 1997) and goal information (Huitema et al., 1993; Poynor & Morris, 2003) in situation models constructed from narrative texts but also the representation of emotional information (Gernsbacher, Goldsmith, & Robertson, 1992), spatial information (de Vega, 1995; O’Brien & Albrecht, 1992) and temporal information (Rinck et al., 2001).

The present study

To our knowledge, most of the above-cited studies as well as other studies on this topic have been conducted with (young) adults. This prompted us to investigate the use of situation models in good and poor comprehenders who are between 10 and 12 years old, thereby extending a series of our previous studies on reading comprehension in children attending fifth and sixth grade at a regular primary school (van der Schoot, Vasbinder, Horsley, & van Lieshout, 2008, van der Schoot, Bakker Arkema, Horsley, & van Lieshout, 2009a, van der Schoot, Vasbinder, Horsley, Reijntjes, & van Lieshout, 2009b, van der Schoot, Horsley, & van Lieshout, 2010). Specifically, the goal was to find out whether poor comprehending children differ from good comprehending children in the extent to which they construct a richly elaborated situation model, or whether the difference between the two groups mainly resides in the ability to update the situation model (or working memory) representation.

How does the inconsistency detection task—in which, in the present study, we varied the character of the protagonist—differentiate between these two types of problems? Above, we have already explained how the inconsistency detection task is more or less designed to reveal difficulties in situation model construction. Readers who do not represent the protagonist’s character in their situation model have a far worse chance of detecting the inconsistency of the protagonist’s later action. At least, this is to be expected in the global condition in which the character information is lost from working memory. In the local condition, however, even the poor situation model constructors (i.e. readers who leave the character elaboration out of their situation model) are able to detect inconsistencies since the character information against which the target action needs to be evaluated is still active in working memory. In this study, it is therefore the way in which the local inconsistencies are dealt with that can ultimately differentiate between readers who try to actively update their situation model and readers who do not. The former make an attempt to resolve the inconsistencies and restore comprehension in the situation model. The latter do not take reparative action, either because they lack knowledge of comprehension-repair strategies or because they do not know how to apply them (Gersten et al., 2001).

To investigate comprehension monitoring in 10- to 12-year-old good and poor comprehenders, we conducted two experiments both of which employed the same inconsistency detection task. In Experiment 1, we examined situation model construction and updating by measuring the participants’ self-paced reading times. In Experiment 2, we used the eye tracking method and measured their eye fixations and regressions. To be able to examine group differences adjusted for working memory capacity, we included this variable as covariate in both experiments. Working memory capacity has been found to be related to comprehension skill in tasks that require the simultaneous storage and processing of verbal information (Cain et al., 2004; Seigneuric, Ehrlich, Oakhill, & Yuill, 2000). This implies that working memory capacity may affect performance in the present task and reduce the differences between good and poor comprehenders that are hypothesized below.

Experiment 1

In addition to looking at the participants’ answers to questions following the experimental texts, we addressed the research questions by looking at the pattern of reading times as determined by means of the self-paced (sentence-by-sentence) moving window method (Just, Carpenter, & Woolley, 1982). In line with the findings in experienced adult readers (e.g. Huitema et al., 1993; Long & Chong, 2001), we hypothesized that in the global and local condition, the good comprehenders would spend more time reading the target sentence when it was inconsistent with the earlier-described character than when the same sentence was consistent with the character description. It is assumed that these longer reading times reflect, at least in part, the processes involved in the detection and resolution of consistency violations (e.g. Albrecht & O’Brien, 1993).

For the poor comprehenders, two possibilities were hypothesized. The first hypothesis departs from the assumption that poor comprehenders are impaired in their ability to construct a richly elaborated situation model (and thus fail to incorporate the character information) but that they do try to update the model (or working memory system) when the information which they did build-in is contradicted and in need of revision. Thus, under the first hypothesis, we expected poor comprehenders to read inconsistent actions more slowly than consistent ones in the local condition (in which the protagonists’ actions can be viewed in the light of their character as the character information is still active in working memory) but not, or much less so, in the global condition (in which the target actions may not be evaluated against any character information as this information is likely to be lost from memory). The starting assumption underlying the second possible hypothesis is that poor comprehenders are especially poor at updating the situation model. This implies that even when poor comprehenders are able to detect an inconsistency they are not expected to adjust their reading in an attempt to resolve it. Hence, under the second hypothesis, no differences in reading times were anticipated between inconsistent and consistent actions in both the global and local condition.

Method

Participants

The participants were 31 children (18 boys/13 girls) with high reading comprehension levels (good comprehension group) and 26 children (16 boys/10 girls) with low reading comprehension levels (poor comprehension group). The groups were matched on age (M = 11.3, SD = .6 vs. M = 11.5, SD = .6, respectively, t(55) = 1.32, ns) and decoding skill (M = 81.52, SD = 12.50 vs. M = 77.19, SD = 12.42, respectively, t(55) = 1.31, ns). Decoding skill was assessed by the EMT, a standardized Dutch word reading test (Brus & Voeten, 1999). The EMT showed that the word decoding skills in all participating children were around their grade level and thus more or less automatized.

The children attended either Grade 5 or 6 at a regular primary school in the Netherlands. They were native speakers of Dutch and had normal or corrected-to-normal vision. Exclusion criteria were any neurological disorder and IQ less than 85 (IQ was estimated on the basis of two subtests (Vocabulary and Block Design) of the Dutch version of the Wechsler Intelligence Scale for Children—Revised (van Haasen, 1986)). For all participating children informed consent was obtained from their parents or care-takers.

Assessment of reading comprehension level

Children were classified as good or poor comprehenders based on their performance on the (Grade 5 and Grade 6 versions of the) standardized Test for Reading Comprehension of the Dutch National Institute for Educational Measurement (CITO) (“Toets Begrijpend Lezen”, Staphorsius & Krom, 1998). This test is part of the standard Dutch CITO pupil monitoring system and is designed to determine general reading comprehension level in primary school children.

To classify children as good or poor comprehenders, we compared the test scores with their age-appropriate norm scores. Children whose scores were among the highest 25% of the norm scores were classified as good comprehenders (M = 70.71, SD = 11.24), children scoring among the lowest 25% were classified as poor comprehenders (M = 35.96, SD = 12.49). The educational age-norms for average reading comprehension level were obtained in extensive standardization studies on reading in the Dutch population of primary school children (Staphorsius & Krom, 1998).

Materials and design

The experimental texts had the following structure. Each text began with a three- or four-sentence paragraph introducing a person. This was followed by a three- or four-sentence elaboration paragraph in which the characteristics (or preferences) of the person are described. These characteristics were either consistent or inconsistent with an action performed by the person later in the text. The elaboration paragraph was followed by a filler paragraph, which contained either one sentence (local condition: mean (M) number of words = 11.4, SD = 1.9) or five to seven sentences (global condition: M = 77.8, SD = 5.1). The text continued with the target sentence in which the person performs the action that is either consistent or inconsistent with his character described in the elaboration paragraph. For example, the target sentence Peter ordered a cheeseburger and fries is consistent with his description as a fast-food addict but inconsistent with his description as a vegetarian (example taken from Albrecht & O’Brien (1993) and Long & Chong (2001)). In the local condition, the character elaboration (in this case, Peter’s food preferences) was still active in working memory while reading the target sentence as the character description and the target action were only separated by one intervening sentence. On the other hand, in the global condition, the long filler paragraph ensured that the character description was eliminated from memory (see Albrecht & O’Brien, 1993; Long & Chong, 2001). Importantly, all filler paragraphs were content-neutral with regard to the target action. The target sentence was followed by a second critical sentence (post-target sentence) so as to detect possible spillover effects. Each text ended with a brief closing paragraph.

Thus, in total, there were 4 within-subjects conditions formed by the crossing of 2 factors: location (local vs. global) and consistency (consistent vs. inconsistent). Each subject was presented with 8 experimental texts, 2 in each condition. The stimuli were arranged in 4 material sets, each containing the 8 texts. Each set was presented to 8 good comprehenders (with the exception of one set which was presented to 7 good comprehenders) and 7 poor comprehenders (with the exception of two sets which were presented to 6 poor comprehenders).

In order to ensure full combination of conditions and materials (and control for text effects), the different versions of each text were counterbalanced across the material sets by means of a 4 × 4 Latin square design. Thus, across sets and across participants, each story occurred equally often in the local/consistent, local/inconsistent, global/consistent and global/inconsistent version. The order in which the texts were presented in each set was pseudorandomized.

Self-paced moving window method

Reading times were collected by means of the self-paced moving window method (Just et al., 1982). In this method, subjects are presented with passages of text on a computer screen in such a way that words are masked by X’s. A window reveals one word, or one sentence, at a time to the reader. When the reader is finished comprehending a word/sentence, they press a key to move the window to the next word/sentence. Assuming that information is processed as soon as it is perceived (Just & Carpenter, 1980), the key pressing latencies (i.e. reading times) in the self-paced moving window task reflect the processing of the word/sentence contained in the window.

In this experiment, we made use of the self-paced sentence-by-sentence procedure. Reading time on the target and post-target sentence was thus defined as beginning when the sentence was first revealed and lasting until the next key press.

Procedure

Before the start of the inconsistency detection task, children were informed that the study was designed to examine reading comprehension of text displayed on a screen. They were instructed to read at their normal rate and to comprehend what they were reading as well as they could. As for the text presentation procedure, seven or eight masked sentences were presented on one screen. Each sentence consisted of either one or two lines of text. The target and post-target sentences always fit on one line of the screen. At the start, the moving window was placed on the first sentence of the first text. Subjects used the down-arrow key to see each successive sentence in a text. Pressing the down-arrow key on the last sentence on a screen caused the window to move to the first sentence on the next screen. The last sentence of each text revealed the words “NEXT TEXT” indicating the end of the text and the beginning of the next one. The first experimental sentence on the first screen was preceded by four test sentences.

The experimental texts were interspersed with short filler texts to prevent the subjects from becoming aware of the purpose of the experiment. After each text (i.e. each time the moving window revealed the words “NEXT TEXT”), a Yes or No comprehension question was asked. The question pertained to the situational content of the text just read and was asked orally by the experimenter. After answering the question, the subjects were instructed to directly jump to the next text (by pressing the down-arrow key). To prevent them from becoming tired, participants were given a break halfway through the experiment.

The children were tested at school. They completed the inconsistency detection task, sentence span task (see below), and word reading test in a silent room and the test for reading comprehension in the classroom (whole-class test taking). The experimenter was always present. In total, the experiment lasted approximately 1 h and 45 min.

Working memory capacity measure

To assess working memory capacity, an adaptation of the Sentence Span Task (Daneman & Carpenter, 1980; Swanson, 1994; Swanson, Cochran, & Ewers, 1989) was administered. In this task, participants were asked to read aloud groups of unrelated sentences (7–10 words in length). After reading, their task was to recall the last words of the sentences in the right order, and to answer a comprehension question about one of the sentences. The purpose of the question was to make sure that children read for comprehension and did not merely try to remember the target words. The number of sentences in the groups gradually increased. Working memory capacity was defined as the largest group of end words recalled (with the additional requirement that the comprehension question was answered correctly). The sentence span task measures verbal working memory capacity and predicts performance on reading tasks as well as other related tasks (Daneman & Green, 1986; Masson & Miller, 1983).

Results

Reading time on the target sentence

On the reading times, overall 2 × 2 × 2 analyses of variance (ANOVA) on the subject (F ₁) and item (i.e. text) (F ₂) means were conducted with consistency and location as within-subject variables and with comprehension group as the between-subject variable. In Fig. 1, reading time on the target sentence (in milliseconds) is presented as a function of consistency (consistent vs. inconsistent), location (local vs. global) and comprehension group (good vs. poor). The results of the ANOVA showed that, on the subject means, reading times were faster in the global than local condition (F ₁(1,55) = 5.40, p < .05, η ²_p = .09; F ₂(1,2) = .45, ns), and that good comprehenders tended to read the target sentence faster than poor comprehenders (F ₁(1,55) = 3.11, p < .1, η ²_p = .05; F ₂(1,2) = 9.39, p < .1, η ²_p = .82; 0.01 is considered a small partial eta-squared effect size, 0.06 is considered a medium effect, and 0.14 is considered a large effect Stevens, 2002). More important here is that the effect of consistency varied as a function of the location of the target sentence in poor but not in good comprehenders. In the local condition, both good and poor comprehenders read the target sentences more slowly when they were inconsistent with the character description than when they were consistent with it (Consistency × Group: F ₁(1,55) = .35, ns; F ₂(1,2) = .43, ns). In the global condition, on the other hand, the inconsistency effect was present in the good but not in the poor comprehenders (who read inconsistent sentences as fast as consistent ones; Consistency × Group interaction: F ₁(1,55) = 4.40, p < .05, η ²_p = .07; F ₂(1,2) = 9.90, p < .1, η ²_p = .83). In other words, poor comprehenders slowed down their reading on inconsistent sentences when the target action directly followed the character elaboration, but not when the target sentence and character elaboration were interspersed with a long filler paragraph (Consistency × Location: (F ₁(1,55) = 4.44, p < .05, η ²_p = .15; F ₂(1,2) = 14.33, p < .1, η ²_p = .88). The above pattern of results was evident in the significant (F ₁) and marginally significant (F ₂) Consistency × Location × Group interaction (F (1, 55) = 4.51, p < .05, η ²_p = .08; F ₂(1, 2) = 12.99, p < .1, η ²_p = .87). The three-way interaction disappeared, however, when working memory capacity was entered as a covariate (F ₁(1,54) = 2.37, p = .13).

Reading time on the post-target sentence

In Table 1, reading time on the post-target sentence (in milliseconds) is presented as a function of consistency (consistent vs. inconsistent), location (local vs. global) and comprehension group (good vs. poor). The analyses on the reading times on the sentence immediately following the target sentence showed that good comprehenders had faster reading times than poor comprehenders (F ₁(1,55) = 7.79, p < .01, η ²_p = .12; F ₂(1,2) = 15.99, p = .057, η ²_p = .89). In addition, both groups exhibited an effect of location (F ₁(1,55) = 4.16, p < .05, η ²_p = .07; F ₂(1,2) = 77.94, p < .05, η ²_p = .98), signifying that they read the post-target sentence slower in the local then global condition. However, analyses on the post-target sentence reading times failed to show any significant differences between consistent and inconsistent sentences.

Table 1 Post-target sentence reading times (in milliseconds) as a function of consistency (consistent vs. inconsistent), location (local vs. global) and comprehension group (good vs. poor) (+SE)

Full size table

Mixed-effects modeling

In addition to ANOVA, we also analyzed the data using mixed-effects modeling with the maximum likelihood method to calculate parameter estimates. Mixed-effects modeling is a statistical technique for data repeatedly observed from the same subjects and/or materials. It is gaining popularity over the conventional methods because differences between individuals and differences between materials are modeled by means of (crossed) random effects, resulting in, among other things, increased power, a better account of heterogeneity of variance, and a better use of available data (e.g. Baayen, Davidson, & Bates, 2008; Bicknell, Elman, Hare, McRae, & Kutas, 2010; Kliegl, 2007, for an application of mixed-effects modeling in reading comprehension research using, respectively, self-paced reading and eye tracking).

In the present analyses, subjects and materials (i.e. experimental sentences) were thus treated as random effects, and consistency, location and group as fixed effects. The fixed effects were coded as follows: Group (good = 1, poor = 0), Consistency (consistent = 1, inconsistent = 0) and Location (local = 1, global = 0). To control for word-related effects, reading times were normalized by the number of words in the sentence, and the sentences’ average log word frequency (obtained from CELEX, the database from the Dutch Centre for Lexical Information Baayen, Piepenbrock, & Gulikers, 1995) was added as covariate to exclude a word frequency confound. We used the logarithm of word frequency because reading times are linearly related to the logarithm of word frequency, not to raw word frequencies (Haberlandt & Graesser, 1985; Just & Carpenter, 1987).

Results confirmed the findings reported above. The analysis on the target sentence reading times showed main effects of Group (F(1, 57) = 4.42, p < .05), Consistency (F(1, 399) = 6.41, p < .05) and Location (F(1,399) = 6.83, p < .01) (The F values are produced by Type III Tests of Fixed Effects, and since the design is balanced, they are expressed as the ratio of the appropriate sums of squares). More importantly, the critical Consistency × Location × Group interaction was significant in predicting the reading times on the target sentence (F(1,399) = 9.55, p < .005) (model fit: −2 log likelihood = 6,225.77). The theoretically meaningful parameter estimates for Consistency (PE = 68.89, SE = 39.59) and Consistency × Location × Group (PE = 234.67, SE = 75.92) were, respectively, marginally significant (p < .1) and significant (p < .005). Results for the post target sentence indicated a main effect of Group (F(1,56.87) = 8.02, p < .01) and Location (F(1,397.74) = 5.40, p < .05), but no significant effects of Consistency, neither as a main effect, nor through its interactions with the other factors (model fit: −2 log likelihood = 8,078.72). Parameter estimates for Consistency (PE = 51.81, SE = 298.39) and Consistency × Location × Group (PE = 254.31, SE = 572.22) were also not significant.

Comprehension questions

In Fig. 2, the average number of correct answers to the comprehension questions is presented as a function of consistency (consistent vs. inconsistent), location (local vs. global) and comprehension group (good vs. poor). As can be seen from Fig. 2, good comprehenders answered more questions correctly than poor comprehenders (F(1,55) = 11.06, p < .005, η ²_p = .17). Of more significance is that the interaction between group and consistency depended on location, yielding a three-way interaction between those factors (F(1,55) = 4.97, p < .05, η ²_p = .08). In the global condition, poor comprehenders gave more incorrect answers to inconsistent than consistent texts. Good comprehenders did not show this effect of consistency (Consistency × Group: F(1,55) = 8.01, p < .01, η ²_p = .13), nor did both groups in the local condition (Consistency × Group: F(1,55) = .00, ns). The three-way interaction remained marginally significant after controlling for working memory (F(1,54) = 3.22, p < .1, η ²_p = .06).

Discussion

Experiment 1 investigated comprehension monitoring in 10–12 years old children differing in general reading comprehension skill. The children’s reading times were measured as they read narrative texts in which an action of the protagonist was consistent or inconsistent with a description of the protagonist’s character given earlier. The character description and action were adjacent (local condition) or separated by a long filler paragraph (global condition). The goal was to find out whether poor comprehenders mainly differ from good comprehenders in situation model construction or updating. In this study, situation model construction specifically refers to the richness of the model, which here concerns the question of whether or not a reader incorporated the character information. Situation model updating refers to the adaptability and solution-readiness of the model in case of a consistency violation.

For the good comprehenders, we hypothesized that they would slow down their reading on inconsistent actions when compared to consistent ones in both the local and global condition. The target sentence reading time analysis confirmed this hypothesis at both the subject- and item-level, and in both the ANOVA and mixed model analysis, indicating that good comprehenders detected the inconsistencies and made an attempt to resolve them (e.g. Albrecht & O’Brien, 1993). Probably, the extra time good comprehenders spent on an inconsistent target sentence reflected their effort to double-check the inconsistency and/or think up possible resolutions. It is important to note that from the inconsistency effect they displayed in the global condition it can be inferred that good comprehenders must have represented the character elaboration in the situation model they constructed from the text. Otherwise, the character information would have been lost from their working memory when they read the target sentence as a consequence of which they would have missed its inconsistency (thereby leaving their above-mentioned updating skills unused).

The results for the good comprehenders are in line with the results obtained in experienced adult readers (e.g. Huitema et al., 1993; Long & Chong, 2001). Yet, this study is one of the few demonstrating the current situation model construction and updating skills in children. Apparently, skilled 10–12 years old readers who have automatized their lower-level word decoding skills (as was the case here) have freed up enough processing capacity for the higher-level processes to enhance reading comprehension. In particular, the present study showed that they can carry out reading comprehension strategies to aid the construction and updating of a coherent and richly-connected situation model of a text.

In poor comprehenders, we found that reading times were slower on inconsistent than consistent target sentences in the local but not the global condition. This pattern of reading times, found at both the subject- and item-level, and in both the ANOVA and mixed model analysis, provides clear support for one of the two proposed hypotheses. This hypothesis says that poor comprehenders find difficulty in constructing a rich situation model. That is, they tend to leave situation-relevant information out of the model, including information that could be of use to them in interpreting later information in the text. In the present inconsistency detection task, this implies that poor comprehenders presumably failed to represent the character of the protagonist in their situation model in most stories. At least, this explains why they did not show any effect of inconsistency in the global condition. The inconsistencies simply went unnoticed as the information that was contradicted was neither present in their (short-term) working memory nor (long-term) situation model and therefore was no longer accessible.

Here, it is important to realize that the inconsistency effect which is exhibited by poor comprehenders in the local condition rules out an explanation in terms of impaired updating ability. Under the assumption that in this condition the character elaboration is still active in working memory, this finding indicates that when poor comprehenders understand that new information contradicts previous information they do make an attempt to resolve the inconsistency and restore comprehension in their current (working memory) representation of the text. As in good comprehenders, the extra time poor comprehenders spend reading an inconsistent action statement probably reflected their effort to double-check the inconsistency and/or elaborate possible resolutions. It should be mentioned, however, that the effects of inconsistency were limited to the target sentence reading times in that they did not extend into the spillover sentence immediately following the action sentence. This finding, obtained in both the subject and item analyses, and in both the ANOVA and mixed model analysis, is intriguing but difficult to interpret, especially since similar studies as the present one have previously demonstrated such a spillover effect (Albrecht & O’Brien, 1993; Long & Chong, 2001; Poynor & Morris, 2003).

The above-described differences in situation model construction and updating between good and poor comprehenders were not only evident in the reading time data but also in the data on the comprehension questions (which were assumed to at least partially tap the situation model representation that the reader constructed from the text). It was found that the negative effect of consistency violations on comprehension were restricted to the poor comprehenders’ answers given in the global condition. As was the case with the reading time data, this can be accounted for by assuming that poor comprehenders did not build character information into their situation model as a consequence of which they had a worse chance of noticing global inconsistencies. So, the comprehension question data coincide with the reading time data, and together, they indicate that poor comprehenders found it especially hard to build and maintain a coherent and integrated situation model from globally inconsistent texts.

This leaves us with the question of how to explain the influence of verbal working memory capacity. As was demonstrated in the analysis of covariance, the interacting effects of consistency, location and group disappeared after controlling for the participants’ sentence span. The most dominant conceptions of both verbal and non-verbal working memory assume that working memory consists of a storage component, in which information is maintained during processing, and a processing component, which coordinates the mental activities that are required by the task at hand (Baddeley, 1986; Shah & Miyake, 1996). In the present sentence span task, the storage component involved reading aloud a small set of sentences and the processing component involved remembering the last word of each sentence.

It is generally assumed that the relationship between verbal working memory tasks, such as the present sentence span task, and reading comprehension is mediated by the processing component of verbal working memory, rather than by the storage component (e.g. Daneman, 1987; Daneman & Tardif, 1987). According to this assumption, the lower verbal memory span of poor comprehenders demonstrated in this and other studies is thus a direct reflection of their weak language comprehension skills. Interestingly, Nation, Adams, Bowyer-Crane and Snowling (1999) showed that poor comprehenders had lower spans only on memory tasks that called upon semantic processing skills. On a series of other non-verbal/non-linguistic tasks, including one tapping spatial working memory, poor comprehenders were not impaired. From this, Nation et al. (1999) concluded that the poor comprehenders’ difficulties with reading and language skills can not be related to a general processing capacity weakness. Instead, they came to the conclusion that “…the memory difficulties associated with poor reading comprehension are specific to the verbal domain and are a concomitant of language impairment, rather than a cause of reading comprehension failure” (p. 139). Probably, this may account for the finding that the interaction effects of consistency and location depended on the poor and good comprehenders’ verbal working memory capacity. That is to say, the effects were weakened supposedly because the covariate represented the same conceptual measure (i.e. semantic processing skill) as the patterns of the dependent variables.

Experiment 2

In Experiment 1, a self-paced reading procedure was adopted to investigate situation model construction and updating in children. Even though it may not be as ecologically valid as the eye tracking method (which provides a more natural reading experience), we assumed that the self-paced moving window method has the capacity to tap into these (and other) reading comprehension processes and reveal the hypothesized differences therein between good and poor comprehenders. This assumption is supported by results obtained by, among others, Just et al. (1982) and Rinck et al. (2003). Just et al. (1982, p. 228) qualitatively and quantitatively compared the self-paced moving window method with the eye tracking method and concluded that “the word-level effects are generally similar”. Specifically, the correlation between the mean fixation duration on each word (averaged over subjects) and the mean button-pressing latency was .57. More recently, Rinck et al. (2003) compared two experiments on the processing of temporal text information. These experiments were the same except that one used the eye tracking method and the other collected self-paced reading times (sentence-by-sentence). Rinck et al. concluded that the results of both experiments were “in perfect accordance” (p. 81).

Nevertheless, it should be acknowledged that the self-paced moving window method has some disadvantages. One potential problem with the moving window method when a word-by-word presentation is used is that it may induce an artificial buffering strategy because readers cannot press the button as fast as they can comprehend the word (e.g. Danks, 1986). Obviously, this limitation could not have played a role here since in Experiment 1 sentence-by-sentence reading times were collected (see also Magliano, Graesser, Eymard, Haberlandt, & Gholson, 1993). On the other hand, two other limitations may have affected the present results and their generalizability. First, the self-paced moving window method tends to slow readers down (Rayner, 1998). Second and more importantly, it prevents readers from looking back, or prevents the experimenter from observing any look-backs (in case a cumulative method is used; see Rayner, 1998).

To overcome these limitations, we conducted a second experiment in which the readers’ eye movements and fixations were monitored as they read the same experimental texts used in Experiment 1. In Experiment 2, readers were therefore free to reread whenever they felt necessary. This allowed us to examine whether readers engage in any other comprehension-repair strategy than that of reading the inconsistent target sentence more slowly (see also Hyönä et al., 2003). Of course, slowing down may be the right strategic choice if they want to double-check the inconsistency or think up some resolutions for the contradiction (Yuill & Oakhill, 1991). But readers might also wonder whether they misunderstood earlier parts of the text. In that case, the right thing to do would be to look back at the possible source of the inconsistency, i.e. the character description (Hyönä et al., 2003; Zabrucky & Ratner, 1986).

Thus, Experiment 2 enabled us to distinguish between initial reading and rereading patterns while seeking converging evidence for the conclusions of Experiment 1. In a more general sense, Experiment 2 provided the opportunity of a two-method comparison as any differences in results between the two experiments will be related to the two methods of presentation and data collection (self-paced moving window method vs. eye tracking method).