Introduction

People conceive of everyday activity as composed of discrete events and seem to do so without effort. A baseball game breaks down into innings and at-bats; making a sandwich breaks down into collecting the ingredients, stacking them on the bread, and cleaning up. In these cases and in many others, everyday activity is hierarchically broken down into parts and subparts that are determined by social conventions or by hierarchical relations amongst goals (Dickman, 1963; Brewer & Dupree, 1983; Lichtenstein & Brewer, 1980; Zacks & Tversky, 2001). Yet, at times in healthy aging and aging with dementia, people may experience difficulty segmenting continuous activity into the appropriate parts and subparts. In the clinic, this may present as difficulties with tasks of daily living or with following conversations or television (Galvin et al., 2005). In this study, we investigated age-related variation in the ability to segment goal-directed activity into events and sub-events and how this segmentation is related to memory for events.

Understanding the hierarchical structure of activity is important for building a representation of what is currently happening in the world, developing event schemata, producing one’s own actions, and remembering what has happened (Boltz, 1992; Bower, Black & Turner, 1979; Foley & Ratner, 2001; Grafman, Thompson, Weingartner, Martinez, Lawlor, & Sunderland, 1991; Newtson & Engquist, 1976; Zacks & Tversky, 2001). In Newell and Simon’s classic General Problem Solver (Newell & Simon, 1972), plans are formulated by identifying a superordinate goal and a set of subgoals that must be executed to satisfy that goal. Children and adults alike show hierarchical patterns of recall for videotaped events and narrative text; they tend to group actions with their superordinate goals (Bauer & Mandler, 1989; Bower et al., 1979; Brewer & Dupree, 1983; Lichtenstein & Brewer, 1980; Travis, 1997). In narrative comprehension, hierarchical event schemata guide memory for text (Bower et al., 1979; Franklin & Bower, 1988; Rumelhart, 1977), and readers have been found to track and coordinate the goal plans of fictive characters (see Trabasso & Wiley, 2005). Infants show evidence of grouping actions with higher-order goals, gazing longer at familiar action sequences that accomplish an unfamiliar goal compared to unfamiliar action sequences that accomplish a familiar goal (Sommerville & Woodward, 2005).

Segmenting events into parts and subparts

In order to understand the hierarchical structure of activity, it is important to identify the behaviors in the perceptual stream that correspond to that structure. In particular, one needs to segment the ongoing perceptual stream into meaningful events. Behavioral and neurophysiological evidence suggests that segmentation is an ongoing concomitant of event perception (for reviews, see Newtson, 1976; Zacks & Tversky, 2001; Zacks, Speer, Swallow, Braver, & Reynolds, 2007) and that there is a high level of agreement regarding the beginnings and ends of activity (Newtson, 1976; Speer, Swallow, & Zacks, 2003).

Consistent with the importance of segmentation to event perception, evidence suggests that participants’ segmentation of events is hierarchically organized (Hard, Tversky, & Lang, 2006; Zacks, Tversky, & Iyer, 2001). A direct observation of this behavior is provided by having participants perform a segmentation task in which participants watch movies of an actor engaged in everyday activities and press a button when a meaningful unit of activity ends and another begins (Newtson, 1973). Participants segment the same activity twice, once identifying the largest meaningful units of activity (coarse grain), and once identifying the smallest meaningful units (fine grain). Event segmentation patterns are typically hierarchically aligned such that for every coarse event boundary, there tends to be a fine boundary nearby that is closer than expected by chance (Hard et al., 2006; Zacks et al. 2001), and such that the boundaries of fine events tend to be enclosed by the course events they make up (Hard et al., 2006). Consider making a sandwich. If a participant were segmenting hierarchically, then a coarse event boundary signaling the end of assembling the ingredients would coincide with the temporal location of a fine boundary that signals the end of the final subevent, such as applying the last piece of meat. The Appendix illustrates the kinds of units that people identify.

Effects of aging on event segmentation

Aging may affect how well one can encode and maintain event representations. Older adults tend to have worse episodic memory for events than younger adults (Balota, Dolan, & Duchek, 2000), as well as worse memory for the temporal order of words (Dumas & Hartman, 2003) and event sequences (Zacks, Speer, Vettel, & Jacoby, 2006). Recent work also suggests that older adults have difficulty binding event features, such as actors to actions, in memory for naturalistic activities (Kersten, Earles, Curtayne, & Lane, 2008). Maintaining and coordinating event representations likely relies on attentional control processes (Zacks et al., 2007); older adults tend to perform worse than younger adults on tasks relying on attentional control (McCabe, Roediger, McDaniel, Balota, & Hambrick, 2010), which may reflect a reduced capacity to maintain task and goal representations during ongoing task performance (Balota et al., 2000; Braver & Cohen, 2001; Miller & Cohen, 2001; Paxton, Barch, Racine, & Braver, 2008). Moreover, studies of task-switching tend to show that older adults have more difficulty shifting to new task sets than younger adults (e.g., Mayr, 2001), suggesting that older adults may have difficulty updating goal representations. This could well have significant practical consequences. A study in which older adults performed everyday activities, such as following a recipe to make pudding, found that successful performance correlated with measures of attentional control, working memory, and verbal memory (Baum, Edwards, Yonan, & Storandt, 1996). Thus, older adults may have difficulty perceiving hierarchical structure in activity.

A recent theory of event segmentation, called Event Segmentation Theory (EST), also suggests that older adults may have difficulty perceiving hierarchical structure in activity (Zacks et al., 2007). According to this account, people maintain working memory representations of the current event, called event models, which are updated at event boundaries. These are representations of “what is happening now,” and are informed both by what has happened so far, as well as information from event schemata stored in long-term memory. EST proposes that event models are maintained by structures in prefrontal cortex, and event model updating is signaled by the midbrain dopaminergic system; both of which decline in integrity and functioning with age (Fearnley & Lees, 1991; Kanne, Balota, Storandt, McKeel, & Morris, 1998; Raz, 2005; Raz et al., 1997; Rympa, Prabhakaran, Desmond, & Gabrieli, 2001; Sakata, Farooqui, & Prasad, 1992; West, 1996).

One recent study directly asked whether there are age-related differences in the ability to segment and remember everyday activity (Zacks et al., 2006). In their study, older and younger adults segmented movies of an actor engaged in activities, such as making a bed. Participants then performed a recognition task for what happened in the movie and an order memory task in which they sorted images from the movie into their proper sequence. In order to assess segmentation ability, the authors created a measure termed segmentation agreement. This assessed the extent to which each participant’s segmentation locations agreed with normative segmentation locations, as reflected by the event segmentation patterns of the entire sample. Given that there tends to be high within- and between-subject reliability in event segmentation locations (Newtson, 1976; Speer et al., 2003), if one’s segmentation pattern deviates from normative segmentation patterns, chances are that the event segmentation system is not performing optimally. Zacks et al. (2006) found that older adults had lower segmentation agreement than younger adults, suggesting that their segmentation was less normative than younger adults. Corresponding with lower segmentation agreement, older adults also showed worse recognition and order memory for the events; older adults with dementia of the Alzheimer type were further impaired on all three measures.

Does lower agreement with group norms on the locations of event boundaries indicate worse event understanding? On the one hand, segmenting in the appropriate places may impact how well events are encoded. If this is the case, then more normative segmentation should correlate with better memory. On the other hand, older adults could segment less normatively because they have more varied life experiences than younger adults. If so, one might predict that those with less normative event segmentation would have better memory, or at least that memory would be unrelated to segmentation agreement. Zacks et al. (2006) reported data supporting the first interpretation: Those older adults with higher segmentation agreement performed better on both the recognition and order memory tests. This effect was not merely due to differences in general cognitive functioning; the relation between segmentation and recognition memory held even after controlling for measures of memory, visuospatial, and executive functioning (Zacks et al., 2006).

Thus, there are theoretical reasons to propose that older adults will be less able than younger adults to identify the hierarchical structure of ongoing behavior, and there is evidence for age-related differences in some aspects of event segmentation. However, no studies to date have investigated age differences in the hierarchical segmentation of activity. Moreover, there is reason to doubt that older adults will have difficulty perceiving hierarchical structure in activity. Both semantic and schematic knowledge about events is generally preserved in older adults, with the former perhaps increasing with age (Rosen, Caplan, Sheesley, Rodriguez, & Grafman, 2003; Salthouse 2003). Additionally, studies of narrative comprehension have shown that older adults rely on mental models of the described events, situation models, just has heavily as younger adults (Radvansky & Dijkstra, 2007). Older adults, then, appear capable of generating and using event schemata during comprehension, and these schemata are hierarchically organized. Therefore it remains an important open question whether older adults will segment activity hierarchically, similar to younger adults.

Our study directly tested this hypothesis. Experiment 1 investigated age differences in hierarchical segmentation and event memory, and Experiment 2 expanded on this using a concurrent describing task in an attempt to improve older adults’ segmentation and memory.

Experiment 1

In Experiment 1, younger and older adults segmented one set of everyday activities into events and completed memory tests for a different set of events. We used different movies for the segmentation and memory tasks to reduce potential shared method variance between the two; this departs from the procedure of Zacks et al. (2006). Our main question was whether older adults would segment less hierarchically than younger adults. Our secondary aim was to examine whether individual differences in hierarchical segmentation would predict individual differences in memory.

Method

Participants

Forty-two younger adults (mean age 19 years, range 18–23 years) were recruited from the Washington University Psychology Department participant pool, whose members are mostly current students. Forty-three healthy older adults (mean age 77 years, range 60–89 years) were recruited from the Washington University Psychology Department’s Older Adults Volunteer Pool, whose members are mostly healthy community-dwelling adults not currently working full time or raising minor children. Older adults received $15 for participation, and younger adults had the option of receiving $15 or course credit.

Materials and tasks

Segmentation task and measures

Participants segmented two movies of everyday events taken from Zacks et al. (2006): a woman washing a car (duration 432 s), and a man building a boat out of toy blocks (duration 246 s). A third movie depicting a woman making a sandwich (duration 127 s) was used for practice. The movies did not contain any cuts and were shot from a fixed head-height perspective. Participants were asked to watch the movies and concurrently segment them into meaningful units of activity by pressing a button on a button box. Participants performed this task twice, once at each of two grain sizes. For fine segmentation, participants were told to mark off the smallest units of activity, and for coarse segmentation, they were told to mark off the largest units of activity. The movies were presented using Psyscope (Cohen, MacWhinney, Flatt, & Provost, 1993) on a Macintosh computer with a 24-inch display.

We computed three measures characterizing participants’ segmentation. Event length is simply the mean duration of events identified within each movie, calculated by dividing the movie duration by the number of identified events. Segmentation agreement is a measure of how well each participant’s segmentation agreed with that of the group (Zacks et al., 2006). For each movie, we created 1-second bins and recorded whether or not each participant indicated a boundary within that bin. Then, for each movie and grain, we computed the point-biserial correlation between each individual’s segmentation and the proportion of participants that segmented at each 1-second bin. The larger the correlation, the better the agreement with the group. Correlations were rescaled to control for individual differences in the number of boundaries identified.Footnote 1 Hierarchical alignment measures the mean temporal distance between each coarse event boundary and its nearest fine event boundary, relative to the distance expected by chance (see Zacks et al., 2001).Footnote 2 Larger scores indicate more hierarchical alignment. We used log-transformed distances to correct for positive skew (see Hard et al., 2006).

Memory tests and measures

Participants watched three movies for the memory tests: a man sorting and washing clothes (duration 300 s), a woman building a tent (duration 379 s), and a man planting flowers in a window box (duration 354 s). Movies were presented as for the segmentation task. Memory was tested using the methods described in Zacks et al. (2006). Immediately after viewing each movie, participants performed a recognition memory test followed by an order memory test. The recognition memory test was a 25-item two-alternative forced-choice test, with distracter items chosen from movies of the same actor in the same setting. Pairs of pictures were presented side-by-side on the computer screen, and participants responded by pressing a corresponding button on a button box. Order of presentation of the pairs, and placement of the images (left or right), were randomized separately for each participant. Response times greater or less than 3 standard deviations from each individual participant’s grand mean were identified as outliers and removed from the analysis, resulting in the removal of 1.7% of the total number of correct trials.

For each movie, order memory was tested with 12 visually distinctive images printed on 3 × 5 inch cards. Cards were presented in two rows of six on a table top, randomly ordered. Participants were instructed to arrange the images into the order in which they occurred in the movie. Completion time was measured with a timer. Order error was calculated from the absolute distance of each card’s ordinal location from the correct ordinal location (Zacks et al., 2006)

Design

This experiment was conducted as a 2 (age group: younger vs. older) x 2 (segmentation grain: fine vs. coarse) mixed design with age group as the between subjects factor and segmentation grain as the within subjects factor. The order of segmentation grain and order of movie presentation for both the segmentation and memory tasks were counterbalanced across participants. Within each participant, movie order was the same for fine and coarse segmentation.

Procedure

Participants first segmented the toy boat and car washing movies into coarse or fine events, depending on instructions. Participants practiced segmenting with the assigned grain on the sandwich-making movie. We used a shaping procedure to constrain individual differences in segmentation grain.Footnote 3 Once it was determined that participants understood the task they segmented the toy boat and car movies at the same grain size (either at a fine or coarse grain). They then performed the recognition and order memory tasks for the clothes-washing, planting, and tent movies. Finally, participants again performed the segmentation task, this time segmenting the toy boat and car washing movies at the grain other than the one used in the first segmentation phase, i.e., participants who had initially performed coarse segmentation now performed fine segmentation, and vice versa. As for the first segmentation phase, participants were trained on the new segmentation grain using the sandwich-making movie.

Results and discussion

Segmentation

The goals of these analyses were to assess age differences in (1) the degree of hierarchical segmentation, (2) the locations of participants’ event boundaries, and (3) the perceived duration of events. For all analyses involving the entire sample, age was treated as a continuous variable (Preacher, MacCallum, Rucker, & Nicewander, 2005); in some cases we followed these up with separate analyses for each age group, and for descriptive purposes we present group means.Footnote 4 Data for all analyses were collapsed across movie.Footnote 5 Table 1 presents mean segmentation scores by group (and grain where appropriate). Table 2 presents the results of the analyses.

Table 1 Experiment 1 segmentation and memory performance by age group (standard deviations in parentheses)
Table 2 Correlations between age and each segmentation, and memory measure, for Experiment 1

Hierarchical alignment

Both the younger and older adults showed significant hierarchical alignment; the alignment between coarse and fine boundaries was closer than that expected by chance (see Table 1; younger: t(39) = 17.33, p < 0.001, d = 2.74; older: t(39) = 13.51, p < 0.001, d = 2.14). Critically however, alignment scores were significantly lower for older than younger adults (see Table 2). This suggests that older adults segment events less hierarchically than younger adults.

Segmentation agreement

As can be seen in Tables 1 and 2, older adults had significantly lower fine and coarse agreement scores than younger adults. This replicates Zacks et al. (2006), and suggests that older adults segment events less normatively than younger adults.

Event lengths

Participants were able to modify their segmentation grain as instructed: Event lengths were longer for coarse than fine events (coarse: M = 36.3 s, SD = 17.7 s; fine: M = 13.0 s, SD = 6.9 s). In addition, older adults produced significantly longer fine event lengths than younger adults (see Tables 1 and 2).

Recognition memory and order memory

Table 1 presents mean memory scores by group, and Table 2 presents the results of the analyses.

Recognition accuracy and speed

Older adults had significantly lower recognition accuracy, and were significantly slower, than younger adults (see Tables 1 and 2).

Order errors and completion time

For the order errors analysis, one person was excluded for missing order error data for one movie, and for the completion time analysis one participant was excluded because of an outlying completion time. Older adults made significantly more order memory errors than younger adults and took significantly longer to complete the order memory task than younger adults (see Tables 1 and 2).

Relations between segmentation and memory for events

Recognition memory

As can be seen in Fig. 1, both segmentation measures were moderately associated with recognition memory. Within-group correlations between each of the segmentation measures and recognition memory were positive, but only one of the correlations was statistically significant (between hierarchical alignment and recognition memory for older adults). Hierarchical regression analyses did not find a significant relation between either segmentation measure and recognition after controlling for age (see Table 3).

Fig. 1
figure 1

Recognition memory accuracy as a function of segmentation agreement and alignment by age group for Experiment 1

Table 3 Experiment 1 hierarchical regression results predicting recognition and order memory accuracy from the segmentation measures, controlling for the main effect of age

Order memory

As can be seen in Fig. 2, order errors generally declined with increasing segmentation performance, but the relations were weak. Hierarchical regression analyses, conducted as above and presented in Table 3, revealed no significant effects of the segmentation measures or interactions with age.

Fig. 2
figure 2

Order memory error as a function of segmentation agreement and alignment by age group for Experiment 1

Relations between segmentation agreement and hierarchical alignment

Older adults had less hierarchical segmentation and also less normative segmentation than younger adults. One possibility is that older adults simply perform the segmentation task less reliably than younger adults. Greater within-participant noise would produce lower segmentation agreement and also lower hierarchical alignment. If lower reliability is the cause of the reduced alignment in older adults, then the age effect on alignment should be accounted for by segmentation agreement. For both age groups, those with more normative segmentation showed greater hierarchical alignment (younger: r(38) = 0.48, p < 0.01; older: r(38) = 0.55, p < 0.001). Using methods described by Baron and Kenny (1986), we tested whether segmentation agreement mediated the relation between age and hierarchical alignment. The above analyses indicate that age is significantly correlated with alignment and agreement. An additional regression analysis with both segmentation agreement and age entered simultaneously revealed that the age difference in hierarchical alignment was not significant after controlling for segmentation agreement, but that agreement significantly predicted alignment while controlling for age (effect of agreement: t(77) = 5.22, p < 0.001; effect of age: t(77) = -1.183, p = 0.241). A Sobel test (Sobel, 1982) confirmed that the relation between age and alignment was significantly reduced after controlling for agreement, z = -2.84, p = 0.004, indicating full mediation. This suggests that older adults segment less hierarchically in part because their segmentation is generally noisier.

In sum, there was strong evidence for age differences in hierarchical segmentation. This is the first evidence showing an age difference in the perception of hierarchical structure in everyday activity. In addition, compared to younger adults, older adults segmented less normatively and remembered the temporal order and visual contents of events less well. This replicates and extends previous findings on age-related differences in event segmentation and memory (Zacks et al., 2006). Lastly, there was modest but not overwhelming support for a relation between hierarchical segmentation and recognition memory.

Experiment 2

If older adults segment less well than younger adults, and if segmentation plays a causal role in determining later memory, then it may be that intervening to improve segmentation can be an effective means to improve memory for everyday events. In Experiment 2, we attempted to improve segmentation by coaching participants to describe ongoing activity as they segmented it. We hypothesized that concurrent description would improve segmentation by strengthening the influence of top-down knowledge on perceptual processing, and that this would lead to better memory. To preview, this was not the case: describing had little effect on segmentation and memory. The important findings from Experiment 2 were, first, a strong replication of the age differences in hierarchical segmentation and memory and, second, a replication of the relation between segmentation agreement and memory.

Participants segmented activity as in Experiment 1. However, in Experiment 2 one group of participants described the events as they segmented them. There are at least three possibilities regarding the effects of describing on age differences in segmentation. Concurrent describing may activate knowledge structures such as event schemata. Event schemata represent the hierarchical organization of activity (Bower et al., 1979; Rumelhart, 1977), and such knowledge may be preserved with age (Radvansky & Dijkstra, 2007). One possibility, consistent with the results of Experiment 1, is that older adults have an impairment in perceptual processing that makes it more difficult for them to access schematic knowledge about events and thereby segment hierarchically. This possibility would explain the fact that the age-related difference in hierarchical alignment was mediated by segmentation agreement. Also, given that older adults may have difficulty self-initiating cognitive processes (Luo & Craik, 2008), giving them a task that requires knowledge activation may improve performance, as we hypothesized. A second possibility is that concurrent describing is similar to simple think-aloud or introspection procedures in which the contents of working memory are reported while performing a cognitive task. It has been argued that such procedures have little effect on ongoing cognition (Ericsson & Simon, 1994). If so, then concurrent describing should have no effect on segmentation performance. A third possibility is that concurrent describing, being a demanding secondary task, may impose an additional cognitive load on participants and reduce performance. If so, one might expect this to differentially affect older adults (Riby, Perfect, & Stollery, 2004; Verhaeghen, Steitz, Sliwinski, & Cerella, 2003). Interestingly, the limited previous evidence on how describing influences segmentation is mixed. Zacks et al. (2001) found that concurrent describing increased hierarchical alignment, whereas Hard et al. (2006) found no effect of describing on alignment.

A second significant modification in Experiment 2 was that segmentation and memory were measured using the same movies. We separated the segmentation and memory tasks in Experiment 1 to reduce shared method variance between the tasks. Compared to what was reported in Zacks et al. (2006), it appears that this procedure may have slightly reduced the magnitude of the relation between segmentation and memory. However, we do not have a direct test of this possibility. We had participants perform both tasks on the same movies in Experiment 2 so that a direct comparison could be made to Experiment 1.

Method

Participants

Sixty-seven younger adults (mean age 20, range 18–22) and 63 healthy older adults (mean age 76 years, range 65–85 years) were recruited using the same procedures as in Experiment 1. Six younger adults and five older adults were excluded from analysis because they failed to complete the protocol (e.g., fell asleep, answered a cell phone). In addition, one older adult produced more coarse than fine boundaries for one of the movies, one older adult reported a diagnosis of memory impairment, and one older adult reported an inability to see out of both eyes; these participants also were excluded.

Materials and tasks

Stimuli

The three movies used in this experiment were the washing clothes, flower box, and tent movies from Experiment 1. As in Experiment 1, we used the sandwich movie for practice.

Segmentation task

The segmentation task was identical to that used in Experiment 1. Participants segmented each of the three critical movies twice, once at a fine grain and once at a coarse grain. The order of both segmentation grain and movie was counterbalanced across participants. Half of the participants were asked to describe, after each button-press, the unit that preceded the button-press (describe condition). Participants’ descriptions were recorded with a digital recorder using an external microphone. We used the same shaping procedure as in Experiment 1 to constrain segmentation grain.Footnote 6

Memory tests

The memory tasks and materials were identical to those used in Experiment 1, except that one item in the order memory task (the first picture for the tent movie) was replaced because it was placed correctly by all participants in Experiment 1 and thus had no variability. The new image was chosen from a point slightly later in the movie.

Design

This experiment was conducted as a 2 (segmentation grain: fine vs. coarse) x 2 (age group: younger vs. older) x 2 (describe condition: describe vs. silent) mixed design with segmentation grain as the within subject factor and age group and describe condition as between subjects factors. The order of segmentation grain and movie order was counterbalanced across participants. For each participant, the order of movie presentation was the same for fine and coarse segmentation.

Procedure

Similar to Experiment 1, this experiment proceeded in three phases; however, the order of occurrence was slightly different from Experiment 1. Participants first segmented the three movies twice (coarse and fine, with order counterbalanced). They then performed the recognition and order memory tasks for each of the three movies, with the recognition task always preceding the order memory task. Participants completed both memory tasks for each movie before moving to the memory tasks for the next movie.

Results and discussion

Computation of segmentation and memory measures, and exclusion of outliers, was the same as in Experiment 1. As in Experiment 1, age was treated as a continuous variable in the primary analyses; follow-up tests for each group were conducted to explore interactions involving age, and group means are presented for descriptive purposes.

Segmentation

The goals of these analyses were to investigate the effect of describing on (1) the degree of hierarchical segmentation for each age group, (2) segmentation agreement, and (3) the perceived duration of events. The ANOVA for the alignment analysis was conducted with describe condition as a between participants variable and age as a continuous predictor. Separate ANOVAs with describe condition as a between participants variable and age as a continuous predictor variable were conducted for the fine and coarse segmentation agreement scores and event lengths. Table 4 presents means and standard deviations for all segmentation and memory measures (averaged across movies). The results of the analyses are presented in Table 5.

Table 4 Experiment 2 segmentation and memory performance by describe condition and age group (standard deviations in parentheses)
Table 5 ANOVA results for the segmentation and memory measures for Experiment 2

Hierarchical alignment

One participant was excluded as an outlier. Both younger and older adults had significant alignment for each describe condition, smallest t(30) = 6.10, p < 0.001, d = 1.10 (see Table 4). However, older adults had significantly lower alignment scores than younger adults (see Table 5; older: M = 0.34, SD = 0.26; younger: M = 0.42, SD = 0.22), replicating the alignment results of Experiment 1. No other effects were significant.

Segmentation agreement

For both fine and coarse segmentation, older adults had significantly lower agreement scores than younger adults (see Tables 4 and 5), replicating the segmentation agreement results of Experiment 1. For fine segmentation, agreement scores were significantly higher for the silent condition than the describe condition (describe: M = 0.54, SD = 0.14; silent: M = 0.62, SD = 0.13). For coarse segmentation, age significantly interacted with describe condition such that for younger adults agreement scores while describing were higher than agreement scores while silent, t(65) = 2.51, p = 0.01, d = 0.61, whereas for older adults, agreement scores for the describing and silent conditions did not significantly differ from each other, t(61) = 1.67, p = 0.410, d = 0.42.

Event lengths

Participants were able to modify their grain of segmentation as instructed. Event lengths were longer for coarse than fine events (coarse: M = 41.1 s, SD = 18.9 s; fine: M = 13.7 s, SD = 8.0 s). Older adults segmented fine events into significantly longer units than did younger adults (see Tables 4 and 5), which replicates the event length results of Experiment 1. Fine event lengths were marginally longer for the describe condition than the silent condition (describe: M = 15.08 s, SD = 7.93 s; silent: M = 12.44 s, SD = 7.95 s). There were no other effects of describe condition on event lengths.

Recognition memory and order memory

The goals of these analyses were to assess the effect of describing on recognition and order memory performance for each age group. Means and standard deviations for each condition (collapsed across movie) are presented in Table 4, and results of analyses in Table 5.

Recognition accuracy and speed

Two participants were identified as outliers for recognition speed and removed from the recognition speed analyses.Footnote 7 Older adults had significantly lower recognition accuracy than younger adults (older: M = 0.83, SD = 0.07; younger: M = 0.91, SD = 0.05) and significantly longer response times (older: M = 6.6 s, SD = 2.4 s; younger: M = 3.8 s, SD = 1.1 s), as can be seen Tables 4 and 5. No other effects for recognition accuracy were significant.

Order errors and completion time

Due to experimenter error, three participants were missing order error data and two participants were missing completion time data for one of the movies. These participants were excluded from the order errors and completion time analyses. Older adults committed significantly more order errors and had longer completion times than younger adults (see Tables 4 and 5), replicating the memory results of Experiment 1 and Zacks et al. (2006). There were no other significant effects for completion times.

Relations between segmentation and memory

Recognition memory

Recognition performance for each age group and describe condition is plotted as a function of segmentation agreement and alignment in Fig. 3. Effects of age and segmentation on recognition accuracy were analyzed with hierarchical regression, and the results are reported in Table 6. For agreement, there was a significant main effect of agreement, and a significant agreement-by-age interaction. For older adults, increases in agreement were associated with increases in recognition accuracy, r(61) = 0.40, p = 0.001, whereas for younger adults, agreement did not reliably predict recognition r(65) = -0.16, p = 0.23. For alignment, there was no significant main effect, and no significant interaction with age, but there was a significant alignment-by-describe condition interaction. For the silent condition, increases in alignment were marginally associated with increases in recognition accuracy, r(63) = 0.22, p = 0.08, whereas for the describe condition, alignment did not reliably predict recognition accuracy, r(62) = -0.03, p = 0.84. The three-way interaction among alignment, age, and describe condition was not significant.

Fig. 3
figure 3

Recognition memory accuracy as a function of segmentation agreement and alignment by age group and describe condition for Experiment 2

Table 6 Experiment 2 hierarchical regression results predicting recognition and order memory accuracy from the segmentation measures, controlling for the main effects of age and describe condition, and their interaction

We note that, compared to the younger adults in Experiment 1, those in Experiment 2 had higher recognition memory scores, and these scores were not strongly related to the segmentation measures. A likely possibility is that this ceiling effect came about because in Experiment 2 participants viewed and segmented the movies twice before performing the recognition test.

Order memory

Order memory performance for each age group and describe condition is plotted as a function of segmentation agreement and alignment in Fig. 4. We followed the same 3-step hierarchical regression procedure as above to predict order memory errors, the results of which are presented in Table 6. Except for a marginally significant main effect of segmentation agreement, none of the hierarchical regressions indicated significant effects for the segmentation measures or their higher-order interactions with age or describe condition.

Fig. 4
figure 4

Order memory errors as a function of segmentation agreement and alignment by age group and describe condition for Experiment 2

In sum, older adults segmented events less hierarchically than younger adults, replicating the main result from Experiment 1. Older adults also segmented events less normatively and remembered them less well than younger adults, replicating Experiment 1. In addition, describing had little effect on segmentation and memory performance. In the single case in which it did have an effect, it improved coarse segmentation agreement for younger adults.

Experiment 2 also provided further evidence that segmentation performance is related to later memory. Older adults’ recognition scores were correlated with segmentation agreement, replicating Experiment 2. Hierarchical alignment again predicted older adults’ recognition memory, but only when not describing the activity during encoding.

We failed to find evidence that describing activity while segmenting improved older adults’ memory. This supports the possibility that the describe task does not encourage the use of knowledge when constructing event representations. Similar arguments have been made regarding the effects of think-aloud and introspection tasks on language comprehension (Ericsson & Simon, 1994). We return to the issue of interventions for segmentation in the general discussion.

Finally, Experiment 2 allowed us to ask whether the relation between segmentation and memory is at least partially driven by shared method variance between the segmentation and memory tasks. In this experiment, participants performed the segmentation and memory tasks on the same movies. In this design, it is possible that correlations between segmentation and memory could emerge either as a result of stable individual differences, or as a result of subject-by-item interactions or moment-to-moment fluctuations in cognitive performance. Experiment 1 separated the two tasks with two different movie sets, reducing the potential contribution of factors other than stable individual differences. The magnitude of the first order correlation between segmentation agreement and recognition memory for older adults was numerically larger for Experiment 2 (silent condition) than Experiment 1. However, a linear regression predicting recognition memory accuracy, controlling for the main effects of agreement and experiment, showed no interaction between experiment and agreement, F(1, 68) = 1.25, p = 0.268, ΔR 2 = 0.02. An analogous analysis for alignment (silent condition only for Experiment 2) also shows no interaction between experiment and alignment, F(1, 68) < 1, p = 0.904, ΔR 2 = 0.00. These results suggest that the correlation between segmentation ability and memory reflects a stable attribute of a person’s cognition, rather than attributes specific to the interaction of that person with a particular stimulus.

General discussion

In this study, we found evidence for age-related differences in the ability to perceive hierarchical structure in events. In both experiments, older adults segmented events less hierarchically than younger adults. In addition, older adults’ segmentation agreed less well with group norms, for both coarse and fine events. Segmentation was related to memory performance, particularly for older adults. This effect was most strong in the segmentation agreement measures, although occasionally hierarchical alignment predicted memory. These results, as well as those presented in Zacks et al. (2006), suggest that older adults may have difficulty in identifying meaningful units of behavior in everyday activities, and that this difficulty is significant for event encoding.

Age differences in event segmentation

Our results are consistent with the possibility that aging may affect how well one can encode and maintain event representations. Previous evidence suggests that older adults’ episodic memory for events is impaired (Balota et al., 2000; Zacks et al., 2006), as well as attentional processes related to goal maintenance and updating (Mayr, 2001; McCabe et al., 2010). The corresponding age-related change in event perception ability appears to manifest itself in at least two ways, less ability to segment events into normative units and less ability to perceive hierarchical structure in activity. These changes are consistent with EST, a recent theory of event segmentation (Zacks et al., 2007). This theory proposes that the maintenance of event models is subserved by regions in the lateral PFC, and signals for event model updating are, at least partially, subserved by the midbrain dopaminergic system. Age is associated with declines in structural and functional integrity in the PFC (Kanne et al., 1998; Raz et al., 1997; Rympa et al., 2001; West, 1996), and reductions in dopamine functioning (Fearnley & Lees, 1991; Sakata et al., 1992). Previous work is consistent with the possibility that these brain regions are important for adaptive event segmentation. Zalla and colleagues have shown selective decreases in coarse segmentation performance in individuals with PFC lesions and individuals with schizophrenia (Zalla et al., 2003, 2004), which also affects the functioning of the PFC and dopaminergic functioning (Braver et al., 1999).

The evidence suggests that older adults have a reduced ability to perceive how a series of fine events are grouped by higher-level units.Footnote 8 Based on work on aging and situation model construction, this result may be a bit unexpected. Studies examining narrative comprehension between older and younger adults suggest that younger and older adults rely equally on situation models during text understanding (Radvansky & Dijkstra, 2007). Critically, however, these studies have not assessed the quality of situation models between age groups. It is possible that how strongly one relies on event models is unrelated, or weakly related, to their quality. The present data suggest that although older adults may rely on schematic knowledge about event structure for memory encoding and retrieval, they are less able to perceptually identify that structure during on-line comprehension.

Recently, researchers have called attention to the fact that older adults are more variable in their performance across items than younger adults, and noted that this may explain some age-related differences in cognition (Hultsch, Strauss, Hunter, & MacDonald, 2008). Intra-individual variability certainly could affect the hierarchical alignment measure, which assesses segmentation across viewings. To assess whether age-differences in reliability was related to the observed differences in segmentation performance, we computed Cronbach’s alphas between the two movies in Experiment 1 and the three movies for Experiment 2. Interestingly, alphas were consistently higher for older adults than younger adults for both hierarchical alignment (Experiment 1: older = 0.37, younger = 0.25; Experiment 2: older = 0.71, younger = 0.22), and segmentation agreement (Experiment 1: older = 0.70, younger = 0.50; Experiment 2: older = 0.90, younger = 0.80). Thus, differences in reliability cannot account for the age differences in segmentation reported here.

In both experiments, older adults tended to identify fewer fine events than younger adults. One possibility is that older adults may fail to recognize some of the subunits that make up a larger activity. However, another possibility is that this difference is specific to the segmentation task used here; perhaps older and younger adults interpret differently the instructions to mark off the smallest or largest meaningful units. The shaping procedure we used to reduce extraneous variance in segmentation grain did not, however, completely eliminate these group differences. This provides weak evidence in favor of the first possibility over the second. In future research, it will be important to test whether this result is robust across tasks and experimental procedures in order to establish whether it reflects a true group difference in the mechanisms of event segmentation.

Improving event segmentation to improve memory

In this study, older adults whose segmentation was more normative and more hierarchical tended to remember the events better (see also Zacks et al., 2006). However, these effects were not large, nor perfectly consistent. Although modest, it is important to note that previous work has demonstrated that segmentation and recognition memory are uniquely related (Zacks et al., 2006). One possibility is that recall measures would provide more opportunity to observe relations between memory and segmentation—particularly the hierarchical organization of segmentation. In free recall tasks participants cluster information based on semantic, taxonomic, source, and temporal features, among others (Bower, Clark, Lesgold, & Winzenz, 1969; Hintzman, Block, & Inskeep, 1972; Kahana, 1996; Murdock & Walker, 1969; Puff, 1979). Some recent work has suggested that older adults cluster information in a free recall test to a lesser extent than younger adults (Taconnat, Raz, Bouazzaoui, Sauzeon, & Isingrini, 2009; but see Rankin, Karol, & Tuten, 1984). Most research on memory hierarchies for events has used free recall tasks (e.g., Bower et al., 1979; Brewer & Dupree, 1983; Lichtenstein & Brewer, 1980). These tasks may be particularly sensitive to the nature of event structures built during the perceptual experience of ongoing activities (Brewer & Dupree, 1983).

We assessed whether concurrent describing during segmentation would influence segmentation and memory performance, and whether it could help alleviate age differences in performance. The mixed previous results on the effects of describing on event segmentation left this an open question (Hard et al., 2006; Zacks et al., 2001). In our study, describing while segmenting had a limited effect on segmentation and no effect on memory. Describing reduced fine segmentation agreement for all participants and it increased coarse agreement for younger adults. This suggests that describing is not effective for encoding fine-grained units of activity, but leaves open the possibility that stronger manipulations of one’s attention to one’s coarse level segmentation may facilitate adaptive encoding for memory. In particular, attending to coarse level segmentation may help engage relevant knowledge structures (Zacks, 2004; Zacks, Kumar, Abrams, Mehta, 2009), which may in turn help activate the right information at the right time. Knowledge structures about events have been shown to play a major role in action planning (Grafman, 1995; Hommel, 2006) and memory for events (Abelson, 1981; Bower et al., 1979), and to allow for effective understanding and memory in older adults (Radvansky & Dijkstra, 2007). Knowledge structures for events prominently represent information about actors’ goals and their relations, which is related to the part–subpart structure of activity. Recent work has suggested that event knowledge and inferences about goals are more likely to influence coarse than fine event segmentation (Zacks, 2004; Zacks, Speer and Reynolds 2009), which is consistent with the finding that when describing had a positive effect, it was on coarse segmentation. However, it is important to keep in mind that older adults received no benefit from describing.

Other interventions may affect attention to segmentation more powerfully than the description task used here. For example, Boltz (1992) used commercial placement in film to highlight event boundaries. Participants who watched films that had commercials placed at event boundaries recalled the films better than those who watched the films that had commercials inserted within events, and better than those who watched films without commercials. Another possibility is that the segmentation task itself may be an effective memory encoding strategy. In our studies participants always segmented while viewing the movies. In future research it would be valuable to compare segmentation to viewing without segmentation. However, to maximize potential benefits of attending to segmentation, it may be important to provide assistance in segmenting well.

Conclusion

As people age they report problems tracking everyday activities, such as difficulty with learning how to use new tools and following conversations and television programs (Galvin et al., 2005). These seemingly simple tasks require a complex process of parsing and organizing a set of actions and events into meaningful parts and subparts. The results from the current study suggest that older adults may perceive less hierarchical structure in the activity units that make up these tasks. Interventions that support more effective event segmentation may prove helpful in addressing these concerns.