Introduction

Monitoring and evaluating outcomes of actions (e.g., reward or benefit of an action or payoffs of a choice) is essential for adaptive behavior. A large body of work has demonstrated that organisms are sensitive to reward distributions in the environment (e.g., Montague, Dayan, Person, & Sejnowski, 1995; Shafir, Wiegmann, Smith, & Real, 1999) and that reward guides learning, influences decision making, and directs action selection (Kurniawan, Guitart-Masip, & Dolan, 2011; Montague, Hyman, & Cohen, 2004; Niv & Schoenbaum, 2008). However, only recently has research started to address the question of whether reward already influences earlier and, presumably, more “automatic” perceptual and attentional orienting or alerting processes. Emerging evidence from this line of work indicates that reward can guide attention relatively early during visual processing, thereby affecting the perception and selection of visual stimuli in the environment (Anderson, Laurent, & Yantis, 2011; Della Libera & Chelazzi, 2006; Hickey, Chelazzi, & Theeuwes, 2010; Kiss, Driver, & Eimer, 2009; Raymond & O’Brien, 2009; Serences, 2008). These interactions between reward and early attentional processes may affect orienting behavior but may also play a critical role during foraging and economic decision making (Robertson, Watamura, & Wilbourn, 2013; Towal, Mormann, & Koch, 2013; Wolfe, 2013).

The mesolimbic and frontal-striatal dopaminergic pathways are critically involved in reward processing (Cools, 2011; D’Ardenne, McClure, Nystrom, & Cohen, 2008; Schultz, Dayan, & Montague, 1997; Tobler, Fiorillo, & Schultz, 2005). The efficacy of dopaminergic, as well as other catecholaminergic, neuromodulation undergoes maturation and senescence in the course of development across the lifespan. There is ample evidence for aging-related declines of pre- and postsynaptic markers of the dopamine system, which have negative consequences on a variety of cognitive functions in older adults (for reviews, see Bäckman Nyberg, Lindenberger, Li, & Farde, 2006; Li, Lindenberger, & Bäckman, 2010; for a neurocomputational integration, see Li, Lindenberger, & Sikström, 2001). In children, there is evidence that adultlike subcortical dopamine levels are reached during middle childhood, whereas dopamine receptor function shows a more gradual developmental trajectory (Baddeley & Diamond, 1996; Diamond, 2007; Haycock et al., 2003). Given these lifespan age differences in dopaminergic neuromodulation, it has been suggested that the sensitivity to reward information and the ability to use reward information for learning, outcome monitoring, or decision making may change across the lifespan (Eppinger, Hämmerer, & Li, 2011; Hämmerer & Eppinger, 2012; Mohr, Li, & Heekeren, 2010). Indeed, children and adolescents often react more strongly to reward information than do younger adults (Crone, Jennings, & Van der Molen, 2004; Eppinger, Mock, & Kray, 2009; van Leijenhorst, Crone, & Bunge, 2006), and older adults are often found to be less sensitive in processing reward information (Eppinger, Kray, Mock, & Mecklinger, 2008; Eppinger, Nystrom, & Cohen, 2012; Hämmerer, Li, Müller, & Lindenberger, 2011; Samanez-Larkin et al., 2011).

In addition to the age-related differences in reward processing, the ability to selectively attend to relevant information changes substantially across the lifespan. During maturation, children and adolescents show deficits in selective attention (Hommel, Li, & Li, 2004; Passow et al., 2013; Rueda et al., 2004; Waszak, Li, & Hommel, 2010). In the course of healthy aging, the ability to selectively focus on task-relevant information declines (Hasher & Zacks, 1988; Passow et al., 2012; Passow et al., 2014; Störmer, Li, Heekeren, & Lindenberger, 2013; see Guerreiro, Murphy, & Van Gerven, 2010, for a review). These changes in selective attention and reward processing across the lifespan give rise to important questions that are yet unanswered: Can reward value modulate attentional selection across the lifespan? What are the cognitive and motivational mechanisms that mediate the effects of reward value on selective attention across different age groups? To what extent may the effect of reward modulation of selective attention differ between age groups?

In the study reported here, we investigated how reward value affects visual attention across the lifespan. Children, adolescents, younger adults, and older adults performed a visual search task in which the target colors were associated with either high or low reward values. Importantly, the feature carrying the reward information (i.e., colors of the target stimuli) was manipulated orthogonally to the goal of the visual search task and was, therefore, task irrelevant. We employed three measures to examine whether reward value affects stimulus selection. First, we computed median response times (RTs) for high- and low-reward target stimuli, separately for the four age groups. Shorter RTs for high-value targets would indicate that high reward value speeds up attentional selection, rendering search more efficient (cf. Duncan & Humphreys, 1989; Wolfe, 1994). Second, for each participant, we computed a reward RT benefit score by taking the differences between RTs from high and low reward value trials and then dividing the differences by individual mean RT of the low-reward trials. By taking this ratio, we control for age and individual differences in baseline RT, and thus this reward score allows us to examine the reduction of RT due to high reward that is proportional for a given individual’s baseline RT. Third, as an index of response consistency, we computed within-person trial-by-trial RT variability for high- and low-reward target stimuli. High within-person performance variability has been associated with deficits in dopaminergic neuromodulation (MacDonald, Li, & Bäckman, 2009a; MacDonald et al., 2012; Papenberg et al., 2011; Papenberg et al., 2013; for neurocomputational simulations, see Li et al., 2001; Li, von Oertzen, & Lindenberger, 2006; ). Reduced RT variability for high-value targets could indicate that higher reward value enhances representational distinctiveness (see, e.g., the simulated effects in Eppinger et al.,2011) and, consequently, reduces decision noise, as reflected in RT variability. Furthermore, in a second task, participants were asked to maintain geometrical shapes in working memory while concurrently performing the reward-based visual search task. This task was included to explore how increased working memory load influences the formation of stimulus–reward associations across the different age groups. Lastly, to explore whether the participants were aware of the associations between reward value and target color, we asked participants about their subjective color preferences in a postexperiment rating.

To foreshadow our results, we found that high reward value significantly speeded up visual search times in all four age groups. Planned follow-up contrast analysis revealed that younger adults showed more sensitivity to reward value, as compared with all other age groups. Furthermore, there were clear age differences in response consistency for low- and high-reward target stimuli: Only younger adults benefited from high reward value to respond more consistently. The additional working memory task eliminated any reward-based benefits in the search task across all age groups.

Method

Participants

A total of 109 healthy right-handed volunteers participated in the study. All participants gave written informed consent and received monetary compensation for the study (minimum of 17 Euros). The ethics committee of the Max Planck Institute for Human Development approved the study. Close visual acuity was assessed prior to the experiment by having participants read digits at regular reading distance (~30 cm) binocularly; color vision was assessed using a set of six color panels (Velhagen & Broschman, 2005). All measurements were taken with optimal ocular correction. Data from 3 children and 5 older adults had to be excluded because of low color vision and/or low visual acuity. Thus, the final sample contained 26 children (8–11 years of age; mean age, 10 years; 14 female), 25 adolescents (14–16 years of age; mean age, 15 years; 14 female), 25 younger adults (20–29 years of age; mean age, 25 years; 14 female), and 25 older adults (62–75 years of age; mean age, 67 years; 14 female). These participants were all within the normal range of close vision (v > 0.5; cf. Cline, Hofstetter, & Griffin, 1997) and could report at least five out of six color panels correctly. Participants were assessed on psychometric tests of semantic knowledge (Lehrl, 1977) and perceptual speed (Wechsler, 1958). Consistent with previous findings (cf. Li et al., 2004), older adults attained lower scores in perceptual speed and higher scores in semantic knowledge than younger adults and adolescentsFootnote 1 (Table 1).

Table 1 Demographic and basic cognitive characteristics of sample

Stimuli and procedure

The testing session took place in age-homogeneous groups of 3–5 individuals and took approximately 2 h. Participants first completed computerized tests of perceptual speed (identical pictures) and semantic knowledge (spot-a-word). Then, participants performed task 1. After a short break of about 10–15 min, participants performed task 2. Participants were paid contingent upon performance in the two experiments: They received a fixed amount of three Euros as a start value and earned additional compensatory money during the experiments (1 or 5 cents for each correct trial). This payment protocol resulted in a final amount of 17.00–20.28 Euros. At the end of the session, participants completed a paper-and-pencil biographical questionnaire. Finally, participants were asked about their color preferences using a computer-based rating task.

Stimuli

Stimuli were presented on a 19-in. computer display (1,024 × 768, 85 Hz) using the Psychophysics Toolbox (Brainard, 1997) extension for MATLAB (The MathWorks). Each display consisted of eight colored letter stimuli (see Fig. 1a). The stimuli were presented on a gray background (10.5 cd/m2) within 9.5° × 15° rectangular regions that were centered to the left and right of the vertical midline. The stimuli were presented at random locations across the visual field, with the constraint that four stimuli appeared in each visual hemifield, with a minimum distance of 1.5° (border to border). Each display contained a “T” as a target letter (1.2° length, 0.4° width) that was tilted to the left or right side. Each display also contained seven distractor letters (the letter “L,” 1.2° length, 0.4° width). On half of the trials, the target letter “T” was presented in the left visual field, and on the remaining half of the trials, the target was presented in the right visual field. Similarly, the target was tilted to the left side on half of the trials and tilted to the right side on the remaining half of the trials. For each experimental block, nine new colors were randomly chosen from the CIElab color space with a distance of 40° on the color wheel. In each experimental block, two colors on the opposite sides of the color distribution were assigned to the target letter T (e.g., green and red; see Fig. 1a), and the remaining seven colors were randomly assigned to the distractor letters. That is, each of the distractors was presented in a distinct color, and the color of the target letter varied between the two chosen target colors from trial to trial in a given block (e.g., green and red; see Fig. 1a). The two target colors were distinctly associated with high and low reward values for correct responses (e.g., red with low reward and green with high reward; see the Procedure section for details). The association between reward value and target color was kept constant throughout each experimental block, but a unique set of colors was used for every experimental block. Importantly, a new set of nine colors was chosen on every block, such that high and low rewards were associated with unique colors that did not repeat across the experimental blocks. Altogether, six blocks were presented; thus, there were overall six high-reward colors and six low-reward colors across the entire first task. The target color varied randomly from trial to trial. The same set of colors was used for all participants; however, high and low reward value colors were counterbalanced between participants of each age group.

Fig. 1
figure 1

Example trial sequences. a Example trial structure for the reward-based visual search task (task 1) for a high (left) and low (right) reward value trial. Correct responses were rewarded with 5 cents or 1 cent, respectively. The reward values were linked to the target colors, but participants did not know the color of the target prior to the search display. The target colors remained the same within an experimental block but varied across blocks (altogether, six blocks were presented). b Example trial structure for the second task that included a memory task in addition to the reward-based visual search task (task 2). Participants were asked to remember the shapes that were presented along the vertical meridian for two search trials and then to indicate whether a test display matched the shapes presented in the memory display (shown here is a no-match case with set size 3). c Three example displays of the preference rating at the end of the experimental session. Participants were instructed to choose which color they liked better by pressing a left or right response button (the colors were target colors that were either high or low values during the six visual search blocks)

Procedure

Task 1: Reward-based visual search

Participants were seated approximately 70 cm in front of the computer display. Each trial started with the presentation of a central fixation cross (black; 0.45° × 0.45°) for 1 s, which was followed by a search array. Participants had to find the T in each display and report its orientation (left or right) with a buttonpress (left and right control keys). The search array disappeared when a response was registered or after a preset search interval was reached. Subsequently, a feedback screen was presented for 1 s. After a correct response, the feedback screen informed the participant of the monetary reward and displayed either “+1 cent” or “+5 cents” (see example trial structure in Fig. 1a). The reward value depended on the target color on that particular trial. Within each block, one target color was associated with a high reward value (+5 cents), and the other target color was associated with a low reward value (+1 cent). Participants were not informed about the association between target color and reward value. If the response was incorrect, no feedback was provided, and a plain gray screen was presented for 1 s. If the participant failed to respond in the preset time interval, the words “too slow” appeared on the screen, indicating a time-out trial. The preset search interval was implemented to enforce speeded responses. For each participant, it started with a fixed time of 1.5 s. To adjust for age differences in processing speed (see Table 1), the search interval was adjusted individually on the basis of the cumulative percentage of time-out trials (cf. Eppinger et al., 2008; Eppinger et al., 2009). When the cumulative percentage of time-outs exceeded 2%, search time was added in steps of 150 ms for every additional 2% of time-outs. Likewise, when time-outs decreased, search time was reduced in 150-ms steps. The intertrial interval was 1–1.5 s (jittered). Every 16 trials, a screen appeared that informed participants about their earnings (i.e., “You earned XX Euros out of possible XX Euros”). All trial types were randomly intermixed, but the same randomized sequence was used for each age group. The first task consisted of six blocks of 64 trials each. Prior to the task, participants completed 8 practice trials with no monetary reward.

Task 2: Reward-based visual search with memory load

The second task followed the same basic design as task 1. In addition to the visual search task, participants were asked to perform a short-term memory task. Every second search trial, a memory display that contained a set of shape outlines appeared for 4 s (Fig. 1b). The shapes were presented along the vertical meridian. On one third of the trials, three shapes were presented (set size 3); on another third of the trials, four shapes were presented (set size 4); and on the remaining third of the trials, five shapes were presented (set size 5). The shapes included the outer contours (black) of a circle (~1.5° × 1.5°), a square (1.5° × 1.5°), two rectangles oriented vertically or horizontally (0.9° × 2.3° or 2.3° × 0.9°), and two ovals oriented vertically or horizontally (~0.9° × 2.3° or ~2.3° × 0.9°). Shapes were chosen randomly on each trial, but no shape appeared twice in each memory display. Participants were instructed to remember the shapes and their locations. After completing two search trials, a memory test display was presented. On half of the trials, the test display matched the memory array presented before; on the remaining half of the trials, the test array differed from the memory array. The items on the test display could differ from the items on the memory display in their identity, location, or both. Participants had to indicate whether the test array matched the memory array or not by pressing one of two response buttons. To ensure the same maintenance period for each individual, the intertrial interval of the search task was adjusted individually, such that each trial lasted 3.3 s for each participant, regardless of the RTs on the search trials. Thus, the overall maintenance period lasted for about 6.6 s. All trial types were randomly intermixed, but the same randomized sequence was used for each age group. Task 2 consisted of three blocks of 64 search trials and 32 memory trials each.

Color preference rating

Participants’ color preferences were acquired in the following way. High and low reward value colors were paired randomly and presented simultaneously, one color on the left and the other color on the right side of a computer screen (each colored square: 2° × 2°; see Fig. 1c). Each color was presented twice, each time in a new randomly chosen pair. Side of presentation (left/right) was counterbalanced. Participants were instructed to press the response button that corresponded to the color they liked best (i.e., left/right control keys on a keyboard). It was emphasized that there were no correct responses.

Data analysis

Task 1

To examine how reward value influences search behavior, median RTs from correct trials and RT variability were computed separately for each reward value condition (high vs. low) and age group (children, adolescents, younger adults, older adults). Furthermore, a reward RT benefit score was computed by taking the differences between high and low reward value trials and then dividing the differences by individual mean RT on low-reward trials {[RT(low reward) − RT(high reward)]/RT(low reward)}. To obtain a variability measure independent of mean RT differences between the age groups, we computed the coefficient of variation (CV) as an index of processing variability (cf. Papenberg et al., 2011). The CV is a normalized measure of variability. It takes into account given RT baseline differences and, thus, informs about the relative differences in variations given a certain baseline RT in each age group. The CV is defined as the ratio of the standard deviation of RT to the mean RT. Dependent measures were analyzed using a repeated measures analysis of variance (ANOVA) with age group as a between-subjects factor and reward value as a within-subjects factor. When reliable age effects were ascertained, follow-up analyses were performed separately for each age group. Greenhouse–Geisser corrected p-values were applied when appropriate.

Task 2

Short-term memory accuracy (percent correct) was computed separately for each age group and set size condition and was analyzed by means of an ANOVA with age group as a between-subjects factor and memory load as a within-subjects factor. As for task 1, search median RT and search RT variability were analyzed using repeated measures ANOVAs with age group as a between-subjects factor and reward value and memory load as within-subjects factors.

Results

Task 1. Reward-based visual search

As was expected, participants’ accuracy was, overall, high in the search task (between 84% and 95% correct; see Supplementary Material for details). Participants responded faster on trials on which the target color was associated with high reward value, as compared with trials on which the target color was associated with low reward value, F(3, 97) = 53.01, p < .0001, η 2 = .20 (Fig. 2a). Furthermore, we observed a main effect of age group on RT, F(3, 97) = 49.46, p < .0001, η 2 = .58. As was expected, older adults responded slowest, followed by children, adolescents, and younger adults. All pairwise comparisons were significant (all ps < .006). An age group × reward value interaction indicated age differences in the reward value effect, F(3, 97) = 2.81, p = .044, η 2 = .02. Follow-up analyses revealed that each age group showed a value effect on RT but that the magnitude of the effect differed numerically between the age groups [children, 27 ms (±3.72), t(25) = 3.12, p = .005, η 2 = .28; adolescents, 18.7 ms (±2.6), t(24) = 3.68, p = .001. η 2 = .36; younger adults, 48 ms (±4.7), t(24) = 5.78, p < .0001, η 2 = .58; older adults, 58.6 ms (±9.7), t(25) = 3.43, p = .002, η 2 = .33]. These differences between conditions across the age groups point to the largest effect size in younger adults (η 2 = .58). To follow up the age group × reward value interaction in more detail and to control for age and individual differences in baseline RT, a reward RT benefit score was computed separately for each individual by dividing the RT difference between the high and low reward value by the individual mean RT for low reward value. Thus, this reward RT benefit score reflects the reduction of RT due to high reward that is proportional for a given participant’s baseline RT. In light of the numerical differences between the age groups, we conducted planned contrast analyses to test the assumption that younger adults showed the largest effects of reward value on RT. We first compared the RT benefit score separately between children, adolescents, and older adults and found no differences (all ps > .22). After establishing this, we contrasted the group of younger adults against all other three groups ([3, −1, −1, −1]) and found that the magnitude of the RT benefit scores was larger for younger adults, relative to all other groups, F(3, 97) = 4.84, p = .03). Thus, younger adults were more sensitive to reward value than were children, adolescents, and older adults.

Fig. 2
figure 2

Behavioral results from task 1. Median reaction times (a) and coefficients of variations (b) for high (solid) and low (striped) reward value trials, separately for children, adolescents, younger adults, and older adults. Error bars denote within-subject errors of the means

Finally, we also plotted at what rate the effect of reward evolved throughout the experiment by computing the reward value RT benefit across trials in each block. Sixteen subsequent trials of each block were combined across all six experimental blocks, revealing four separate bins. As is shown in Fig. 3, the temporal trajectories of the RT value benefit evolved gradually over time across all age groups. Statistical analysis confirmed a main effect of bin, F(3, 9) = 34.50, p < .001, η 2 = .16, but the age group × bin number interaction was not significant (p > .15). Visual inspection of the trajectories suggests that younger and older adults show a similarly steep increase of the reward effects initially, until older adults reach an asymptote at about 30 trials into the block; in contrast, younger adults’ RT benefit further increases later in the block. Children and adolescents appear to show a less steep initial increase of RT benefits, as compared with younger and older adults. On the basis of these observations, we selectively compared the initial mean slopes (bin 1 with bin 2) between children versus adolescents and younger versus older adults, showing no differences between these pairwise comparisons (all ps > .26). Then we collapsed across children and adolescents and contrasted their mean slopes against the mean slopes of younger and older adults. This analysis revealed that the initial mean slopes (bin 1 to bin 2) of younger and older adults significantly deviated from the mean slopes of children and adolescents, F(1, 99) = 4.59, p = .035, η 2 = .18.

Fig. 3
figure 3

Temporal trajectories of the reward effect [(low reward value − high reward value)/low reward value] on response time in task 1. Sixteen subsequent trials of each block are combined across all six blocks. The temporal trajectories show a gradual increase of the reward effect over time for all age groups

To test whether reward value affects within-person variability, we examined the individual RT variability (CV) separately for high and low reward value trials. The analysis revealed a main effect of age group, F(3, 97)=10.65, p < .0001, η 2 = .19, a main effect of reward value, F(3, 97) = 9.56, p = .003, η 2 = .23, and an age group × reward value interaction, F(3, 97) = 2.96, p = .031, η 2 = .11. As can be seen in Fig 2b, only younger adults had lower response variability for high-value trials, relative to low-value trials [younger adults, 4.2% (±0.25), t(24) = 3.59, p = .002, η 2 = .35], whereas all other age groups did not [children, 0.96% (±0.28), t(25) = 1.71, p = .10 η 2 = .10; adolescents, −0.2% (±0.17), t(24) = 0.528, p = .602 η 2 = .01; older adults, 0.17% (±0.19), t(24) = 0.46, p = .646, η 2 = .01]. In sum, the results indicate that higher reward value speeded search times in all age groups; this effect was most pronounced in younger adults; furthermore, reward value reduced within-person variability in younger adults only.

Task 2. Reward-based visual search with memory load

Short-term memory task

Older adults and children had lower accuracy rates, relative to younger adults and adolescents (Table 2). We observed a main effect of age group, F(3, 97) = 19.61, p < .0001, η 2 = .22, and a main effect of working memory load, F(3, 97) = 69.26, p < .0001, η 2 = .17, but no age group × load interaction, F(3, 97) = 0.55, p = .73, η 2 = .004. To examine at which level the effect of working memory emerged, pairwise comparisons were conducted for the different set size conditions. Working memory accuracy dropped significantly from set size 3 to set size 4, F(1, 97) = 174.18, p = .0001, η 2 = .22, and from set size 3 to set size 5, F(1, 97) = 92.50, p = .0001, η 2 = .19, but there was no difference between accuracy rates from set size 4 to set size 5 (p = .69). No interactions with age group were observed in any of these pairwise comparisons (all ps > .51). Thus, across all age groups, working memory performance decreased similarly from set size 3 to set sizes 4 and 5.

Table 2 Performance (percent correct) in the memory task in task 2 for set sizes 3, 4, and 5, separately for the four age groups

Visual search task

Median RT and the CV were examined with an ANOVA with age group as a between-subjects factor and memory load (high [collapsed across set sizes 4 and 5], low (set size 3]) and reward value (high, low) as within-subjects factors. There was a main effect of age group [median RT, F(3, 97) = 48.15, p < .0001, η 2 = .60; CV, F(3, 97) = 4.73, p = .004, η 2 = .12], similar to task 1. All pairwise comparisons were significant (all ps < .004). As can be seen in Fig. 4, no differences between high and low reward value trials were found [median RT, F(1, 99) = 2.27, p = .135; CV, F(1, 99) = 2,99, p = .09]. Also, no memory load × reward value interaction was present [median RT × memory load, F(1, 99) = 0.43, p = .51; CV × memory load, F(1, 99) = 0.12 p = .72], and no age group interactions were observed (all ps > .14). Thus, the additional short-term memory task completely eliminated the effect of reward value during visual search.

Fig. 4
figure 4

Behavioral results from task 2. Median reaction times (a) and coefficient of variations (b) for high (solid) and low (striped) reward value trials, separately for children, adolescents, younger adults, and older adults. Error bars denote within-subjects errors of the means

Postexperimental color preference rating

Color preferences were examined by comparing the frequencies of how often participants chose the high reward value color, relative to the low reward value color. We observed an age group × reward value interaction that reached significance, F(1, 94) = 2.69, p = .051. To further explore age group differences in the ratings, we performed post hoc comparisons for each age group separately and found that younger adults reported the high reward value color as “likable” more often (58.3%), relative to the low reward value color (41.7%), t(23) = 2.07, p = .04, η 2 = .16; none of the other age groups showed this tendency (all other ps > .14).

Discussion

Across four different age groups, we found that stimulus–reward associations can influence stimulus selection in a visual search task. When targets were associated with high reward value, participants responded faster, as compared with when targets were associated with low reward value. This effect appeared to be larger in younger adults than in children, adolescents, and older adults. Interestingly, only younger adults showed lower within-person trial-to-trial response variability when targets were associated with high reward value, as compared with when targets were associated with low reward value. Thus, our findings show that higher reward value accelerated target detection in all four age groups, with the effect being largest in younger adults. Furthermore, higher reward value rendered more consistent behavior in younger adults only.

Our result showing the largest effects of reward value in younger adults is in good agreement with previous studies on developmental and aging-related differences and impairments in reward-based processes (Chowdhury et al., 2013; Eppinger et al., 2008; Hämmerer et al., 2011; Hämmerer et al., 2013; Nieuwenhuis et al., 2002; Samanez-Larkin, Levens, Perry, Dougherty, & Knutson, 2012; van Duivenvoorde, Zanolie, Rombouts, Raijmakers, & Crone, 2008). However, although we find stronger effects of reward value in younger adults, all three other age groups showed pronounced effects of reward value on response speed. This indicates that reward can affect perceptual and attentional processes across the lifespan and suggests that children, adolescents, and older adults are sensitive to reward value associations in the context of visual search tasks. Different from the present study, most of the learning and decision-making tasks that show strong developmental and age impairments in performance are tasks that involve explicit goal-directed behavior. That is, participants are asked to learn stimulus–response associations based on rewarding or aversive outcomes or to make decisions to maximize reward. In contrast, in the present task, the feature carrying the value information (i.e., color) was orthogonal to the search task and, therefore, task irrelevant. In fact, performance and monetary gain at the end of the experiment was completely independent of whether participants learned the association between color and reward value or not. The fact that participants nevertheless acquired the association between value and color (as reflected in the RT speed-up) suggests that the underlying mechanism operates at relatively early and implicit processing stages. Initial support for this view comes from a postexperiment rating in which we asked participants about their subjective preferences for the colors that were previously associated with high and low reward value. Observers were instructed to pick the color they liked better in a two-alternative forced choice task, in which high and low reward value colors were contrasted against each other. This rating procedure allowed us to explore whether participants’ preferences were shifted toward the high value color—without having to tap into goal-directed, explicit processes. Whereas younger adults showed a tendency to report the high reward value color as more “likable,” none of the other age groups showed preferences for one or the other color. This suggests that most of the participants (at least, children, adolescents, and older adults) were not biased toward high reward value colors, implying that these associations were rather weak and of an implicit nature. Furthermore, the analysis of the temporal trajectories (Fig. 3) showing a slow and gradual development of RT value effects over time further supports the hypothesis that reward magnitude associations were learned gradually and, thus, rather implicitly. Taken together, these data are nicely consistent with other recent results suggesting that reward effects on visual attention emerge involuntarily and independently of task goals. For instance, Anderson et al. (2011) found that the presence of a task-irrelevant item in a search array that was previously associated with high reward value slows down visual search—presumably, by capturing attention automatically (see also Anderson et al., 2012, 2013; Anderson & Yantis, 2013; Krebs, Boehler, Egner, & Woldorff, 2011; Krebs, Boehler, & Woldorff, 2010; Yantis, Anderson, Wampler, & Laurent, in press). More evidence comes from electrophysiological recordings in the posterior parietal cortex of the monkey showing sustained activation for reward-associated stimuli, even when such reward-dependent activation is suboptimal for the task, implying that reward-associated features render attentional priority regardless of behavioral performance (Peck, Jangraw, Suzuki, Efem, & Gottlieb, 2009). Of additional interest, an EEG study in humans suggests that reward effects in visual search arise within the first 100 ms of visual processing, as reflected in amplitude modulations of the P1 component (Hickey, Chelazzi, & Theeuwes, 2010). Taken together, our findings may suggest that relatively early and involuntary attentional mechanisms and their modulation by reward are present across the human lifespan. Such an interpretation would be consistent with several findings in the aging and developmental literature that suggest that lower-level processes such as implicit learning may be less affected by age (Cherry & Stadler, 1995; Dennis, Howard, & Howard, 2006; Howard, Howard, Dennis, LaVine, & Valentino, 2008; Merrill, Conners, Roskos, Klinger, & Klinger, 2013; Thomas & Nelson, 2001).

Another aspect of our study that differs from previous studies is the fact that most studies on goal-directed learning and decision making focus on the direct effects of reward and punishments on behavior, whereas in the present experiment, we were primarily interested in the effects of reward value (low vs. high) on selective attention in different age groups. The finding of the reward-based speed-up in RT across all age groups may hence also be a consequence of the fact that differences in value are less salient than bivalent outcomes that signal approach and avoidance.

The result that high reward value led to more consistent behavior in younger adults indicates that although mean RT and response variability share common variance, intraindividual RT variability provides an index beyond that of mean RT (Li, Hämmerer, Müller, Hommel, & Lindenberger, 2009; Li et al., 2004; Lövdén, Li, Shing, & Lindenberger, 2007; Schmiedek, Lövdén, & Lindenberger, 2009). Indeed, intraindividual RT variability (or lack of response consistency) has been shown to be a more sensitive measure in reflecting age- or pathology-related effects (MacDonald, Nyberg, & Bäckman, 2006). For instance, RT variability has been linked to dopaminergic neuromodulation. This notion comes from computational models (Li et al., 2001; Li et al., 2006) and dopamine receptor imaging studies (MacDonald, Cervenka, Farde, Nyberg, Bäckman, 2009b), which demonstrated relations between suboptimal dopaminergic neuromodulation and higher activation fluctuations (i.e., noise) in neural networks and behavioral performance variability, respectively. Specifically, computationally it has been demonstrated that simulating higher dopamine modulation in younger adults by a larger gain regulation of the signal-to-noise ratio of neural information processing resulted in less random activation variability and less trial-by-trial performance variability (e.g., Li et al., 2001; cf. Servan-Schreiber, Printz, & Cohen, 1990). Empirical evidence lends further support to this effect. A recent finding shows that higher dopamine receptor efficacy is associated with lower trial-by-trial behavioral variability (see MacDonald, Karlsson, Rieckmann, Nyberg, & Bäckman, 2012). Our finding is also in line with recent studies that suggested that age-related impairments in learning and decision making might be due to increased temporal variability in the activity of areas that receive strong dopaminergic input, such as the ventral striatum (Chowdury et al., 2013; Eppinger et al., 2013; Samanez-Larkin et al., 2010). Thus, it seems plausible that more optimal dopaminergic neuromodulation in younger adults would affect the quality of reward-feature templates by making them less noisy, which in turn would render more consistent behavior. This is exactly what we observed for high-value targets in younger adults. Possibly, the more distinct representations also influenced the postexperiment preference ratings, in which only younger adults tended to choose the high reward value color more often. A possible candidate accounting for these age-specific value effects on RT variability are developmental and aging-related differences in the baseline levels of dopaminergic neuromodulation. Estimates from cross-sectional studies suggest that the aging-related dopamine decline in striatal and extrastriatal regions progresses with a rate of about 10% per decade, starting in the third decade of life (e.g., Inoue et al., 2001; Kaasinen et al., 2000; see Bäckman et al., 2006, for a review). With respect to child development, the literature is much scarcer due to reservations in applying invasive methods, such as PET receptor imaging. The available evidence suggests that adultlike subcortical dopamine levels are reached during middle childhood (Haycock et al., 2003; Seeman et al., 1987), whereas dopamine receptor function shows a more gradual developmental trajectory (Baddeley & Diamond, 1996; Diamond, 2007; Rothmond, Weickert, & Webster, 2012). Possibly, these developmental and aging-related differences in dopaminergic neuromodulation underlie the observed differences in value effects on RT variability. This is also consistent with recent findings showing lifespan age differences in trial-by-trial variability of EEG signals in a performance-monitoring task (Papenberg, Hämmerer, Müller, Lindenberger, & Li, 2013).

In contrast to the well-established role of dopamine prediction error signaling for reinforcement learning (for a review, see Niv & Schoenbaum, 2008), the neural mechanisms that underlie the effects of reward on early perceptual and attentional processes are not well understood. EEG and MEG data point to value-related modulations at early visual processing stages (Chelazzi, Perlato, Santandrea, & Della Libera, 2013; Hickey et al. 2010; Tallon-Baudry Meyniel, & Bourgeois-Gironde, 2011), and results from fMRI research show that value signals can modulate activity in the early visual cortex, including V1 (Seitz, Kim, & Watanabe, 2009; Serences, 2008). For example, one study found enhanced activity in higher nonretinotopic visual areas (in addition to the striatum and orbitofrontal cortex) during the presentation of rewarding feedback, which, on the following trial, led to increased activity in early visual areas (V1, V2) that was spatially specific (i.e., contralateral to the target; Weil et al., 2010). This suggests that rewards may initially trigger rather global, nonspecific signals, which subsequently influence activity in the early visual cortex (Weil et al., 2010). These findings are also consistent with findings in monkeys that show value-dependent modulations of neuronal activity in sensory areas as early as V1 (Arsenault, Nelissen, Jarraya, & Vanduffel, 2013; Stanisor, van der Togt, Pennartz, & Roelfsema, 2013). Together, these findings may suggest that mesencephalic dopamine can change the perceptual saliency of a rewarded stimulus by directly changing sensory signals at the level of the primary visual cortex (cf. Berridge & Robinson, 1998).

It seems rather unlikely that reward had such a direct influence on visual processing in the present task. More plausible is that brain areas related to selective attention mediated the effects of reward on visual processing. Hickey et al. (2010) recently showed that the anterior cingulate—an area involved in attentional control processes (Hopfinger, Buonocore, & Mangun, 2000)—plays an important role in guiding attention toward motivationally significant events. One tentative interpretation of these findings could be that reward value triggers involuntary attentional mechanisms that facilitate the early processing of reward-associated features. These effects are then reflected in speeded RT and enhanced responses in early visual processing areas (Hickey et al. 2010; Serences, 2008). Support for the argumentation that the present results do not reflect direct effects of reward on visual processing but are mediated through attention comes from the second task. In this part of the study, participants performed the reward-based visual search task but were instructed to additionally engage in a working memory task to increase demands on executive control functions. Across all four age groups, we found that the secondary task slowed down RT. Most interesting, the RT benefit for high reward value, compared with low reward value, disappeared in all age groups. One way to interpret this effect would be to assume that increased demands on control interfere with reward effects in visual search. This would imply that the attentional mechanism that mediates the reward effects is part of a more general executive control network that is involved in guiding behavior toward motivationally significant events. Increased demands on this network may lead to deficits in building up stimulus–reward associations. This interpretation would be in line with an extensive literature on the role of the medial prefrontal cortex in executive control (Botvinick, Braver, Barch, Carter, & Cohen, 2001; Corbetta & Shulman, 2002).

However, it could also be argued that in the dual-task situation, participants shifted their attentional focus to the secondary task and were thus not able to detect the reward associations. This view would be consistent with the idea that the reward value effects reflect early involuntary attentional mechanisms, rather than deliberate executive control processes. Future studies should address this question more directly by trying to dissociate involuntary shifts of attentional focus due to task demands from limitations in executive control due to the secondary task. Note that the search-memory task was always performed after the reward-based search task without the memory component. Thus, the results of the second task should be interpreted with caution. However, control analyses that examined the effect of reward value separately across experimental blocks revealed that although the reward effect seemed to be strongest during initial blocks, substantial reward effects were still observed during later blocks in task 1, in contrast to the complete elimination of the reward effects in task 2 (even in the first block).Footnote 2 Thus, whereas the order effect may moderate the effect of reward to some extent, the complete elimination of reward effect in task 2 cannot be attributed to order effects alone. Future experiments should intermix both trial types to unequivocally relate the lack of reward value on search RT to the memory component.

Taken together, the present empirical results show that reward value speeds visual search times in children, adolescents, younger adults, and older adults. This effect seems to be most pronounced for younger adults and of similar magnitude for all other age groups. Only younger adults showed more consistent behavior for high reward value trials and rated colors previously associated with high value as more “likable.” These findings lead us to propose that relatively early involuntary and implicit stimulus–reward associations can modulate visual attention across the lifespan. At the same time, these findings seem to be consistent with the hypothesis that lifespan age differences in dopaminergic neuromodulation contribute to age differences in reward processes even at these relatively early perceptual and attentional stages, as reflected in age-related differences in behavioral benefit and variability. Interestingly, we also found that reward value effects disappear when a secondary task is added to the visual search paradigm. We think that this effect may be due to a shift in the attentional focus to the more demanding secondary task.