Expecting the unexpected: a review of learning under uncertainty across development

Topel, Selin; Ma, Ili; Sleutels, Jan; van Steenbergen, Henk; de Bruijn, Ellen R. A.; van Duijvenvoorde, Anna C. K.

doi:10.3758/s13415-023-01098-0

Expecting the unexpected: a review of learning under uncertainty across development

Special Issue/Uncertainty
Open access
Published: 26 May 2023

Volume 23, pages 718–738, (2023)
Cite this article

Download PDF

You have full access to this open access article

Cognitive, Affective, & Behavioral Neuroscience Aims and scope Submit manuscript

Expecting the unexpected: a review of learning under uncertainty across development

Download PDF

Selin Topel^1,2,
Ili Ma^1,2,
Jan Sleutels^1,3,
Henk van Steenbergen^1,2,
Ellen R. A. de Bruijn^1,2 &
…
Anna C. K. van Duijvenvoorde^1,2

Abstract

Many of our decisions take place under uncertainty. To successfully navigate the environment, individuals need to estimate the degree of uncertainty and adapt their behaviors accordingly by learning from experiences. However, uncertainty is a broad construct and distinct types of uncertainty may differentially influence our learning. We provide a semi-systematic review to illustrate cognitive and neurobiological processes involved in learning under two types of uncertainty: learning in environments with stochastic outcomes, and with volatile outcomes. We specifically reviewed studies (N = 26 studies) that included an adolescent population, because adolescence is a period in life characterized by heightened exploration and learning, as well as heightened uncertainty due to experiencing many new, often social, environments. Until now, reviews have not comprehensively compared learning under distinct types of uncertainties in this age range. Our main findings show that although the overall developmental patterns were mixed, most studies indicate that learning from stochastic outcomes, as indicated by increased accuracy in performance, improved with age. We also found that adolescents tended to have an advantage compared with adults and children when learning from volatile outcomes. We discuss potential mechanisms explaining these age-related differences and conclude by outlining future research directions.

Uncertainty in learning and decision-making: Introduction to the special issue

Article 24 May 2023

The stability flexibility tradeoff and the dark side of detail

Article 24 November 2020

Developmental changes in exploration resemble stochastic optimization

Article Open access 17 August 2023

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Uncertainty is a common feature of our everyday decisions and actions, and we must deal with incomplete information in many everyday situations. Despite its pervasiveness, uncertainty comes in many shapes and forms. For instance, think about trying a new coffee place; uncertainty may stem from not knowing some of the products offered, it may stem from uncertainty about the quality of their products, and it may even stem from uncertainty about the quality-stability of their products. At the counter, our decision may depend on what we expect to be the best at that time (i.e., oat milk cappuccino). Consequently, we may need to update those expectations or beliefs based on our experiences (i.e., How tasty was it?). Choosing the best course of action (i.e., Should I order this here again?) depends on our ability to learn from experiences and adjust our expectations accordingly by keeping track of outcomes and the changes in those outcomes over time. Specific periods in life, such as adolescence, have been characterized by being attuned to learning and navigating novel and inherently uncertain environments. This study is a review of recent literature on adolescent learning under different types of uncertainty.

Adolescence is a developmental phase between childhood and adulthood in which we transition into an adult role and develop mature social goals (Crockett & Crouter, 2014). The start of adolescence is biologically marked by the start of puberty, although the end of adolescence is less clearly defined (Sawyer et al., 2018). In Western societies, adolescence approximately spans the period between ages 10-24 years (including an age range sometimes referred to as emerging adulthood; Arnett, 2000; Sawyer et al., 2018; Jaworska & MacQueen, 2015). Puberty is characterized by a rapid rise in gonadal hormones, including testosterone and estradiol, which have a large influence on bodily characteristics, brain development, and behavior (Laube & van den Bos, 2016; Schulz & Sisk, 2016). Although the exact role of these hormones is unknown, conceptual models have hypothesized that pubertal hormones trigger the limbic brain system to flexibly recruit cortical control regions and potentially boost development of higher cognitive and self-regulatory functions important for learning. For instance, recent animal work has shown that hormonal levels directly influence the organization of the prefrontal cortex and accelerated performance in a reversal learning paradigm (Piekarski et al., 2017). Despite the link between hormonal changes and human learning under uncertainty still being inconclusive, it is suggested that these underlying neurobehavioral changes influence adolescent learning.

Additionally, adolescence is characterized as a life-phase in which individuals are confronted with new environments that may result in temporarily heightened uncertainty (Hartley & Somerville, 2015; Hofmans & van den Bos, 2022). For example, adolescents find themselves confronted with new social groups when beginning high school. They may also experience uncertainty about their social position within a new group and about newly formed social relationships that become more profound in adolescence. Adolescents may form deeper friendships, start romantic relations, or join certain groups outside of their family environment where they take on more roles and different responsibilities (Crockett & Crouter, 2014; Suleiman et al., 2017). The potential sensitivity to learning in the adolescent brain may help to rapidly reduce this heightened uncertainty and flexibly adapt behavior to new and changing environments (Crone & Dahl, 2012). A specific hypothesis that has been put forward is that adolescents may be more attuned to detecting changing outcomes over time and more readily adjust their behavior compared with children and adults (Lin & Wilbrecht, 2022; Romer et al., 2017). We examined this hypothesis by reviewing the developmental literature on learning under two types of outcome uncertainty: 1) stochastic outcomes, in which the outcome variation remains stable over time; and 2) volatile outcomes, in which there is a change in mean-value or probabilities of outcomes over time.

As highlighted in these examples, many changes in the adolescents’ environments may social, and it is debated whether learning from social and nonsocial outcomes relies on the same computational and neural mechanisms (Ruff & Fehr, 2014). Although studies revealed overlap in neural mechanisms when processing social and nonsocial rewards (i.e., a common currency; Corlett et al., 2022; Martinez-Saito & Gorina, 2022), there is evidence for a degree of specificity, in which parts of the prefrontal cortex (e.g., the dorsal medial prefrontal cortex, and lateral prefrontal cortex) may respond stronger or specifically to social than nonsocial learning outcomes (Corlett et al., 2022; Greimel et al., 2018; Martinez-Saito & Gorina, 2022; Apps & Sallet, 2017). However, given the limited set of studies in adolescents that contrasts learning under different types of uncertainty in social and nonsocial situations, we do not include this as a direct comparison in our review.

The goal of this review is to explore how the current available studies support the hypothesis that adolescence is particularly attuned to learning under uncertainty, in which we group studies based on their outcome stochasticity and outcome volatility. We first provide a definition of these different types of uncertainty and elaborate on our semi-structured literature search and inclusion strategy. Second, we discuss computational methods to understand the various manifestations of uncertainty, including the proposed neurobiological measures involved. Third, we review empirical evidence from age-related comparisons in studies examining learning under stochastic or volatile outcomes. Finally, we discuss the resulting implications for our understanding of adolescent learning under different types of uncertainty and present potential next steps for future research that also target individual differences.

Different types of uncertainty: Stochasticity and volatility

The definition of uncertainty has sparked discussion in the literature. In general, uncertainty arises from outcome variability or incomplete information about the outcomes. Despite some conceptual overlap, different forms of uncertainty have been defined (Bland & Schaefer, 2012; Huettel et al., 2006; Piray & Daw, 2021; Pulcu & Browning, 2019; Soltani & Izquierdo, 2019; Yu & Dayan, 2005). Although other and more fine-grained distinctions have been made, we focus on stochasticity—also referred to as risk, expected or irreducible uncertainty—and volatility—sometimes referred to as unexpected uncertainty (e.g., differences between volatility and unexpected uncertainty; Bland & Schaefer, 2012). Both types of uncertainty can play a role in learning from repeated choices (e.g., learning task, Fig. 1A). To illustrate these different types of uncertainty in the lives of an adolescent, consider adolescents’ interactions. Stochastic outcomes refer to situations in which making the same decision may result in different outcomes (i.e., when there is outcome variance), a pattern that remains stable over time (Fig. 1B, upper panel). For example, meeting a friend after school is usually fun, but the friend is sometimes in a bad mood, which makes some interactions less enjoyable but still overall good. In contrast, volatile outcomes refer to situations in which the outcomes have changed, resulting in a new mean value and possibly different outcome variance (Fig. 1B, lower panel). For example, this friend has decided that they want to gain popularity in high school by joining a different social group and often is not friendly to you anymore. While seemingly dramatic, these examples are prevalent and representative of the lives of adolescents as this developmental phase comes with erratic mood changes (Maciejewski et al., 2019), formation of self-identity (Klimstra et al., 2010; Pfeifer & Berkman, 2018) and an increased importance of peer status and evaluation by peers (LaFontana & Cillessen, 2010; Sherman et al., 2016, 2018).

Making the distinction between stochastic and volatile outcomes is important. The literature on reinforcement learning suggests that these different types of uncertainty should optimally elicit different choice and learning strategies. That is, in an environment with high outcome stochasticity, an adaptive learner should integrate outcomes of their past decisions to form and update their internal value representation of the choice options. In an environment with high volatility, an adaptive learner should form and update expectations rapidly and based on recent outcomes after detecting a change (Behrens et al., 2007). Prior research has highlighted the importance for individuals to distinguish between volatility and stochasticity in their environment as these factors can interact (Piray & Daw, 2021; Yu & Dayan, 2005). For example, in environments with high estimated outcome stochasticity, unexpected events are more likely to be attributed to chance even when this event has occurred due to a real change. In other words, the ability to infer stochasticity and volatility can assist individuals in the challenging task of accurately estimating and responding to outcomes that arise from chance (stochasticity) versus those that indicate a change (volatility).

Despite the relevance of differentiating these two types of uncertainty, most developmental studies primarily focus on one of the two by making use of either probabilistic reinforcement learning paradigms targeting stochastic learning environments or reversal-learning paradigms targeting mainly volatility in learning environments, but also including a level of stochasticity. In this way, it could be suggested that these paradigms target, but do not isolate, learning under volatility. For a more detailed description of the task paradigms typically used in developmental and adult samples, see Box 1. Although relevant for understanding adolescent learning, the developmental literature is yet to distinguish between the behavioral and neural findings of learning under stochasticity and volatility. Therefore, as a first step, we reviewed and compared developmental studies that included an adolescent age range in stochastic learning contexts (without volatility) and volatile learning contexts (with or without stochasticity).

Box 1. Commonly used paradigms to study learning and decision-making under uncertainty

Probabilistic learning Probabilistic learning paradigms commonly consist of two stimuli or actions to choose from, and depending on the underlying probabilities or contingencies, the choice leads to either a positive (e.g., reward, absence of punishment) or a negative outcome (e.g., punishment, absence of reward) with some variance. Thus, even after the associations are learned, it is not possible to always experience the same (rewarding) outcome due to the noise or stochasticity in this environment. For example, in such a learning paradigm, choosing one option could lead to a reward 80% of the time, whereas choosing the other leads to a reward only 20% of the time. These outcomes for the two options can be either perfectly anticorrelated or independent. Reversal learning Reversal learning paradigms are generally used to study cognitive flexibility and appear similar to the probabilistic learning paradigms. However, they require participants to detect when the contingencies for different options are reversed after every few trials (e.g., a previously more rewarding option becomes less rewarding and vice versa). There are versions of reversal paradigms with deterministic and probabilistic outcomes. In deterministic reversal learning paradigms, the better option leads to the reward 100% of the time when chosen and surprising outcomes signal a reversal. In probabilistic reversal learning paradigms, the surprising outcomes may indicate that the stimulus or action associated with reward most of the time has changed or it might be a result of stochasticity. The frequent contingency reversals increase the volatility in these task environments. Some of these paradigms introduce reversals only after certain criteria are met (e.g., choosing the more rewarding option at least three times in a row; Weiss et al., 2021). Predictive inference Instead of probabilities, a task environment might depend on more continuous outcomes, such as points gained or the location of a hidden stimulus. The outcomes of actions vary around a mean value, which leads to stochasticity. While estimating the underlying mean value, learners should not update their expectations too much due to these random fluctuations (e.g., estimation and choice tasks in Jepma et al., 2020). However, there also might be changes in the mean value that are not due to stochasticity in which case learners should update their predictions more quickly. In a task with continuous outcomes, volatility would be reflected as a the rate of change in the generative mean value (e.g., changepoint task; Nassar et al., 2016). Risky decision-making Finally, there may be alternative paradigms that include an element of ambiguity (unknown probabilities) or risk (known probabilities) when learning. These experience-based, decision-making tasks require participants to make choices between risky and safe(r) options that are presented (Christakou et al., 2013; Jepma et al., 2022; Nussenbaum et al., 2022; Rodriguez Buritica et al., 2019). Risky options are generally operationalized as the ones with greater outcome variability, and consistently choosing such risky options can be either beneficial or detrimental in the long run based on their average value (Jepma et al., 2022; Nussenbaum et al., 2022). Thus, both the average expected outcome values and variability (akin to stochasticity) should be learned or estimated over time for different options.

Neural and computational mechanisms of learning under uncertainty

Neuroscientists have described well-defined, reward-learning networks, including cortico-basal-ganglia loops, with the striatum and medial prefrontal cortex being key regions in this network (Haber & Knutson, 2010). The interpretation of learning signals in the brain has benefited from cognitive computational modeling that quantifies different parameters of learning that rely on deviations of expectations (i.e., prediction errors; Box 2). Rewards that exceed our expectations generate positive prediction errors, which can reinforce behavior. Conversely, worse-than-expected rewards generate negative prediction errors and lead to extinction of behavior. When the prediction error becomes zero, no further learning is possible and the prediction remains stable (Schultz, 2015). The extent to which a prediction error alters subsequent subjective valuation of choice options depends on one’s learning rate (Box 2). Prediction-error (PE) learning processes are assumed to depend on midbrain dopamine signaling (i.e., ventral tegmental area, substantia nigra) and their projections (Schultz, 2007; Schultz et al., 1997). Consistently, findings have pointed to a distributed network for prediction error coding, including dopamine-innervated regions, such as the striatum, ventral medial prefrontal cortex (PFC), and anterior cingulate cortex (Garrison et al., 2013), and also observed PEs in regions, such as the insula and lateral PFC (e.g., extensive meta-analysis on domain-general and domain-specific PEs, Corlett et al., 2022). It is debated whether there are specific brain networks involved in the updating of expectations (Bruckner et al., 2022), but at least one study has observed learning rates to be related to functional connectivity between the striatum and ventral medial PFC (van den Bos et al., 2012). Developmental research has aimed to quantify age-related changes in parameters of reinforcement-learning models and relate these to age-related changes in brain functioning. The use of computational models in different age groups in combination with brain measures may provide insights in how learning develops on multiple levels of explanation (Lockwood et al., 2020), but see recent reviews for discussion on the use of computational models in understanding learning processes (Nassar & Frank, 2016; Eckstein et al., 2021).

Another neurobiological framework on learning under uncertainty quantifies the importance of neurotransmitter systems, such as acetylcholine and noradrenaline (NA). Volatility, or unexpected changes, are thought to depend, at least partly, on the locus coeruleus-NA system (Bruckner et al., 2022). This system has been associated with uncertainty in Bayesian modeling approaches (Box 2) with rapid learning-rate adjustments. There is some evidence that inhibiting NA levels by using a pharmacological antagonist increases individuals’ learning rate through which beliefs about volatility are updated (Marshall et al., 2016). This indicates that NA stabilizes individual’s estimate of environmental volatility. Brain regions that have been related to coding uncertainty and surprise overlap partly with regions that are sensitive to prediction errors and include the anterior cingulate cortex (ACC; Behrens et al., 2007; d’Acremont & Bossaerts, 2016), posterior cingulate cortex (Payzan-LeNestour et al., 2013), and wider frontal-parietal brain regions (Kao et al., 2020). Other work has suggested the basolateral amygdala to be a key region for detecting outcome volatility, which may depend on the connections with the ACC, or potentially dopaminergic innervations (Soltani & Izquierdo, 2019). Overall, these findings point to a distributed network that includes regions of the PFC, parietal cortex, and subcortical regions involved in learning under uncertainty, as well as the neurochemical involvement of, at least, dopamine and NA. Many of these regions undergo large structural and functional development during adolescence and into adulthood (Silverman et al., 2015; Tamnes et al., 2017), and similarly changes in neurotransmitter systems are prevalent across adolescence (Larsen et al., 2020; Wahlstrom et al., 2010). It is at the moment, however, unclear how these changes contribute to adolescent learning under uncertainty.

Box 2. Computational models used to model learning and choice in their simplest form

Reinforcement learning (RL) models In their simplest and widely used form, RL models include the Rescorla-Wagner learning rule (Equation 1b) combined with the Softmax choice function (Equation 2). An important element of learning in these models is the prediction error (PE), which is the difference between an expected (EV) and a received (O) outcome as a result of an action (e.g., choosing the right option) (Equation 1a). \(PE_{t}=O{t}-EV(right)_{t}\qquad\qquad\qquad(1 \text a)\) \(EV(right)_{t+1}=EV(right)_{t}+a(PE_{t})\quad(1 \text b)\) Parameters in RL models that are estimated from the data and used to calculate PEs are a learning rate (α) and decision temperature (β). Learning rates reflect the degree of updating of expectations (EV), i.e., expected value of a stimulus or action. The EV is then updated for each stimulus or action separately at time t (note that there might be variations of these models where EVs for both options are updated simultaneously based on the outcome received for one of them, i.e., when the options are perfectly anticorrelated). Although these models can be extended in several ways, one common version includes separate learning rates for positive (better-than-expected) and negative (worse-than-expected) PEs. \(p{(right)}_t=\frac{1}{1+{e}^{-\beta \left( EV{(right)}_t- EV{(left)}_t\right)}}\kern2.25em (2)\) The choice is then determined by the Softmax function, which assigns higher choice probability to the option with the higher EV proportional to the difference of the EVs for different options with varying sensitivity (Equation 2). The decision temperature (also called inverse temperature) indicates the degree of this sensitivity and can indicate more or less exploratory choice behavior depending on its value. Learning rates determine how much influence PEs have on the updating; a higher learning rate would lead to larger influence of the most recent outcomes, whereas a lower learning rate would lead to slower integration across a history of multiple outcomes. Bayesian Updating Models Simple reinforcement-learning models do not incorporate uncertainty directly in their computational framework. In contrast, Bayesian models assume that individuals attempt to infer the environment’s hidden states given an individual’s observations (i.e., given the outcomes). In Bayesian models, uncertainty is explicitly built in. That is, in Bayesian learning models, there is not a single estimation of EV, but there is a belief distribution over the world state of interest given the observations. This belief distribution starts with a prior belief distribution and is updated with each observation based on Bayes rule, resulting in the posterior belief distribution of an individual. The posterior distribution is then used in the decision rule by maximizing the expected utility under the posterior (e.g., maximum a posteriori (MAP) decision rule), while the width of the distribution corresponds to uncertainty about the environment’s state. For more information see e.g. Ma et al. (2022a).

Semi-systematic review approach

Semi-systematic literature reviews are used to integrate evidence on topics that are conceptualized and studied in different ways which may impede the process of a full systematic review and/or meta-analysis (Snyder, 2019). We opted for a semi-systematic review to study the developmental differences in learning performance and strategies from stochastic and volatile outcomes by making use of an extant literature including a diverse set of studies on belief updating and reinforcement learning. Thus, we searched terms on the PubMed database related to Uncertainty, Probability Learning, Reversal Learning, Reinforcement; together with terms, such as Developmental, Adolescent Development, Young Adult, Puberty (final search date July 26, 2022; see full list of terms in Supplementary Tables S1-2). In addition to screening these articles published from 2010 (excluding review articles, studies that did not include adolescent samples and those that did not include any age-related analyses), we also used snowballing methods by searching for the citing papers of these articles, and articles cited by them to identify other relevant papers and preprints (Supplementary Figure S1, flow diagram). Tables 1 and 2 summarize all studies, including age ranges and paradigms, model parameters, and whether neuroimaging data were included. Supplementary Table 1 includes the means of parameters estimates in the studies (if reported). We discuss the studies of learning under stochasticity (Table 1) and volatility (Table 2) separately in relation to age-related differences in behavioral, computational modeling, and neural findings and make suggestions for future studies.

Table 1 Studies that investigated developmental differences in learning and decision-making with stochastic outcomes

Full size table

Table 2 Table outlining studies that investigated developmental differences in learning and decision-making with volatile outcomes

Full size table

Results

Development of learning from stochastic outcomes

Table 1 lists empirical studies comparing developmental samples using tasks that involve outcome stochasticity. The majority of these studies employed probabilistic learning tasks with stable reward contingencies, and a few used experience-based decision-making tasks. Among these are studies that used a RL model (except Hämmerer et al., 2011; Humphreys et al., 2016; and Smith et al., 2012) and studies with (n = 7) or without (n = 12) neuroimaging. Only four of the reviewed studies with stochastic but stable outcome contingencies used social rewards or feedback, and the nature of these were highly diverse (i.e., prosocial reward, reciprocity of trust, acceptance, and feedback about others’ mental states), hindering our ability to directly compare social to nonsocial tasks.

When summarizing these developmental findings, we first consider how quickly individuals at different ages update values of different stimuli or actions in contexts with stochastic but otherwise stable outcomes which should evoke higher degrees of expected uncertainty. These findings have been mixed. Whereas some studies reported that adolescents had lower learning rates than adults (Davidow et al., 2016; Jones et al., 2014; Rosenblau et al., 2018; Xia et al., 2021), others reported a decrease in learning rates with age (Decker et al., 2015; Jepma et al., 2020; van den Bos et al., 2012; Westhoff et al., 2020, 2021) or no age-related differences (Palminteri et al., 2016; Raab & Hartley, 2020). A subset of these studies (N = 6) reported asymmetrical learning rates for positive (better-than-expected) and negative (worse-than-expected) PEs—referred to as positive and negative learning rates in short—instead of single learning rates, which further added to divergent findings in the literature (Christakou et al., 2013; Jones et al., 2014; Nussenbaum et al., 2022; Rodriguez Buritica et al., 2019; van den Bos et al., 2012; Xia et al., 2021). If we look at these separately, however, positive learning rates in children and adolescents showed mixed findings. One study reported higher positive learning rates in children and adults relative to adolescents (Jones et al., 2014); another reported a marginal increase with age from childhood to adulthood (van den Bos et al., 2012). Two studies reported opposite patterns: one reported a decrease in positive learning rates from early adolescence to adulthood (Christakou et al., 2013), and the other reported an increase (Xia et al., 2021). Yet others reported no difference in positive learning rates across ages (Nussenbaum et al., 2022; Rodriguez Buritica et al., 2019). Negative learning rates seemed to be relatively more consistent where children showed either the highest (Nussenbaum et al., 2022; Rodriguez Buritica et al., 2019; van den Bos et al., 2012) or similar levels (Jones et al., 2014) compared with other age groups. Adolescents showed similar negative learning rates to adults (Jones et al., 2014; Rodriguez Buritica et al., 2019) or negative learning rates decreased with age (Nussenbaum et al., 2022; van den Bos et al., 2012), except for one study that reported an increase in negative learning rates with age in adolescence but not in adulthood (Christakou et al., 2013).

In contrast, findings from most studies indicate a decrease in choice stochasticity and exploration (i.e., inverse temperature) in adults compared with children and adolescents in most studies (Decker et al., 2015; Jepma et al., 2020; Nussenbaum et al., 2022; Palminteri et al., 2016; Rodriguez Buritica et al., 2019; Westhoff et al., 2021; Xia et al., 2021; but see Davidow et al., 2016, and van den Bos et al., 2012). Moreover, learning performance—as indicated by the proportion of choices for the option with higher underlying mean value, correct responses, or more accurate predictions depending on the task characteristics—generally increased with age (Christakou et al., 2013; Cohen et al., 2010; Humphreys et al., 2016; Jepma et al., 2020; Jepma et al., 2022; Jones et al., 2014; Nussenbaum et al., 2022; Palminteri et al., 2016; Rosenblau et al., 2018; Westhoff et al., 2021; Xia et al., 2021) . Compared with the number of studies that reported an age-related increase in learning performance, fewer studies reported a decrease in performance from adolescence through adulthood (Davidow et al., 2016; Raab & Hartley, 2020), or they reported no age-related differences between adolescents and young adults (Rodriguez Buritica et al., 2019) but found that children and older adults performed worse than adolescents and young adults (Decker et al., 2015; Hämmerer et al., 2011). One study also found a U-shaped relationship with age from childhood to mid-late adolescence with lowest performance between ages 10-13 years (Smith et al., 2012).

Neuroimaging findings show that in a learning context with outcome stochasticity, PEs scale with the activity in the ventral striatum, and medial PFC (Cohen et al., 2010; Davidow et al., 2016; Jones et al., 2014; van den Bos et al., 2012; Westhoff et al., 2021). In a social learning task where participants made predictions about the preferences of peers, activation in the fusiform cortex was associated with PEs (Rosenblau et al., 2018). In terms of age-related differences, studies reported 1) no age-related change in PE responses when learning for self (Westhoff et al., 2021), 2) peak striatal activity in adolescence (Cohen et al., 2010), 3) greater hippocampal PE-related activity in adolescents vs. adults (Davidow et al., 2016), and 4) greater activation in insula with positive PEs specific to adolescents (Jones et al., 2014). The expected values and predictions in these tasks correlated with the medial PFC responses, which were stronger in adults relative to adolescents (Jones et al., 2014; Rosenblau et al., 2018). In addition, one study investigating the functional connectivity between striatum and medial PFC reported enhanced connectivity during the receipt of positive versus negative feedback, which also increased with age (van den Bos et al., 2012).

Taken together, these findings are difficult to reconcile in terms of systematic developmental changes. The reported inconsistencies seem due to the variety of tasks (e.g., some requiring higher working-memory capacity or social tasks) and computational models (e.g., single vs. asymmetrical learning rates) used as well as sample characteristics. For example, there are inconsistencies in the cutoff ages that different studies used in order to group participants as children, adolescents, and adults. Combined with differences in analytic approaches (e.g., age used as a continuous variable vs. grouping variable), such sample differences may have contributed to mixed findings when comparing ages. Despite this, most studies using stable probabilistic learning tasks that involve expected uncertainty (due to outcome stochasticity) somewhat consistently reported a decrease in choice stochasticity (Decker et al., 2015; Palminteri et al., 2016; Westhoff et al., 2021; Xia et al., 2021 but see Davidow et al., 2016; and van den Bos et al., 2012) and an increase in performance from childhood to adulthood (Decker et al., 2015; Hämmerer et al., 2011; Jones et al., 2014; Palminteri et al., 2016; Rosenblau et al., 2018; Westhoff et al., 2021; Xia et al., 2021 but see Davidow et al., 2016; Raab & Hartley, 2020; and Smith et al., 2012).

Results: Development of learning from volatile outcomes

Dynamic and volatile environments contain reversals or sudden changes in the outcome statistics that evoke unexpected uncertainty. In these environments, typically the challenge is to optimally respond to unexpected outcomes, because they might signal either a change or occur due to stochasticity or noise in the environment. Table 2 shows the overview of developmental studies that used tasks with high volatility, such as probabilistic or deterministic reversal learning tasks (n = 6) and a predictive inference task (n = 1; Bruckner et al., 2020). The majority (n = 6) of these studies also employed computational models to analyze the behavioral patterns. All studies recruited adolescents. Except for one study, which compared children to adolescents (Weiss et al., 2021), the others compared younger participants to adults. A subset of studies (n = 3) included neuroimaging findings. Except for one experimental condition in one of the reported studies (i.e., videos showing an individual smiling and giving thumbs up; Weiss et al., 2021), all studies with volatile outcomes focused on learning from nonsocial reward or feedback.

Similar to tasks that involved outcome stochasticity but not volatility, these studies using tasks with higher volatility also reported mixed findings regarding age-related differences in the patterns of learning rates. Some studies reported no differences in learning rates (Bruckner et al., 2020; Javadi et al., 2014; Waltmann et al., 2023). Others reported higher learning rates in adolescents compared with adults (particularly for negative outcomes; Hauser et al., 2015), or found higher learning rates in adolescents than in children (Bruckner et al., 2020; Weiss et al., 2021). In contrast, a recent study reported lowest negative learning rates in adolescents among all the age groups (Eckstein et al., 2022). Only two of these studies modeled and reported on both positive and negative learning rates (Eckstein et al., 2022; Hauser et al., 2015), whereas others reported on a single learning rate.

With regard to the inverse temperature parameter, these studies reported either no age-related differences (Hauser et al., 2015; Weiss et al., 2021) or increases with age (Eckstein et al., 2022; Javadi et al., 2014), indicating less exploration or noisy choices with age. A recent study found that adolescents were less sensitive to particularly positive reinforcement than adults, which leads adolescents to show more response switching akin to more exploratory/noisy choice behavior (Waltmann et al., 2023). In addition, whereas some studies reported peak performance in adolescence compared with other ages (Eckstein et al., 2022; van der Schaaf et al., 2011; Weiss et al., 2021), or better performance of adolescents than adults in early trials of more volatile phases (Waltmann et al., 2023), the others did not find any differences in performance between adolescents and adults (Bruckner et al., 2020; Hauser et al., 2015; Javadi et al., 2014).

Neuroimaging findings show that in a learning context with high volatility, PEs were found to be associated with the activity in the striatum, ventral medial PFC, and posterior cingulate cortex, yet with neglectable or very limited age-related differences (Hauser et al., 2015; Javadi et al., 2014; Waltmann et al., 2023). One study reported an increased right insula response to negative PEs in adolescents compared with adults (Hauser et al., 2015). Another study reported that activity in the medial PFC scaled with choice probability predicted by the computational model and was stronger in adults than adolescents (Waltmann et al., 2023).

Although all these studies involved volatility, the characteristics of the experimental paradigms, computational models used, and samples varied considerably. This group of studies most commonly employed probabilistic learning tasks. However, even when we only compare the probabilistic reversal tasks that involve choosing between two options, the exact probabilities associated with a given outcome were 80%, 75%, or 60% in different studies. The outcomes could be gain and loss, gain and no gain, or loss and no-loss. In addition, there were inconsistencies in the cutoffs used to define different age groups along with differences in the analytic approach to assess age-related effects in these studies similar to those in studies that employed tasks with stochastic outcomes. Despite these differences, it seems that adolescents either performed comparable to adults (Bruckner et al., 2020; Hauser et al., 2015; Javadi et al., 2014) or better (Eckstein et al., 2022; van der Schaaf et al., 2011; during early reversal phases in Waltmann et al., 2023; recently similar findings were reported in Chierchia et al., 2022) in such dynamic and changing environments with higher levels of volatility.

Interim summary: Comparing the development of learning from stochastic to volatile outcomes

The results suggest that adolescents might have an advantage over younger and older age groups when learning in dynamic environments with volatile outcomes. Particularly, they might perform relatively better than adults when learning from volatile outcomes compared with learning from only stochastic outcomes where they generally seem to perform worse than adults. More specifically, among the studies, three of six that included adolescents and adults showed that adolescents were better at learning from volatile outcomes than adults (note that this was only true in early reversal phases in Waltmann et al., 2023); the other three showed no age-related changes from adolescence to adulthood. Interestingly, none of these studies reported that adolescents performed worse (i.e., fewer correct choices) at learning from volatile outcomes. Although the results for stochastic outcomes appear to be somewhat mixed, there seemed to be an improvement in learning from stochastic but stable outcomes with age. Eleven of 17 studies that reported on learning performance showed that adolescent's learning improved with age, whereas two studies found an age-related decline and three did not find age-related differences in performance between adolescents and adults.

Only 10 among the 26 studies reviewed included neuroimaging data. Moreover, the heterogeneity of the learning paradigms and modeling approaches in the reviewed studies makes it difficult to compare the neural correlates of the processes involved in learning under stochastic and volatile outcomes. Interestingly, most of these studies do not find age-related differences in the processing of PEs in these regions (but see Cohen et al., 2010). Also, we did not identify any regions that dissociated learning from stochastic outcomes and volatile outcomes. One explanation is that these learning processes may overlap and depend on the same learning systems in the brain. Alternatively, different levels of volatility (or surprise) may target more specific neural systems, although there are limited indications for a distinction in learning systems in the developmental comparison we included.

Discussion and future directions

Mechanisms underlying adolescent learning and decision-making under different types of uncertainty

From this review, our findings indicate that adolescents, compared with children and adults, seem to have a relative advantage when learning from volatile outcomes. When learning from stochastic outcomes adolescents, compared with adults, have a relative disadvantage, but future studies are needed to identify the underlying causal mechanisms for this effect. These empirical studies may suggest at least several candidate mechanisms. First, although the findings for learning rates were largely mixed, explorative or noisy choices—e.g., as indicated by the (inverse) temperature parameter—decreased from adolescence to adulthood across learning tasks (Nussenbaum & Hartley, 2019). On average, these exploratory or noisy choices can result in gaining less reward or incurring greater loss in environments where outcomes are stochastic, but stable. However, in environments where outcomes are volatile, these choices can result in faster detection of changes in outcomes, as the lower value options keep being sampled instead of avoided entirely (Denrell, 2005; Fan et al., 2022; Lloyd et al., 2022). The decrease in exploratory or noisy choices with age is therefore one potential mechanism by which adolescents might perform better when learning from volatile outcomes than when learning from only stochastic outcomes relative to adults.

Second, adolescents may be more prone to perceive volatility in environments in which outcomes are in reality only stochastic (Jepma et al., 2020). This could indicate that adolescents either expect more volatility in their environment or that they mistake stochastic outcomes as signals of volatility. Furthermore, adolescents who estimate higher volatility in such environments may have both higher learning rates and engage in more exploratory or noisy choices (Jepma et al., 2020). The majority of studies we reviewed employed learning tasks with a choice component. In such tasks, both the updating of the expected values and choice behavior play a role in determining performance. Thus, making use of designs that examine these processes partially independently (e.g., an estimation task without a choice function, and a probabilistic learning task with a choice function as in Jepma et al., 2020) under different types of outcome uncertainty may help better understand the mechanisms that give rise to age-related differences in performance when learning from volatile and stochastic outcomes.

Finally, neurobiological, hormonal, and environmental changes that take place during adolescence may explain why the developmental changes in learning from stochastic and volatile outcomes occur and relate to the changes in exploration, noise, and perceptions of volatility. In a developmental perspective, an interesting question for future research is therefore whether, for instance, pubertal onset is tied to these cognitive computations and expectations. In addition, studies have suggested that pubertal changes may initiate a cascade of neurobiological changes that influence learning and brain plasticity, including for instance dopamine functioning (Larsen & Luna, 2018). More research is needed to combine learning in stochastic and volatile environments to these developmental changes in hormonal and neurotransmitter functioning. Longitudinal studies, in particular, could be crucial in differentiating the effects of age versus puberty on learning in diverse uncertain situations.

Understanding the links between volatility, exploration, and noise in empirical studies

In decision making, noise refers to random fluctuations that can affect the accuracy and consistency of our judgments or actions, whereas exploration refers to the process of seeking out new information or options to improve our understanding of a situation, which would then be used to identify better courses of action. Across our reviewed studies, one of the more consistent developmental differences in computational model parameters was observed in the inverse temperature. Although historically considered to reflect exploration (Daw et al., 2006), this parameter could be interpreted as a form of exploration, as decision noise, and sometimes these accounts are difficult to distinguish.

Recent frameworks, such as those proposed by Gershman (2018) and Wilson et al. (2014), provide a more nuanced view on exploration and are promising for future developmental studies. For example, some decision contexts may call for directed exploration (e.g., when there is relative uncertainty, the more uncertain option may be favored). Other decision contexts call for random exploration (e.g., when the total uncertainty is high, not dependent on relative uncertainty) (Fan et al., 2022; Gershman, 2018; Tomov et al., 2020). When the outcomes of the options are volatile as opposed to only stochastic, this also leads to more exploration (Fan et al., 2022). Random exploration is thought to be stable across development, but interestingly, the strategic use of directed exploration has been suggested to emerge across adolescence (Somerville et al., 2017). This puts forward a promising hypothesis regarding age-related changes in goal-directed exploration and its interactions with outcome uncertainty, which can be targeted by using specific experimental paradigms and models (Fan et al., 2022; Tomov et al., 2020).

Another recent framework that would be interesting to test using a developmental perspective disentangles decision noise from computation noise (Findling et al., 2019). According to this framework, the variability in choice behavior that would be traditionally attributed to decision noise (or “exploration”) could be, to a large degree, explained by noise in the updating of the action values (i.e., computation noise; Findling et al., 2019; Findling & Wyart, 2021). An interesting feature of computation noise is that it increases with the magnitude of the prediction errors, particularly in volatile environments (Findling & Wyart, 2021). The potential benefits of increased computation noise in volatile environments could be to support the flexibility to adapt to unpredictable changes or balancing the cost of computation precision. It is a possibility that the increased choice stochasticity in adolescents that we observed in the studies reviewed also can be attributed to computation noise. However, no study to date has directly examined age-related changes in computation noise. This remains to be addressed in future studies.

One important point to consider is that several of the tasks analyzed in our review, including those that involve volatile task environments (e.g., implementing a mean-level shift in the reward structure), typically incorporate stochastic elements (Fig. 1; Supplementary Table S3). At the moment almost no developmental studies explicitly estimate stochasticity and volatility (but see Jepma et al., 2020). However, such an approach is important when the goal is to understand the mechanisms that give rise to behavioral differences when learning from outcomes that involve different types of uncertainty. For example, adolescents may be more prone to perceive volatility in environments in which outcomes are, in reality, only stochastic (Jepma et al., 2020). Recent models that explicitly estimate both stochasticity and volatility (Piray & Daw, 2021) can be combined with paradigms that manipulate stochasticity and volatility within the same individuals while keeping other task characteristics as similar as possible (Behrens et al., 2007). These experimental and methodological advances would allow us to examine more directly the developmental differences in learning under stochasticity and volatility.

Testing these frameworks also requires studies with larger samples or multiple studies using the same task and models. As mentioned in our interim summary and Table 1 (see also Supplementary Table S3 for reported means of parameter values), most studies include different paradigms with slightly different computational approaches, making it difficult to directly compare parameter values across studies. The issue of generalizability of computational approaches is clearly outlined in previous reviews (Eckstein et al., 2021; 2020). In brief, their findings show that in many cases, computational parameters cannot be directly compared between studies, because the processes that are captured depend on task characteristics, such as feedback valence, memory load, choice of parameters, volatility, and others. We therefore capitalized on a comparison of age (groups) within studies and subsequently summarized these findings for task environments that differ in volatility versus stochasticity. Also, differences between studies may occur due to variations in sample characteristics. Studies covering the period of adolescence have relied on different age ranges and cutoffs (Table 1), in which the youngest adolescents included were aged 10, 12, 13, or 14 years and the oldest were aged 14, 15, 17, 18, and 19 years. Finally, although not reported here, socioeconomic status, or education levels may differ between studies and/or age groups. For future studies, these sample characteristics are important to consistently report in the literature (Qu et al., 2021).

Individual differences in learning and decision-making with uncertainty

The developmental studies that were identified in this review largely ignored the subjective experience of uncertainty. Uncertainty is perceived to be threatening by most people and is associated with stress (de Berker et al., 2016; Grupe & Nitschke, 2013; Peters et al., 2017). Anxious individuals have been shown to have difficulties processing the cues in their environment to estimate the type of uncertainty and adjust their learning accordingly (Piray & Daw, 2021; Pulcu & Browning, 2019). For example, individuals with higher trait anxiety and transdiagnostic anxious and depressive symptomatology showed little difference in learning rates between volatile compared with stable (stochastic) environments, whereas optimal learners increase their learning rates (i.e., learn faster) in volatile environments (Browning, 2015; Gagne et al., 2020). According to a recent conceptual framework (Piray & Daw, 2021), learners simultaneously make inferences about the stochasticity and volatility in an environment, which are compensatory processes influencing the adjustment of learning rates. Within this framework, anxiety is suggested to be mainly associated with the maladaptive functioning of the processes involved in stochasticity inference such that anxious individuals assume higher volatility in environments that are actually stable but highly stochastic. Alternatively, anxiety might be related to reduced exploration and adaptation of exploratory behavior to volatility where exploring more might be beneficial (Fan et al., 2022; Lloyd et al., 2022). Anxiety and depressive symptoms are particularly relevant to include from a developmental perspective, as the onset of anxiety disorders and depression are most prevalent during adolescence (Blakemore, 2019; de Lijster et al., 2017; Kessler & Bromet, 2013; McLaughlin & King, 2015). To what extent uncertainty, and the mechanisms that may drive the experience of uncertainty, play a role in the development of mental health symptomatology is an important question for future developmental studies. A longitudinal perspective will be crucial to unravel who is at risk for developing mental health illnesses.

Uncertainty in social environments

Our review includes paradigms that examine learning in both social and nonsocial environments. However, most studies in our review use abstract paradigms, which necessitate individuals to learn a stimulus-outcome association based solely on their personal experiences without any social cues. Although learning through prediction errors can occur in both social and nonsocial contexts (Ruff & Fehr, 2014), learning in a social context may sometimes involve different strategies than learning in a nonsocial context (Hackel et al., 2020). Also, many of the uncertainties that adolescents learn to navigate in this phase of life stem from social interactions as they begin to interact with their environments as autonomous individuals, take on various social roles, and form new friendships and romantic relationships (Crockett & Crouter, 2014). Moreover, adolescence is a developmental phase in which social reorientation takes place such that importance of peers and salience of social information becomes more prominent (Crone & Dahl, 2012; Nelson et al., 2005, 2016). This supports the relevance of learning and decision-making in social contexts given that adolescents’ understanding of what their peers value and think should considerably weigh into their value estimations and influence their decisions (Pfeifer & Berkman, 2018). It is therefore important for adolescents to learn about and update their knowledge of the characteristics of other people (e.g., what and whom they like; Jones et al., 2014; Rosenblau et al., 2018, or how trustworthy they are; Ma et al., 2022b) and social groups (e.g., how cooperative or trustworthy they are; Westhoff et al., 2020). For building social ties, it is important to learn how consequences of our actions influence ourselves and others (e.g., whether our actions are harmful for others; Westhoff et al., 2021) or by observing others and benefiting from their experiences (Rodriguez Buritica et al., 2019). There have been efforts to address the importance of studying the uncertainty processing in social contexts during adolescence (Blankenstein et al., 2016; Hofmans & van den Bos, 2022; Ma et al., 2022b). For example, one study used computational models to examine uncertainty in social contexts directly and found that adolescents have weaker prior expectations about the social behavior of their peers, which resulted in faster learning about their peers (Ma et al., 2022b).

Additionally, it has been suggested that adolescents’ ability to adapt to volatile social environments may manifest in the increased variability of their moods (Gregorova et al., 2022). For example, their positive or negative mood may signal a general increase or decrease of social rewards in the adolescent environment, thereby facilitating quick adjustment to interactions with friendly (positive mood) or hostile (negative mood) others. However, in cases where one’s mood largely biases their learning or where one engages in suboptimal learning (e.g., estimating higher volatility in an environment with stable but stochastic outcomes), increased mood variability may pose a risk for mental health problems (Gregorova et al., 2022). Taken together, future research is needed to unpack how the uncertainty in adolescents’ social environments may provide rich and adaptive opportunities for learning.

Conclusions

The ability to tailor learning and decision-making under uncertainty is crucial for adaptive behavior, especially given that uncertainty is intrinsic to most real-life situations. In this review, we discussed different types of uncertainty, focusing mainly on two types of outcome uncertainty: stochasticity and volatility, as these have different influences on learning and decision-making. Taking a developmental approach, with a focus on adolescence as a period characterized by change and uncertainty, we summarized the recent findings from studies that compared different age groups in learning tasks that involved different types of uncertainty. While we observed that the findings were mixed, there were interesting consistencies in the age-related differences in model parameters and performance. The findings suggest that the development of learning under uncertainty might depend on the statistics of the environment and the type of uncertainty that the individual is exposed to. Interestingly, adolescents may have an advantage when learning from volatile outcomes. In contrast, adolescents’ more exploratory or noisy choice behavior seems a disadvantage when learning from stochastic outcomes in relatively stable contexts. This is possibly an adaptive response to the rather complex and continuously changing environments that adolescents encounter in real life. Future studies are needed to test this relationship more directly and expose mechanisms through which adolescents gain this advantage in learning. Together, these findings contribute to the understanding of adolescence as a sensitive period for learning in uncertain and dynamically changing environments.

References

Apps, M. A., & Sallet, J. (2017). Social learning in the medial prefrontal cortex. Trends in Cognitive Sciences, 21(3), 151–152. https://doi.org/10.1016/j.tics.2017.01.008
Arnett, J. J. (2000). Emerging adulthood: A theory of development from the late teens through the twenties. American Psychologist, 55(5), 469. https://doi.org/10.1037/0003-066X.55.5.469
Article PubMed Google Scholar
Behrens, T. E. J., Woolrich, M. W., Walton, M. E., & Rushworth, M. F. S. (2007). Learning the value of information in an uncertain world. Nature Neuroscience, 10(9), 1214–1221. https://doi.org/10.1038/nn1954
Article PubMed Google Scholar
Blakemore, S.-J. (2019). Adolescence and mental health. The Lancet, 393(10185), 2030–2031. https://doi.org/10.1016/S0140-6736(19)31013-X
Article Google Scholar
Bland, A. R., & Schaefer, A. (2012). Different varieties of uncertainty in human decision-making. Frontiers in Neuroscience, 6. https://doi.org/10.3389/fnins.2012.00085
Blankenstein, N. E., Crone, E. A., van den Bos, W., & van Duijvenvoorde, A. C. K. (2016). Dealing with uncertainty: Testing risk- and ambiguity-attitude across adolescence. Developmental Neuropsychology, 41(1–2), 77–92. https://doi.org/10.1080/87565641.2016.1158265
Article PubMed Google Scholar
Browning, M. (2015). Anxious individuals have difficulty learning the causal statistics of aversive environments. Nature Neuroscience, 18(4), 9.
Article Google Scholar
Bruckner, R., Nassar, M. R., Li, S.-C., & Eppinger, B. (2020). Default beliefs guide learning under uncertainty in children and older adults [preprint]. PsyArXiv. https://doi.org/10.31234/osf.io/we3ct
Bruckner, R., Heekeren, H. R., & Nassar, M. R. (2022). Understanding learning through uncertainty and bias [preprint]. PsyArXiv. https://doi.org/10.31234/osf.io/xjkbg
Chierchia, G., Soukupová, M., Kilford, E. J., Griffin, C., Leung, J., Palminteri, S., & Blakemore, S. J. (2022). Confirmatory reinforcement learning changes with age during adolescence. Developmental Science, e13330. https://doi.org/10.1111/desc.13330
Christakou, A., Gershman, S. J., Niv, Y., Simmons, A., Brammer, M., & Rubia, K. (2013). Neural and psychological maturation of decision-making in adolescence and young adulthood. Journal of Cognitive Neuroscience, 25(11), 1807–1823. https://doi.org/10.1162/jocn_a_00447
Article PubMed Google Scholar
Cohen, J. R., Asarnow, R. F., Sabb, F. W., Bilder, R. M., Bookheimer, S. Y., Knowlton, B. J., & Poldrack, R. A. (2010). A unique adolescent response to reward prediction errors. Nature Neuroscience, 13(6), 669–671. https://doi.org/10.1038/nn.2558
Article PubMed PubMed Central Google Scholar
Corlett, P. R., Mollick, J. A., & Kober, H. (2022). Meta-analysis of human prediction error for incentives, perception, cognition, and action. Neuropsychopharmacology, 47(7), 1339–1349. https://doi.org/10.1038/s41386-021-01264-3
Article PubMed PubMed Central Google Scholar
Crockett, L. J., & Crouter, A. C. (2014). Pathways through adolescence: Individual development in relation to social contexts. Psychology Press.
Book Google Scholar
Crone, E. A., & Dahl, R. E. (2012). Understanding adolescence as a period of social–affective engagement and goal flexibility. Nature Reviews Neuroscience, 13(9), 636–650. https://doi.org/10.1038/nrn3313
Article PubMed Google Scholar
d’Acremont, M., & Bossaerts, P. (2016). Neural mechanisms behind identification of leptokurtic noise and adaptive behavioral response. Cerebral Cortex, 26(4), 1818–1830. https://doi.org/10.1093/cercor/bhw013
Article PubMed PubMed Central Google Scholar
Davidow, J. Y., Foerde, K., Galván, A., & Shohamy, D. (2016). An upside to reward sensitivity: The hippocampus supports enhanced reinforcement learning in adolescence. Neuron, 92(1), 93–99. https://doi.org/10.1016/j.neuron.2016.08.031
Article PubMed Google Scholar
Daw, N. D., O’doherty, J. P., Dayan, P., Seymour, B., & Dolan, R. J. (2006). Cortical substrates for exploratory decisions in humans. Nature, 441(7095), 876–879. https://doi.org/10.1038/nature04766
Article PubMed PubMed Central Google Scholar
de Berker, A. O., Rutledge, R. B., Mathys, C., Marshall, L., Cross, G. F., Dolan, R. J., & Bestmann, S. (2016). Computations of uncertainty mediate acute stress responses in humans. Nature Communications, 7(1), 10996. https://doi.org/10.1038/ncomms10996
Article PubMed PubMed Central Google Scholar
Decker, J. H., Lourenco, F. S., Doll, B. B., & Hartley, C. A. (2015). Experiential reward learning outweighs instruction prior to adulthood. Cognitive, Affective, & Behavioral Neuroscience, 15(2), 310–320. https://doi.org/10.3758/s13415-014-0332-5
Article Google Scholar
de Lijster, J. M., Dierckx, B., Utens, E. M. W. J., Verhulst, F. C., Zieldorff, C., Dieleman, G. C., & Legerstee, J. S. (2017). The age of onset of anxiety disorders: A meta-analysis. The Canadian Journal of Psychiatry, 62(4), 237–246. https://doi.org/10.1177/0706743716640757
Article PubMed Google Scholar
Denrell, J. (2005). Why Most people disapprove of me: Experience sampling in impression formation. Psychological Review, 112, 951–978. https://doi.org/10.1037/0033-295X.112.4.951
Article PubMed Google Scholar
Eckstein, M. K., Wilbrecht, L., & Collins, A. G. E. (2021). What do reinforcement learning models measure? Interpreting model parameters in cognition and neuroscience. Current Opinion in Behavioral Sciences, 41, 128–137. https://doi.org/10.1016/j.cobeha.2021.06.004
Eckstein, M. K., Master, S. L., Dahl, R. E., Wilbrecht, L., & Collins, A. G. E. (2022). Reinforcement learning and Bayesian inference provide complementary models for the unique advantage of adolescents in stochastic reversal. Developmental Cognitive Neuroscience, 55, 101106. https://doi.org/10.1016/j.dcn.2022.101106
Article PubMed PubMed Central Google Scholar
Fan, H., Gershman, S. J., & Phelps, E. A. (2022). Trait somatic anxiety is associated with reduced directed exploration and underestimation of uncertainty. Nature Human Behaviour, 1-12. https://doi.org/10.1038/s41562-022-01455-y
Findling, C., Skvortsova, V., Dromnelle, R., Palminteri, S., & Wyart, V. (2019). Computational noise in reward-guided learning drives behavioral variability in volatile environments. Nature Neuroscience, 22(12), 2066–2077. https://doi.org/10.1038/s41593-019-0518-9
Article PubMed Google Scholar
Findling, C., & Wyart, V. (2021). Computation noise in human learning and decision-making origin, impact, function. Current Opinion in Behavioral Sciences, 38, 124–132. https://doi.org/10.1016/j.cobeha.2021.02.018
Article Google Scholar
Frank, M. J., Seeberger, L. C., & O’Reilly, R. C. (2004). By carrot or by stick: Cognitive reinforcement learning in parkinsonism. Science, 306(5703), 1940–1943. https://doi.org/10.1126/science.1102941
Article PubMed Google Scholar
Gagne, C., Zika, O., Dayan, P., & Bishop, S. J. (2020). Impaired adaptation of learning to contingency volatility in internalizing psychopathology. ELife, 9, e61387. https://doi.org/10.7554/eLife.61387
Article PubMed PubMed Central Google Scholar
Garrison, J., Erdeniz, B., & Done, J. (2013). Prediction error in reinforcement learning: A meta-analysis of neuroimaging studies. Neuroscience & Biobehavioral Reviews, 37(7), 1297–1310. https://doi.org/10.1016/j.neubiorev.2013.03.023
Article Google Scholar
Gershman, S. J. (2018). Deconstructing the human algorithms for exploration. Cognition, 173, 34–42. https://doi.org/10.1016/j.cognition.2017.12.014
Article PubMed Google Scholar
Gregorova, K., Eldar, E., Deserno, L., & Reiter, A. M. (2022). A cognitive-computational account of mood swings in adolescence [preprint]. PsyArXiv. https://doi.org/10.31234/osf.io/u62vr
Grupe, D. W., & Nitschke, J. B. (2013). Uncertainty and anticipation in anxiety: An integrated neurobiological and psychological perspective. Nature Reviews Neuroscience, 14(7), 488–501. https://doi.org/10.1038/nrn3524
Article PubMed PubMed Central Google Scholar
Greimel, E., Bakos, S., Landes, I., Töllner, T., Bartling, J., Kohls, G., & Schulte-Körne, G. (2018). Sex differences in the neural underpinnings of social and monetary incentive processing during adolescence. Cognitive, Affective, & Behavioral Neuroscience, 18(2), 296–312. https://doi.org/10.3758/s13415-018-0570-z
Article Google Scholar
Haber, S. N., & Knutson, B. (2010). The reward circuit: Linking primate anatomy and human imaging. Neuropsychopharmacology, 35(1), 4–26. https://doi.org/10.1038/npp.2009.129
Article PubMed Google Scholar
Hackel, L. M., Mende-Siedlecki, P., & Amodio, D. M. (2020). Reinforcement learning in social interaction: The distinguishing role of trait inference. Journal of Experimental Social Psychology, 88, 103948. https://doi.org/10.1016/j.jesp.2019.103948
Article Google Scholar
Hämmerer, D., Li, S.-C., Müller, V., & Lindenberger, U. (2011). Life span differences in electrophysiological correlates of monitoring gains and losses during probabilistic reinforcement learning. Journal of Cognitive Neuroscience, 23(3), 579–592. https://doi.org/10.1162/jocn.2010.21475
Article PubMed Google Scholar
Hartley, C. A., & Somerville, L. H. (2015). The neuroscience of adolescent decision-making. Current Opinion in Behavioral Sciences, 5, 108–115. https://doi.org/10.1016/j.cobeha.2015.09.004
Hauser, T. U., Iannaccone, R., Walitza, S., Brandeis, D., & Brem, S. (2015). Cognitive flexibility in adolescence: Neural and behavioral mechanisms of reward prediction error processing in adaptive decision making during development. NeuroImage, 104, 347–354. https://doi.org/10.1016/j.neuroimage.2014.09.018
Article PubMed Google Scholar
Hofmans, L., & van den Bos, W. (2022). Social learning across adolescence: A Bayesian neurocognitive perspective. Developmental Cognitive Neuroscience, 58, 101151. https://doi.org/10.1016/j.dcn.2022.101151
Article PubMed PubMed Central Google Scholar
Huettel, S. A., Stowe, C. J., Gordon, E. M., Warner, B. T., & Platt, M. L. (2006). Neural signatures of economic preferences for risk and ambiguity. Neuron, 49(5), 765–775. https://doi.org/10.1016/j.neuron.2006.01.024
Article PubMed Google Scholar
Humphreys, K. L., Telzer, E. H., Flannery, J., Goff, B., Gabard-Durnam, L., Gee, D. G., Lee, S. S., & Tottenham, N. (2016). Risky decision making from childhood through adulthood: Contributions of learning and sensitivity to negative feedback. Emotion, 16(1), 101–109. https://doi.org/10.1037/emo0000116
Javadi, A. H., Schmidt, D. H. K., & Smolka, M. N. (2014). Adolescents adapt more slowly than adults to varying reward contingencies. Journal of Cognitive Neuroscience, 26(12), 2670–2681. https://doi.org/10.1162/jocn_a_00677
Article PubMed PubMed Central Google Scholar
Jaworska, N., & MacQueen, G. (2015). Adolescence as a unique developmental period. Journal of Psychiatry & Neuroscience: JPN, 40(5), 291.
Article Google Scholar
Jepma, M., Schaaf, J. V., Visser, I., & Huizenga, H. M. (2020). Uncertainty-driven regulation of learning and exploration in adolescents: A computational account. PLoS Computational Biology, 16(9), 1–29. https://doi.org/10.1371/journal.pcbi.1008276
Article Google Scholar
Jepma, M., Schaaf, J. V., Visser, I., & Huizenga, H. M. (2022). Impaired learning to dissociate advantageous and disadvantageous risky choices in adolescents. Scientific Reports, 12(1), 1–14. https://doi.org/10.1038/s41598-022-10100-7
Article Google Scholar
Jones, R. M., Somerville, L. H., Li, J., Ruberry, E. J., Powers, A., Mehta, N., Dyke., J., & Casey, B. J. (2014). Adolescent-specific patterns of behavior and neural activity during social reinforcement learning. Cognitive, Affective, & Behavioral Neuroscience, 14(2), 683–697. https://doi.org/10.3758/s13415-014-0257-z
Kao, C.-H., Khambhati, A. N., Bassett, D. S., Nassar, M. R., McGuire, J. T., Gold, J. I., & Kable, J. W. (2020). Functional brain network reconfiguration during learning in a dynamic environment. Nature Communications, 11(1), 1682. https://doi.org/10.1038/s41467-020-15442-2
Article PubMed PubMed Central Google Scholar
Kessler, R. C., & Bromet, E. J. (2013). The epidemiology of depression across cultures. Annual Review of Public Health, 34(1), 119–138. https://doi.org/10.1146/annurev-publhealth-031912-114409
Article PubMed PubMed Central Google Scholar
Klimstra, T. A., Hale, W. W., III, Raaijmakers, Q. A. W., Branje, S. J. T., & Meeus, W. H. J. (2010). Identity formation in adolescence: Change or stability? Journal of Youth and Adolescence, 39(2), 150–162. https://doi.org/10.1007/s10964-009-9401-4
Article PubMed Google Scholar
LaFontana, K. M., & Cillessen, A. H. N. (2010). Developmental changes in the priority of perceived status in childhood and adolescence. Social Development, 19(1), 130–147. https://doi.org/10.1111/j.1467-9507.2008.00522.x
Article Google Scholar
Larsen, B., Olafsson, V., Calabro, F., Laymon, C., Tervo-Clemmens, B., Campbell, E., & Luna, B. (2020). Maturation of the human striatal dopamine system revealed by PET and quantitative MRI. Nature Communications, 11(1), 846. https://doi.org/10.1038/s41467-020-14693-3
Article PubMed PubMed Central Google Scholar
Larsen, B., & Luna, B. (2018). Adolescence as a neurobiological critical period for the development of higher-order cognition. Neuroscience & Biobehavioral Reviews, 94, 179–195. https://doi.org/10.1016/j.neubiorev.2018.09.005
Article Google Scholar
Laube, C., & van den Bos, W. (2016). Hormones and affect in adolescent decision making. In Recent developments in neuroscience research on human motivation (Vol. 19, pp. 259–281). Emerald Group Publishing Limited. https://doi.org/10.1108/S0749-742320160000019013
Chapter Google Scholar
Lin, W. C., & Wilbrecht, L. (2022). Making sense of strengths and weaknesses observed in adolescent laboratory rodents. Current Opinion in Psychology, 45, 101297. https://doi.org/10.1016/j.copsyc.2021.12.009
Article PubMed Google Scholar
Lloyd, A., McKay, R., & Furl, N. (2022). Stochastic decisions support optimal foraging of volatile environments, and are disrupted by anxiety [preprint]. PsyArXiv. https://doi.org/10.31234/osf.io/ty6j2
Lockwood, P. L., Apps, M. A. J., & Chang, S. W. C. (2020). Is there a ‘social’ brain? Implementations and algorithms. Trends in Cognitive Sciences, 24(10), 802–813. https://doi.org/10.1016/j.tics.2020.06.011
Article PubMed PubMed Central Google Scholar
Ma, W., Goldreich, D., & Kording, K. P. (2022a). Bayesian modeling of perception and action, text book, under contract. Oxford University Press.
Google Scholar
Ma, I., Westhoff, B., & van Duijvenvoorde, A. C. K. (2022b). Uncertainty about others’ trustworthiness increases during adolescence and guides social information sampling. Scientific Reports, 12(1), 7634. https://doi.org/10.1038/s41598-022-09477-2
Maciejewski, D. F., Keijsers, L., van Lier, P. A. C., Branje, S. J. T., Meeus, W. H. J., & Koot, H. M. (2019). Most fare well—But some do not: Distinct profiles of mood variability development and their association with adjustment during adolescence. Developmental Psychology, 55, 434–448. https://doi.org/10.1037/dev0000650
Article PubMed Google Scholar
Marshall, L., Mathys, C., Ruge, D., Berker, A. O. de, Dayan, P., Stephan, K. E., & Bestmann, S. (2016). Pharmacological fingerprints of contextual uncertainty. PLoS Biology, 14(11), e1002575. https://doi.org/10.1371/journal.pbio.1002575
Martinez-Saito, M., & Gorina, E. (2022). Learning under social versus nonsocial uncertainty: A meta-analytic approach. Human Brain Mapping, 43(13), 4185–4206. https://doi.org/10.1002/hbm.25948
Article PubMed PubMed Central Google Scholar
McLaughlin, K. A., & King, K. (2015). Developmental trajectories of anxiety and depression in early adolescence. Journal of Abnormal Child Psychology, 43(2), 311–323. https://doi.org/10.1007/s10802-014-9898-1
Article PubMed PubMed Central Google Scholar
Nassar, M. R., & Frank, M. J. (2016). Taming the beast: extracting generalizable knowledge from computational models of cognition. Current Opinion in Behavioral Sciences, 11, 49–54. https://doi.org/10.1016/j.cobeha.2016.04.003
Nassar, M. R., Bruckner, R., Gold, J. I., Li, S.-C., Heekeren, H. R., & Eppinger, B. (2016). Age differences in learning emerge from an insufficient representation of uncertainty in older adults. Nature Communications, 7(1), 11609. https://doi.org/10.1038/ncomms11609
Article PubMed PubMed Central Google Scholar
Nelson, E. E., Jarcho, J. M., & Guyer, A. E. (2016). Social re-orientation and brain development: An expanded and updated view. Developmental Cognitive Neuroscience, 17, 118–127. https://doi.org/10.1016/j.dcn.2015.12.008
Article PubMed Google Scholar
Nelson, E. E., Leibenluft, E., McCLURE, E. B., & Pine, D. S. (2005). The social re-orientation of adolescence: A neuroscience perspective on the process and its relation to psychopathology. Psychological Medicine, 35(2), 163–174. https://doi.org/10.1017/S0033291704003915
Article PubMed Google Scholar
Nussenbaum, K., & Hartley, C. A. (2019). Reinforcement learning across development: What insights can we draw from a decade of research? Developmental Cognitive Neuroscience, 40, 100733. https://doi.org/10.1016/j.dcn.2019.100733
Article PubMed PubMed Central Google Scholar
Nussenbaum, K., Velez, J. A., Washington, B. T., Hamling, H. E., & Hartley, C. A. (2022). Flexibility in valenced reinforcement learning computations across development. Child Development, cdev.13791. https://doi.org/10.1111/cdev.13791
Palminteri, S., Kilford, E. J., Coricelli, G., & Blakemore, S.-J. (2016). The computational development of reinforcement learning during adolescence. PLoS Computational Biology, 12(6), e1004953. https://doi.org/10.1371/journal.pcbi.1004953
Article PubMed PubMed Central Google Scholar
Payzan-LeNestour, E., Dunne, S., Bossaerts, P., & O’Doherty, J. P. (2013). The neural representation of unexpected uncertainty during value-based decision making. Neuron, 79(1), 191–201. https://doi.org/10.1016/j.neuron.2013.04.037
Article PubMed PubMed Central Google Scholar
Peters, A., McEwen, B. S., & Friston, K. (2017). Uncertainty and stress: Why it causes diseases and how it is mastered by the brain. Progress in Neurobiology, 156, 164–188. https://doi.org/10.1016/j.pneurobio.2017.05.004
Article PubMed Google Scholar
Pfeifer, J. H., & Berkman, E. T. (2018). The development of self and identity in adolescence: Neural evidence and implications for a value-based choice perspective on motivated behavior. Child Development Perspectives, 12(3), 158–164. https://doi.org/10.1111/cdep.12279
Article PubMed PubMed Central Google Scholar
Piekarski, D. J., Boivin, J. R., & Wilbrecht, L. (2017). Ovarian hormones organize the maturation of inhibitory neurotransmission in the frontal cortex at puberty onset in female mice. Current Biology, 27(12), 1735–1745.e3. https://doi.org/10.1016/j.cub.2017.05.027
Article PubMed Google Scholar
Piray, P., & Daw, N. D. (2021). A model for learning based on the joint estimation of stochasticity and volatility. Nature Communications, 12(1), 6587. https://doi.org/10.1038/s41467-021-26731-9
Article PubMed PubMed Central Google Scholar
Pulcu, E., & Browning, M. (2019). The Misestimation of uncertainty in affective disorders. Trends in Cognitive Sciences, 23(10), 865–875. https://doi.org/10.1016/j.tics.2019.07.007
Article PubMed Google Scholar
Qu, Y., Jorgensen, N. A., & Telzer, E. H. (2021). A call for greater attention to culture in the study of brain and development. Perspectives on Psychological Science : A Journal of the Association for Psychological Science, 16(2), 275–293. https://doi.org/10.1177/1745691620931461
Raab, H. A., & Hartley, C. A. (2020). Adolescents exhibit reduced Pavlovian biases on instrumental learning. Scientific Reports, 10(1), 15770. https://doi.org/10.1038/s41598-020-72628-w
Article PubMed PubMed Central Google Scholar
Rodriguez Buritica, J. M., Heekeren, H. R., & van den Bos, W. (2019). The computational basis of following advice in adolescents. Journal of Experimental Child Psychology, 180, 39–54. https://doi.org/10.1016/j.jecp.2018.11.019
Article PubMed PubMed Central Google Scholar
Romer, D., Reyna, V. F., & Satterthwaite, T. D. (2017). Beyond stereotypes of adolescent risk taking: Placing the adolescent brain in developmental context. Developmental Cognitive Neuroscience, 27, 19–34. https://doi.org/10.1016/j.dcn.2017.07.007
Article PubMed PubMed Central Google Scholar
Rosenblau, G., Korn, C. W., & Pelphrey, K. A. (2018). A computational account of optimizing social predictions reveals that adolescents are conservative learners in social contexts. The Journal of Neuroscience, 38(4), 974–988. https://doi.org/10.1523/JNEUROSCI.1044-17.2017
Article PubMed PubMed Central Google Scholar
Ruff, C. C., & Fehr, E. (2014). The neurobiology of rewards and values in social decision making. Nature Reviews Neuroscience, 15(8), 549–562. https://doi.org/10.1038/nrn3776
Article PubMed Google Scholar
Sawyer, S. M., Azzopardi, P. S., Wickremarathne, D., & Patton, G. C. (2018). The age of adolescence. The Lancet Child & Adolescent Health, 2(3), 223–228. https://doi.org/10.1016/S2352-4642(18)30022-1
Schultz, W. (2007). Behavioral dopamine signals. Trends in Neurosciences, 30(5), 203–210. https://doi.org/10.1016/j.tins.2007.03.007
Article PubMed Google Scholar
Schultz, W. (2015). Neuronal reward and decision signals: From theories to data. Physiological Reviews, 95(3), 853–951. https://doi.org/10.1152/physrev.00023.2014
Article PubMed PubMed Central Google Scholar
Schultz, W., Dayan, P., & Montague, P. R. (1997). A neural substrate of prediction and reward. Science, 275(5306), 1593–1599. https://doi.org/10.1126/science.275.5306.1593
Article PubMed Google Scholar
Schulz, K. M., & Sisk, C. L. (2016). The organizing actions of adolescent gonadal steroid hormones on brain and behavioral development. Neuroscience & Biobehavioral Reviews, 70, 148–158. https://doi.org/10.1016/j.neubiorev.2016.07.036
Article Google Scholar
Sherman, L. E., Greenfield, P. M., Hernandez, L. M., & Dapretto, M. (2018). Peer influence via Instagram: Effects on brain and behavior in adolescence and young adulthood. Child Development, 89(1), 37–47. https://doi.org/10.1111/cdev.12838
Article PubMed Google Scholar
Sherman, L. E., Payton, A. A., Hernandez, L. M., Greenfield, P. M., & Dapretto, M. (2016). The power of the like in adolescence: Effects of peer influence on neural and behavioral responses to social media. Psychological Science, 27(7), 1027–1035. https://doi.org/10.1177/0956797616645673
Article PubMed PubMed Central Google Scholar
Silverman, M. H., Jedd, K., & Luciana, M. (2015). Neural networks involved in adolescent reward processing: An activation likelihood estimation meta-analysis of functional neuroimaging studies. NeuroImage, 122, 427–439. https://doi.org/10.1016/j.neuroimage.2015.07.083
Article PubMed Google Scholar
Smith, D. G., Xiao, L., & Bechara, A. (2012). Decision making in children and adolescents: Impaired Iowa gambling task performance in early adolescence. Developmental Psychology, 48(4), 1180–1187. https://doi.org/10.1037/a0026342
Article PubMed Google Scholar
Snyder, H. (2019). Literature review as a research methodology: An overview and guidelines. Journal of Business Research, 104, 333–339. https://doi.org/10.1016/j.jbusres.2019.07.039
Article Google Scholar
Soltani, A., & Izquierdo, A. (2019). Adaptive learning under expected and unexpected uncertainty. Nature Reviews Neuroscience, 20(10), 635–644. https://doi.org/10.1038/s41583-019-0180-y
Article PubMed PubMed Central Google Scholar
Somerville, L. H., Sasse, S. F., Garrad, M. C., Drysdale, A. T., Abi Akar, N., Insel, C., & Wilson, R. C. (2017). Charting the expansion of strategic exploratory behavior during adolescence. Journal of Experimental Psychology: General, 146(2), 155. https://doi.org/10.1037/xge0000250
Article PubMed Google Scholar
Suleiman, A. B., Galván, A., Harden, K. P., & Dahl, R. E. (2017). Becoming a sexual being: The ‘elephant in the room’ of adolescent brain development. Developmental Cognitive Neuroscience, 25, 209–220. https://doi.org/10.1016/j.dcn.2016.09.004
Article PubMed Google Scholar
Tamnes, C. K., Herting, M. M., Goddings, A.-L., Meuwese, R., Blakemore, S.-J., Dahl, R. E., Güroğlu, B., Raznahan, A., Sowell, E. R., Crone, E. A., & Mills, K. L. (2017). Development of the cerebral cortex across adolescence: A multisample study of inter-related longitudinal changes in cortical volume, surface area, and thickness. Journal of Neuroscience, 37(12), 3402–3412. https://doi.org/10.1523/JNEUROSCI.3302-16.2017
Tomov, M. S., Truong, V. Q., Hundia, R. A., & Gershman, S. J. (2020). Dissociable neural correlates of uncertainty underlie different exploration strategies. Nature Communications, 11(1), 2371. https://doi.org/10.1038/s41467-020-15766-z
Article PubMed PubMed Central Google Scholar
van den Bos, W., Cohen, M. X., Kahnt, T., & Crone, E. A. (2012). Striatum–Medial Prefrontal Cortex Connectivity Predicts Developmental Changes in Reinforcement Learning. Cerebral Cortex, 22(6), 1247–1255. https://doi.org/10.1093/cercor/bhr198
van der Schaaf, M. E., Warmerdam, E., Crone, E. A., & Cools, R. (2011). Distinct linear and non-linear trajectories of reward and punishment reversal learning during development: Relevance for dopamine’s role in adolescent decision making. Developmental Cognitive Neuroscience, 1(4), 578–590. https://doi.org/10.1016/j.dcn.2011.06.007
Article PubMed PubMed Central Google Scholar
Wahlstrom, D., Collins, P., White, T., & Luciana, M. (2010). Developmental changes in dopamine neurotransmission in adolescence: Behavioral implications and issues in assessment. Brain and Cognition, 72(1), 146–159. https://doi.org/10.1016/j.bandc.2009.10.013
Article PubMed Google Scholar
Waltmann, M., Herzog, N., Reiter, A. M., Villringer, A., Horstmann, A., & Deserno, L. (2023). Diminished reinforcement sensitivity in adolescence is associated with enhanced response switching and reduced coding of choice probability in the medial frontal pole. Developmental Cognitive Neuroscience, 101226. https://doi.org/10.1016/j.dcn.2023.101226
Weiss, E. O., Kruppa, J. A., Fink, G. R., Herpertz-Dahlmann, B., Konrad, K., & Schulte-Rüther, M. (2021). Developmental differences in probabilistic reversal learning: A computational modeling approach. Frontiers in Neuroscience, 14, 536596. https://doi.org/10.3389/fnins.2020.536596
Article PubMed PubMed Central Google Scholar
Westhoff, B., Blankenstein, N. E., Schreuders, E., Crone, E. A., & van Duijvenvoorde, A. C. K. (2021). Increased ventromedial prefrontal cortex activity in adolescence benefits prosocial reinforcement learning. Developmental Cognitive Neuroscience, 52, 101018. https://doi.org/10.1016/j.dcn.2021.101018
Article PubMed PubMed Central Google Scholar
Westhoff, B., Molleman, L., Viding, E., van den Bos, W., & van Duijvenvoorde, A. C. K. (2020). Developmental asymmetries in learning to adjust to cooperative and uncooperative environments. Scientific Reports, 10(1), 21761. https://doi.org/10.1038/s41598-020-78546-1
Article PubMed PubMed Central Google Scholar
Wilson, R. C., Geana, A., White, J. M., Ludvig, E. A., & Cohen, J. D. (2014). Humans use directed and random exploration to solve the explore-exploit dilemma. Journal of Experimental Psychology. General, 143(6), 2074–2081. https://doi.org/10.1037/a0038199
Article PubMed PubMed Central Google Scholar
Xia, L., Master, S. L., Eckstein, M. K., Baribault, B., Dahl, R. E., Wilbrecht, L., & Collins, A. G. E. (2021). Modeling changes in probabilistic reinforcement learning during adolescence. PLoS Computational Biology, 17(7), e1008524. https://doi.org/10.1371/journal.pcbi.1008524
Article PubMed PubMed Central Google Scholar
Yu, A. J., & Dayan, P. (2005). Uncertainty, neuromodulation, and attention. Neuron, 46(4), 681–692. https://doi.org/10.1016/j.neuron.2005.04.026
Article PubMed Google Scholar

Download references

Acknowledgments

ST, JS, HvS, EdB, and AvD are supported by the interdisciplinary research program “Social Resilience and Security” of Leiden University. IM is supported by the National Science Foundation, award ID 2021060. The authors thank Franz Wurm for valuable discussions.

Author information

Authors and Affiliations

Leiden University, Institute of Psychology, Wassenaarseweg 52, 2333, AK, Leiden, The Netherlands
Selin Topel, Ili Ma, Jan Sleutels, Henk van Steenbergen, Ellen R. A. de Bruijn & Anna C. K. van Duijvenvoorde
Leiden Institute for Brain and Cognition, Leiden, The Netherlands
Selin Topel, Ili Ma, Henk van Steenbergen, Ellen R. A. de Bruijn & Anna C. K. van Duijvenvoorde
Leiden University, Institute for Philosophy, Leiden, The Netherlands
Jan Sleutels

Authors

Selin Topel
View author publications
You can also search for this author in PubMed Google Scholar
Ili Ma
View author publications
You can also search for this author in PubMed Google Scholar
Jan Sleutels
View author publications
You can also search for this author in PubMed Google Scholar
Henk van Steenbergen
View author publications
You can also search for this author in PubMed Google Scholar
Ellen R. A. de Bruijn
View author publications
You can also search for this author in PubMed Google Scholar
Anna C. K. van Duijvenvoorde
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

ST: conceptualization, investigation, visualization, writing – original draft and writing, writing – review & editing; IM: writing – original draft and writing, writing – review & editing; HvS: conceptualization, writing – original draft and writing, writing – review & editing; JS: writing – review & editing; EB: conceptualization, writing – original draft and writing, writing – review & editing; AvD: conceptualization, writing – original draft and writing, writing – review & editing

Corresponding author

Correspondence to Selin Topel.

Ethics declarations

Conflicts of interest

The authors have no affiliations with or involvement in any organization or entity with any financial interest or nonfinancial interest in the subject matter or materials discussed in this manuscript.

Additional information

Open practices statements

Not applicable

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

ESM 1

(DOCX 207 kb)

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Topel, S., Ma, I., Sleutels, J. et al. Expecting the unexpected: a review of learning under uncertainty across development. Cogn Affect Behav Neurosci 23, 718–738 (2023). https://doi.org/10.3758/s13415-023-01098-0

Download citation

Accepted: 28 March 2023
Published: 26 May 2023
Issue Date: June 2023
DOI: https://doi.org/10.3758/s13415-023-01098-0

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Expecting the unexpected: a review of learning under uncertainty across development

Abstract

Similar content being viewed by others

Uncertainty in learning and decision-making: Introduction to the special issue

The stability flexibility tradeoff and the dark side of detail

Developmental changes in exploration resemble stochastic optimization

Introduction

Different types of uncertainty: Stochasticity and volatility

Neural and computational mechanisms of learning under uncertainty

Semi-systematic review approach

Results

Development of learning from stochastic outcomes

Results: Development of learning from volatile outcomes

Interim summary: Comparing the development of learning from stochastic to volatile outcomes

Discussion and future directions

Mechanisms underlying adolescent learning and decision-making under different types of uncertainty

Understanding the links between volatility, exploration, and noise in empirical studies

Individual differences in learning and decision-making with uncertainty

Uncertainty in social environments

Conclusions

References

Acknowledgments

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflicts of interest

Additional information

Open practices statements

Publisher’s note

Supplementary Information

ESM 1

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Expecting the unexpected: a review of learning under uncertainty across development

Abstract

Similar content being viewed by others

Uncertainty in learning and decision-making: Introduction to the special issue

The stability flexibility tradeoff and the dark side of detail

Developmental changes in exploration resemble stochastic optimization

Introduction

Different types of uncertainty: Stochasticity and volatility

Neural and computational mechanisms of learning under uncertainty

Semi-systematic review approach

Results

Development of learning from stochastic outcomes

Results: Development of learning from volatile outcomes

Interim summary: Comparing the development of learning from stochastic to volatile outcomes

Discussion and future directions

Mechanisms underlying adolescent learning and decision-making under different types of uncertainty

Understanding the links between volatility, exploration, and noise in empirical studies

Individual differences in learning and decision-making with uncertainty

Uncertainty in social environments

Conclusions

References

Acknowledgments

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflicts of interest

Additional information

Open practices statements

Publisher’s note

Supplementary Information

ESM 1

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation