The Striatum and Decision-Making Based on Value

Graybiel, Ann M.

doi:10.1007/978-3-319-28802-4_6

Ann M. Graybiel^3,4

Part of the book series: Research and Perspectives in Neurosciences ((NEUROSCIENCE))

7625 Accesses
1 Citations
6 Altmetric

Abstract

Our behaviors range from mindful, deliberative streams of action to sequences of action that are so nearly automatic that we can perform them almost without thinking. Transitions between these modes of behavior occur as we learn behavioral routines. We have studied these transitions and the neural activity that occurs in corticostriatal loops as they take place. We find that neural activity in these loops is strongly modified during habit learning and that specific corticostriatal circuits can powerfully control value-based decision-making and habits.

You have full access to this open access chapter, Download chapter PDF

Neurophysiology of Reward-Guided Behavior: Correlates Related to Predictions, Value, Motivation, Errors, Attention, and Action

Prefrontal mechanisms combining rewards and beliefs in human decision-making

Article Open access 17 January 2019

Frontal cortex function as derived from hierarchical predictive coding

Article Open access 01 March 2018

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

As we move about and act in our environment, the brain constantly updates not only our physical position and the moment-to-moment stimuli around us, but also updates the value of the actions that we perform. How these values are attached to our behaviors is still incompletely understood.

In our laboratory, we have approached this issue by teaching animals to perform simple habits, capitalizing on much evidence that, at first, behaviors that are candidate habits are sensitive to reinforcement, but later they become nearly independent of whether or not the performance of the behavior is reinforced.

We have found that as this behavioral transition occurs, the spike activity and local field potential activity recorded in the prefrontal cortex and striatum are also transformed (Jog et al. 1999; Barnes et al. 2005; Thorn et al. 2010; Smith and Graybiel 2013). In typical experiments, we have taught rodents to run in simple T-mazes, with cues indicating to them whether to turn left or right to receive a food reward. The neural activity in regions known to be necessary for habit formation gradually shifts: early on, the population activity in the sensorimotor part of the striatum is high during the full time of the maze runs, but later during the learning process, the population activity becomes concentrated at the action points of the runs, especially the beginning and end of the runs. As the behavior of the animals becomes fully habitual through extensive training (called ‘over-training’) on the task, this beginning-and-end bracketing pattern becomes nearly fixed within the sensorimotor striatum. A quite similar bracketing pattern later develops in the prefrontal cortex, but it remains sensitive to reinforcement; if rewards are made unpalatable, then the animals cease the habitual runs and the cortical bracketing activity pattern becomes degraded.

We then found that we could block already formed habits and even toggle the habit off and on by optogenetically suppressing this prefrontal cortical activity (Smith et al. 2012). Comparable optogenetic inhibition of the same small prefrontal cortical zone could block the formation of habits altogether when the optogenetic inhibition was applied during the over-training period (Smith and Graybiel 2013).

These experiments raise the possibility that neural circuits involving the medial prefrontal cortex can evaluate whether actions are beneficial and should be allowed to be performed. The fact that this apparent control is effective even for behaviors that seem to be nearly fully automatic suggests that there is on-line, value-related control of behavior.

This potential was vividly seen in other experiments in which we blocked compulsive grooming behavior in a mouse model of obsessive-compulsive disorder by manipulating an orbitofrontal corticostriatal circuit (Burguiere et al. 2013). In these experiments, we could block a conditioned compulsion by intervening either at the level of the cortex or at the level of the medial striatum. Therefore, the control was exerted by a corticostriatal circuit.

In a new set of experiments, we have asked whether we can identify critical corticostriatal circuits that operate in these deliberative or repetitive decisions. We focused on a circuit that is thought to lead from localized zones in the prefrontal cortex to striosomes. These are dispersed zones within the striatum that can access the dopamine-containing neurons of the midbrain (Crittenden and Graybiel 2011; Fujiyama et al. 2011; Watabe-Uchida et al. 2012). We mimicked a situation often faced in everyday life, in which we can acquire something, but only at a cost. In this situation, costs as well as benefits have to be weighed. We used decision-making tasks in which animals were required to choose an action sequence in response to cues indicating that mixtures of rewarding and annoying reinforcers could either be accepted or be rejected. This design meant that the animals could reject an offer, but then they would miss out on the reward coupled to the cost.

This kind of decision-making, given the name ‘approach-avoidance decision-making,’ has been studied extensively in human subjects, particularly in relation to distinguishing between anxiety and depression in affected individuals who face conflicting motivations to approach and to avoid. We thus were attempting to target forms of decision-making that, in humans, involve value-based estimates of the future.

In initial studies, Dr. Ken-ichi Amemori and I focused on the pregenual anterior cingulate cortex in macaque monkeys (Amemori and Graybiel 2012), which earlier work had shown to project preferentially to striosomes in the head of the caudate nucleus (Eblen and Graybiel 1995). There, many neurons increased their activity during the decision period, either when the monkey would subsequently choose an approach response (accepting the good and bad symbolized by cues on a computer screen) or when the monkey would subsequently reject the offer. In one localized pregenual region, the avoidance-related neurons outnumbered the approach-related neurons. At other sites, similar numbers of these two classes were recorded. Microstimulation applied during the decision period had little or no effect on the decisions at most sites, but in the regions matching the sites with predominance of avoidance-related neurons, the microstimulation induced significant increases in avoidance. We found that treatment with the anxiolytic diazepam could block the microstimulation effects. Notably, we found no effects of the microstimulation in a control ‘approach-approach’ task in which both offered options were good.

In subsequent, still-ongoing experiments, Ken-ichi Amemori, Satoko Amemori and I are determining whether, as initial results suggest, the ‘hot-spot’ for pessimistic decision-making preferentially projects to striosomes (Amemori et al. in preparation). If so, these experimental findings would squarely place the corticostriatal system interacting with striosomes as part of the circuitry underpinning decision-making in which conflicting motivations must be handled.

With the technical opportunities presented by work in rodents, we returned to T-maze experiments, but this time introduced costs and benefits at each end-arm of the mazes. In work spearheaded by Alexander Friedman, Daigo Homma, and Leif Gibb, with Ken-ichi Amemori and others, we found striking evidence for a selective functional engagement of a striosome-targeting prefrontal circuit (Friedman et al. 2015). The evidence rests on the use of multiple decision-making tasks, presenting cost-benefit, benefit-benefit, reverse cost-benefit and cost-cost decision-making challenges to the animals. We then used optogenetics to interrupt the cortico-striosomal circuit. Across all of these tasks, it was only in the cost-benefit task that the putative striosome-targeting prefrontal pathway was engaged. By contrast, comparable optogenetic experiments inhibiting a matrix-targeting prefronto-striatal circuit produced effects on decision-making in all of the tasks.

Evidence from our own and other laboratories suggests that striosomes may have privileged access to the dopamine-containing neurons of the substantia nigra pars compacta, either directly or by way of a multi-synaptic pathway via the lateral habenula (Rajakumar et al. 1993; Graybiel 2008; Stephenson-Jones et al. 2013). The details of these pathways remain unknown. It is known, however, that the lateral habenula neurons increase their firing rates to negative reinforcers or their predictors; the dopamine-containing nigral neurons fire in relation to positive or, in some populations, to negative reinforcers and predictors (Hong and Hikosaka 2013). This potential dual downstream circuitry, combined with the experimental evidence summarized here, suggests that striosomes could be nodal sites in mood- and emotion-related corticostriatal networks influencing downstream modulators of motivational states.

References

Amemori K, Graybiel AM (2012) Localized microstimulation of primate pregenual cingulate cortex induces negative decision-making. Nat Neurosci 15:776–785
Article CAS PubMed PubMed Central Google Scholar
Barnes T, Kubota Y, Hu D, Jin DZ, Graybiel AM (2005) Activity of striatal neurons reflects dynamic encoding and recoding of procedural memories. Nature 437:1158–1161
Article CAS PubMed Google Scholar
Burguiere E, Monteiro P, Feng G, Graybiel AM (2013) Optogenetic stimulation of lateral orbitofronto-striatal pathway suppresses compulsive behaviors. Science 340:1243–1246
Article CAS PubMed Google Scholar
Crittenden JR, Graybiel AM (2011) Basal ganglia disorders associated with imbalances in the striatal striosome and matrix compartments. Front Neuroanat 5:59
Article PubMed PubMed Central Google Scholar
Eblen F, Graybiel AM (1995) Highly restricted origin of prefrontal cortical inputs to striosomes in the macaque monkey. J Neurosci 15:5999–6013
CAS PubMed Google Scholar
Friedman A, Homma D, Gibb LG, Amemori K, Rubin SJ, Hood AS, Riad MH, Graybiel AM (2015) A corticostriatal path targeting striosomes controls decision-making under conflict. Cell 161:1320–1333
Article CAS PubMed Google Scholar
Fujiyama F, Sohn J, Nakano T, Furuta T, Nakamura KC, Matsuda W, Kaneko T (2011) Exclusive and common targets of neostriatofugal projections of rat striosome neurons: a single neuron-tracing study using a viral vector. Eur J Neurosci 33:668–677
Article PubMed Google Scholar
Graybiel AM (2008) Habits, rituals and the evaluative brain. Annu Rev Neurosci 31:359–387
Article CAS PubMed Google Scholar
Hong S, Hikosaka O (2013) Diverse sources of reward value signals in the basal ganglia nuclei transmitted to the lateral habenula in the monkey. Front Hum Neurosci 7:778
PubMed PubMed Central Google Scholar
Jog M, Kubota Y, Connolly CI, Hillegaart V, Graybiel AM (1999) Building neural representations of habits. Science 286:1745–1749
Article CAS PubMed Google Scholar
Rajakumar N, Elisevich K, Flumerfelt BA (1993) Compartmental origin of the striato-entopeduncular projection in the rat. J Comp Neurol 331:286–296
Article CAS PubMed Google Scholar
Smith KS, Graybiel AM (2013) A dual operator view of habitual behavior reflecting cortical and striatal dynamics. Neuron 79:361–374
Article CAS PubMed PubMed Central Google Scholar
Smith KS, Virkud A, Deisseroth K, Graybiel AM (2012) Reversible online control of habitual behavior by optogenetic perturbation of medial prefrontal cortex. Proc Natl Acad Sci USA 109:18932–18937
Article CAS PubMed PubMed Central Google Scholar
Stephenson-Jones M, Kardamakis AA, Robertson B, Grillner S (2013) Independent circuits in the basal ganglia for the evaluation and selection of actions. Proc Natl Acad Sci USA 110:E3670–E3679
Article CAS PubMed PubMed Central Google Scholar
Thorn CA, Atallah H, Howe M, Graybiel A (2010) Differential dynamics of activity changes in dorsolateral and dorsomedial striatal loops during learning. Neuron 66:781–795
Article CAS PubMed PubMed Central Google Scholar
Watabe-Uchida M, Zhu L, Ogawa SK, Vamanrao A, Uchida N (2012) Whole-brain mapping of direct inputs to midbrain dopamine neurons. Neuron 74:858–873
Article CAS PubMed Google Scholar

Download references

Author information

Authors and Affiliations

McGovern Institute for Brain Research, Massachusetts Institute of Technology, Cambridge, MA, USA
Ann M. Graybiel
Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA, USA
Ann M. Graybiel

Authors

Ann M. Graybiel
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Ann M. Graybiel .

Editor information

Editors and Affiliations

The Neuroscience Institute, New York University, School of Medicine, New York, New York, USA
György Buzsáki
Fondation Ipsen, Boulogne-Billancourt, France
Yves Christen

Rights and permissions

Open Access This chapter is distributed under the terms of the Creative Commons Attribution-Noncommercial 2.5 License (http://creativecommons.org/licenses/by-nc/2.5/) which permits any noncommercial use, distribution, and reproduction in any medium, provided the original author(s) and source are credited.

The images or other third party material in this chapter are included in the work’s Creative Commons license, unless indicated otherwise in the credit line; if such material is not included in the work’s Creative Commons license and the respective action is not permitted by statutory regulation, users will need to obtain permission from the license holder to duplicate, adapt or reproduce the material.

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Graybiel, A.M. (2016). The Striatum and Decision-Making Based on Value. In: Buzsáki, G., Christen, Y. (eds) Micro-, Meso- and Macro-Dynamics of the Brain. Research and Perspectives in Neurosciences. Springer, Cham. https://doi.org/10.1007/978-3-319-28802-4_6

Download citation

DOI: https://doi.org/10.1007/978-3-319-28802-4_6
Published: 03 May 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-28801-7
Online ISBN: 978-3-319-28802-4
eBook Packages: Biomedical and Life SciencesBiomedical and Life Sciences (R0)

Publish with us

Policies and ethics

The Striatum and Decision-Making Based on Value

Abstract

Similar content being viewed by others

Neurophysiology of Reward-Guided Behavior: Correlates Related to Predictions, Value, Motivation, Errors, Attention, and Action

Prefrontal mechanisms combining rewards and beliefs in human decision-making

Frontal cortex function as derived from hierarchical predictive coding

Keywords

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Publish with us

Navigation

The Striatum and Decision-Making Based on Value

Abstract

Similar content being viewed by others

Neurophysiology of Reward-Guided Behavior: Correlates Related to Predictions, Value, Motivation, Errors, Attention, and Action

Prefrontal mechanisms combining rewards and beliefs in human decision-making

Frontal cortex function as derived from hierarchical predictive coding

Keywords

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Share this chapter

Publish with us

Search

Navigation