The temporal precision of reward prediction in dopamine neurons

Fiorillo, Christopher D; Newsome, William T; Schultz, Wolfram

doi:10.1038/nn.2159

Article
Published: 27 July 2008

Volume 11, pages 966–973 (2008)

Christopher D Fiorillo¹,
William T Newsome¹ &
Wolfram Schultz²

Abstract

Midbrain dopamine neurons are activated when reward is greater than predicted, and this error signal could teach target neurons both the value of reward and when it will occur. We used the dopamine error signal to measure how the expectation of reward was distributed over time. Animals were trained with fixed-duration intervals of 1–16 s between conditioned stimulus onset and reward. In contrast to the weak responses that have been observed after short intervals (1–2 s), activations to reward increased steeply and linearly with the logarithm of the interval. Results with varied stimulus-reward intervals suggest that the neural expectation was substantial after just half an interval had elapsed. Thus, the neural expectation of reward in these experiments was not highly precise and the precision declined sharply with interval duration. The neural precision of expectation appeared to be at least qualitatively similar to the precision of anticipatory licking behavior.
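
The log-linear relationship described in the abstract can be illustrated with a small sketch. This is not the authors' model or data: the function name `reward_response` and the `baseline` and `gain` values are hypothetical placeholders, chosen only to show what "linear with the logarithm of the interval" implies, namely that each doubling of the interval adds the same increment to the response.

```python
import math


def reward_response(interval_s, baseline=0.5, gain=2.0):
    """Hypothetical log-linear response: baseline + gain * log2(interval).

    baseline and gain are illustrative placeholders, not values from the paper.
    """
    if interval_s <= 0:
        raise ValueError("interval_s must be positive")
    return baseline + gain * math.log2(interval_s)


# Intervals spanning the 1 to 16 s range used in the experiments.
intervals = [1, 2, 4, 8, 16]
responses = [reward_response(t) for t in intervals]

# Under a log-linear form, every doubling adds the same increment (the gain).
increments = [b - a for a, b in zip(responses, responses[1:])]

if __name__ == "__main__":
    for t, r in zip(intervals, responses):
        print(f"{t:>2} s -> {r:.2f} (arbitrary units)")
```

With these placeholder values the response at the shortest interval equals the baseline, and the increment per doubling equals the gain; real parameters would have to be estimated from recorded responses.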

**Figure 1: Timing of anticipatory licking behavior (Experiment 1).**

**Figure 2: Dopamine neurons are sensitive to interval duration (Experiment 1).**

**Figure 3: Response of dopamine neurons to juice delivered earlier or later than usual (Experiment 2).**

**Figure 4: Response of dopamine neurons to juice delivered following a stimulus-reward interval that varied across trials (Experiment 3).**

**Figure 5: Responses of dopamine neurons as a function of a stimulus-reward interval that varies from trial to trial (Experiment 3).**

**Figure 6: A model of interval timing and the dopamine error signal that could account for the data of Figure 2d.**

Acknowledgements

This work was supported by grants from the Human Frontiers Science Program (C.D.F.), the Howard Hughes Medical Institute (W.T.N.), the US National Institutes of Health (EY 05603, W.T.N.), the Swiss National Science Funds (W.S.) and the Wellcome Trust (W.S.).

Author information

Authors and Affiliations

Department of Neurobiology, Fairchild Building, D209, 299 Campus Drive West, Stanford University, Stanford, 94305-5125, California, USA
Christopher D Fiorillo & William T Newsome
Department of Physiology, Development and Neuroscience, University of Cambridge, Downing Street, Cambridge, CB2 3DY, UK
Wolfram Schultz

Contributions

C.D.F. conducted the experiments, analyzed the data and developed the mathematical model of dopamine responses. C.D.F. and W.S. designed the experiments. C.D.F. wrote the manuscript with feedback from W.T.N. and W.S.

Corresponding author

Correspondence to Christopher D Fiorillo.

Supplementary information

Supplementary Text and Figures

Supplementary Figures 1–4, Supplementary Methods and Supplementary Results (PDF 3905 kb)

About this article

Cite this article

Fiorillo, C., Newsome, W. & Schultz, W. The temporal precision of reward prediction in dopamine neurons. Nat Neurosci 11, 966–973 (2008). https://doi.org/10.1038/nn.2159

Received: 17 March 2008
Accepted: 03 June 2008
Published: 27 July 2008
Issue Date: August 2008
DOI: https://doi.org/10.1038/nn.2159
Springer Nature America, Inc.
