Abstract
Phasic activities of dopamine (DA) neurons in the primate midbrain have been considered as representing temporal difference (TD) errors from a computational perspective. Recently, several studies have reported that, in stochastic reward tasks, the DA activities gradually increase before receiving actual rewards, which is not well explained by the simple TD model. In this study, we propose an alternative model based on a probabilistic formulation of the stochastic reward task. In simulation experiments, expectation errors, defined by the probabilistic modeling, well described the gradually increasing DA activities during a wait period even in a single trial.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Schultz, W., Dayan, P., Montague, P.R.: A neural substrate of prediction and reward. Science 275, 1593–1599 (1997)
Fiorillo, C.D., Tobler, P.N., Schultz, W.: Discrete coding of reward probability and uncertainty by dopamine neurons. Science 299, 1898–1902 (2003)
Tobler, P.N., Fiorillo, C.D., Schultz, W.: Adaptive coding of reward value by dopamine neurons. Science 307, 1642–1645 (2005)
Niv, Y., Duff, M.O., Dayan, P.: Dopamine, uncertainty and TD learning. Behav. Brain Funct. 1, 6 (2005)
Fiorillo, C.D., Tobler, P.N., Schultz, W.: Evidence that the delay-period activity of dopamine neurons corresponds to reward uncertainty rather than backpropagating TD errors. Behav. Brain Funct. 1, 7 (2005)
Schweighofer, N., Doya, K.: Meta-learning in Reinforcement Learning. Neural Netw. 16, 5–9 (2003)
Daw, N.D., Courville, A.C., Touretzky, D.S.: Representation and Timing in Theories of the Dopamine System. Neural Comput. 18, 1637–1677 (2006)
Doya, K.: Metalearning and neuromodulation. Neural Netw. 15, 495–506 (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Asahina, A., Hirayama, Ji., Ishii, S. (2009). Interpreting Dopamine Activities in Stochastic Reward Tasks. In: Köppen, M., Kasabov, N., Coghill, G. (eds) Advances in Neuro-Information Processing. ICONIP 2008. Lecture Notes in Computer Science, vol 5506. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-02490-0_44
Download citation
DOI: https://doi.org/10.1007/978-3-642-02490-0_44
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-02489-4
Online ISBN: 978-3-642-02490-0
eBook Packages: Computer ScienceComputer Science (R0)