Interpreting Dopamine Activities in Stochastic Reward Tasks

Asahina, Akiyo; Hirayama, Jun-ichiro; Ishii, Shin

doi:10.1007/978-3-642-02490-0_44

Akiyo Asahina^19,20,
Jun-ichiro Hirayama²⁰ &
Shin Ishii^19,20

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 5506))

Included in the following conference series:

International Conference on Neural Information Processing

2100 Accesses

Abstract

Phasic activities of dopamine (DA) neurons in the primate midbrain have been considered as representing temporal difference (TD) errors from a computational perspective. Recently, several studies have reported that, in stochastic reward tasks, the DA activities gradually increase before receiving actual rewards, which is not well explained by the simple TD model. In this study, we propose an alternative model based on a probabilistic formulation of the stochastic reward task. In simulation experiments, expectation errors, defined by the probabilistic modeling, well described the gradually increasing DA activities during a wait period even in a single trial.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Schultz, W., Dayan, P., Montague, P.R.: A neural substrate of prediction and reward. Science 275, 1593–1599 (1997)
Article Google Scholar
Fiorillo, C.D., Tobler, P.N., Schultz, W.: Discrete coding of reward probability and uncertainty by dopamine neurons. Science 299, 1898–1902 (2003)
Article Google Scholar
Tobler, P.N., Fiorillo, C.D., Schultz, W.: Adaptive coding of reward value by dopamine neurons. Science 307, 1642–1645 (2005)
Article Google Scholar
Niv, Y., Duff, M.O., Dayan, P.: Dopamine, uncertainty and TD learning. Behav. Brain Funct. 1, 6 (2005)
Article Google Scholar
Fiorillo, C.D., Tobler, P.N., Schultz, W.: Evidence that the delay-period activity of dopamine neurons corresponds to reward uncertainty rather than backpropagating TD errors. Behav. Brain Funct. 1, 7 (2005)
Article Google Scholar
Schweighofer, N., Doya, K.: Meta-learning in Reinforcement Learning. Neural Netw. 16, 5–9 (2003)
Article Google Scholar
Daw, N.D., Courville, A.C., Touretzky, D.S.: Representation and Timing in Theories of the Dopamine System. Neural Comput. 18, 1637–1677 (2006)
Article MathSciNet MATH Google Scholar
Doya, K.: Metalearning and neuromodulation. Neural Netw. 15, 495–506 (2002)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Graduate School of Information Science, Nara Institute of Science and Technology, 8916-5 Takayama, Ikoma, Nara, 630-0192, Japan
Akiyo Asahina & Shin Ishii
Graduate School of Informatics, Kyoto University, Gokasho, Uji, Kyoto, 611-0011, Japan
Akiyo Asahina, Jun-ichiro Hirayama & Shin Ishii

Authors

Akiyo Asahina
View author publications
You can also search for this author in PubMed Google Scholar
Jun-ichiro Hirayama
View author publications
You can also search for this author in PubMed Google Scholar
Shin Ishii
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Kyushu Institute of Technology, Network Design and Research Center, 680-4 Fukuoka, 820-8502, Kawazu, Iizuka, Japan
Mario Köppen
Knowledge Engineering and Discovery Research Institute (KEDRI), School of Computing and Mathematical Sciences, Auckland University of Technology, 350 Queen Street, 10110, Auckland, New Zealand
Nikola Kasabov
Department of Electrical and Computer Engineering, Robotics Laboratory, Auckland University of Technology, 38 Princes Street, 1142, Auckland, New Zealand
George Coghill

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Asahina, A., Hirayama, Ji., Ishii, S. (2009). Interpreting Dopamine Activities in Stochastic Reward Tasks. In: Köppen, M., Kasabov, N., Coghill, G. (eds) Advances in Neuro-Information Processing. ICONIP 2008. Lecture Notes in Computer Science, vol 5506. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-02490-0_44

Download citation

DOI: https://doi.org/10.1007/978-3-642-02490-0_44
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-02489-4
Online ISBN: 978-3-642-02490-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics