Muller et al. demonstrate that reward signals recorded from the frontal cortex of nonhuman primates exhibit a population-based scheme for learning probability distributions over reward values. This study provides evidence that neural signals outside of the midbrain reflect the principles of distributional reinforcement-learning theory.
References
Kacelnik, A. & Bateson, M. Trends Cogn. Sci. 1, 304–309 (1997).
Bellemare, M. G., Dabney, W. & Munos, R. A distributional perspective on reinforcement learning. In Proc. 34th Int. Conf. Machine Learning Vol. 70 (eds. Precup, D. & Teh, Y. W.) 449–458 (PMLR, 2017).
Muller, T. H. et al. Nat. Neurosci. https://doi.org/10.1038/s41593-023-01535-w (2024).
Schultz, W., Apicella, P. & Ljungberg, T. J. Neurosci. 13, 900–913 (1993).
Oyama, K., Hernádi, I., Iijima, T. & Tsutsui, K. J. Neurosci. 30, 11447–11457 (2010).
Miranda, B., Malalasekera, W. M. N., Behrens, T. E., Dayan, P. & Kennerley, S. W. PLoS Comput. Biol. 16, e1007944 (2020).
Kennerley, S. W., Behrens, T. E. & Wallis, J. D. Nat. Neurosci. 14, 1581–1589 (2011).
Dabney, W. et al. Nature 577, 671–675 (2020).
Rothenhoefer, K. M., Hong, T., Alikaya, A. & Stauffer, W. R. Nat. Neurosci. 24, 465–469 (2021).
Kolling, N., Wittmann, M. & Rushworth, M. F. S. Neuron 81, 1190–1202 (2014).
Ethics declarations
Competing interests
The authors declare no competing interests.
Hong, T., Stauffer, W.R. Anterior cingulate learns reward distribution. Nat Neurosci 27, 391–392 (2024). https://doi.org/10.1038/s41593-024-01571-0
DOI: https://doi.org/10.1038/s41593-024-01571-0
- Springer Nature America, Inc.