Dual credit assignment processes underlie dopamine signals in a complex spatial environment

Timothy A Krausz; Alison E Comrie; Loren M Frank; Nathaniel D Daw; Joshua D Berke

doi:10.1101/2023.02.15.528738

Dual credit assignment processes underlie dopamine signals in a complex spatial environment

bioRxiv [Preprint]. 2023 Mar 19:2023.02.15.528738. doi: 10.1101/2023.02.15.528738.

Authors

Timothy A Krausz¹, Alison E Comrie¹, Loren M Frank^{1

2

3

4}, Nathaniel D Daw⁵, Joshua D Berke^{1

2

6}

Affiliations

¹ Neuroscience Graduate Program, University of California, San Francisco.
² Kavli Institute for Fundamental Neuroscience, and Weill Institute for Neurosciences, UCSF.
³ Howard Hughes Medical Institute.
⁴ Department of Physiology, UCSF.
⁵ Department of Psychology, and Princeton Neuroscience Institute, Princeton University, NJ.
⁶ Department of Neurology, and Department of Psychiatry and Behavioral Science, UCSF.

Abstract

Dopamine in the nucleus accumbens helps motivate behavior based on expectations of future reward ("values"). These values need to be updated by experience: after receiving reward, the choices that led to reward should be assigned greater value. There are multiple theoretical proposals for how this credit assignment could be achieved, but the specific algorithms that generate updated dopamine signals remain uncertain. We monitored accumbens dopamine as freely behaving rats foraged for rewards in a complex, changing environment. We observed brief pulses of dopamine both when rats received reward (scaling with prediction error), and when they encountered novel path opportunities. Furthermore, dopamine ramped up as rats ran towards reward ports, in proportion to the value at each location. By examining the evolution of these dopamine place-value signals, we found evidence for two distinct update processes: progressive propagation along taken paths, as in temporal-difference learning, and inference of value throughout the maze, using internal models. Our results demonstrate that within rich, naturalistic environments dopamine conveys place values that are updated via multiple, complementary learning algorithms.

Publication types

Preprint

Abstract

Publication types

Grants and funding