Dopaminergic action prediction errors serve as a value-free teaching signal

成果类型:
Article
署名作者:
Greenstreet, Francesca; Vergara, Hernando Martinez; Johansson, Yvonne; Pati, Sthitapranjya; Schwarz, Laura; Lenzi, Stephen C.; Geerts, Jesse P.; Wisdom, Matthew; Gubanova, Alina; Rollik, Lars B.; Kaur, Jasvin; Moskovitz, Theodore; Cohen, Joseph; Thompson, Emmett; Margrie, Troy W.; Clopath, Claudia; Stephenson-Jones, Marcus
署名单位:
University of London; University College London; Imperial College London; University of London; University College London
刊物名称:
Nature
ISSN/ISSBN:
0028-1426
DOI:
10.1038/s41586-025-09008-9
发表日期:
2025-07-31
关键词:
basal ganglia circuits reward neurons REPRESENTATION MOVEMENT BEHAVIOR mouse acquisition striatum dorsal
摘要:
Choice behaviour of animals is characterized by two main tendencies: taking actions that led to rewards and repeating past actions1,2. Theory suggests that these strategies may be reinforced by different types of dopaminergic teaching signals: reward prediction error to reinforce value-based associations and movement-based action prediction errors to reinforce value-free repetitive associations3, 4, 5-6. Here we use an auditory discrimination task in mice to show that movement-related dopamine activity in the tail of the striatum encodes the hypothesized action prediction error signal. Causal manipulations reveal that this prediction error serves as a value-free teaching signal that supports learning by reinforcing repeated associations. Computational modelling and experiments demonstrate that action prediction errors alone cannot support reward-guided learning, but when paired with the reward prediction error circuitry they serve to consolidate stable sound-action associations in a value-free manner. Together we show that there are two types of dopaminergic prediction errors that work in tandem to support learning, each reinforcing different types of association in different striatal areas.