Neural Circuitry of Reward Prediction Error,Annual Review of Neuroscience

当前位置： X-MOL 学术 › Annu. Rev. Neurosci. › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

Neural Circuitry of Reward Prediction Error
Annual Review of Neuroscience ( IF 12.1 ) Pub Date : 2017-08-03 00:00:00 , DOI: 10.1146/annurev-neuro-072116-031109
Mitsuko Watabe-Uchida ₁ , Neir Eshel _{1,

2} , Naoshige Uchida ₁

Affiliation

Dopamine neurons facilitate learning by calculating reward prediction error, or the difference between expected and actual reward. Despite two decades of research, it remains unclear how dopamine neurons make this calculation. Here we review studies that tackle this problem from a diverse set of approaches, from anatomy to electrophysiology to computational modeling and behavior. Several patterns emerge from this synthesis: that dopamine neurons themselves calculate reward prediction error, rather than inherit it passively from upstream regions; that they combine multiple separate and redundant inputs, which are themselves interconnected in a dense recurrent network; and that despite the complexity of inputs, the output from dopamine neurons is remarkably homogeneous and robust. The more we study this simple arithmetic computation, the knottier it appears to be, suggesting a daunting (but stimulating) path ahead for neuroscience more generally.

中文翻译：

奖励预测误差的神经回路

多巴胺神经元通过计算奖励预测误差或预期奖励与实际奖励之间的差异来促进学习。尽管进行了二十年的研究，但仍不清楚多巴胺神经元是如何进行这种计算的。在这里，我们回顾了从解剖学到电生理学到计算模型和行为的多种方法来解决这个问题的研究。这种合成产生了几种模式：多巴胺神经元本身计算奖励预测误差，而不是从上游区域被动地继承它。它们结合了多个单独且冗余的输入，它们本身在密集的循环网络中互连；尽管输入很复杂，但多巴胺神经元的输出却非常均匀且稳定。我们越研究这种简单的算术运算，

更新日期：2017-08-03

点击分享查看原文

点击收藏

公开下载

阅读更多本刊最新论文本刊介绍/投稿指南11