Learning, Reward, and Decision Making
Annual Review of Psychology (IF 24.8) , Pub Date: 2017-01-04 , DOI: 10.1146/annurev-psych-010416-044216
John P. O'Doherty, Jeffrey Cockburn, Wolfgang M. Pauli

In this review, we summarize findings supporting the existence of multiple behavioral strategies for controlling reward-related behavior, including a dichotomy between the goal-directed or model-based system and the habitual or model-free system in the domain of instrumental conditioning and a similar dichotomy in the realm of Pavlovian conditioning. We evaluate evidence from neuroscience supporting the existence of at least partly distinct neuronal substrates contributing to the key computations necessary for the function of these different control systems. We consider the nature of the interactions between these systems and show how these interactions can lead to either adaptive or maladaptive behavioral outcomes. We then review evidence that an additional system guides inference concerning the hidden states of other agents, such as their beliefs, preferences, and intentions, in a social context. We also describe emerging evidence for an arbitration mechanism between model-based and model-free reinforcement learning, placing such a mechanism within the broader context of the hierarchical control of behavior.

Updated: 2017-01-04