当前位置: X-MOL 学术Prog. Neurobiol. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Frontal eye field neurons selectively signal the reward value of prior actions.
Progress in Neurobiology ( IF 6.7 ) Pub Date : 2020-07-03 , DOI: 10.1016/j.pneurobio.2020.101881
Xiaomo Chen 1 , Marc Zirnsak 1 , Gabriel M Vega 1 , Tirin Moore 1
Affiliation  

The consequences of individual actions are typically unknown until well after they are executed. This fact necessitates a mechanism that bridges delays between specific actions and reward outcomes. We looked for the presence of such a mechanism in the post-movement activity of neurons in the frontal eye field (FEF), a visuomotor area in prefrontal cortex. Monkeys performed an oculomotor gamble task in which they made eye movements to different locations associated with dynamically varying reward outcomes. Behavioral data showed that monkeys tracked reward history and made choices according to their own risk preferences. Consistent with previous studies, we observed that the activity of FEF neurons is correlated with the expected reward value of different eye movements before a target appears. Moreover, we observed that the activity of FEF neurons continued to signal the direction of eye movements, the expected reward value, and their interaction well after the movements were completed and when targets were no longer within the neuronal response field. In addition, this post-movement information was also observed in local field potentials, particularly in low-frequency bands. These results show that neural signals of prior actions and expected reward value persist across delays between those actions and their experienced outcomes. These memory traces may serve a role in reward-based learning in which subjects need to learn actions predicting delayed reward.



中文翻译:

额叶眼场神经元选择性地发出先前动作的奖励值信号。

个人行为的后果通常是未知的,直到它们被执行。这一事实需要一种机制来弥合特定行动和奖励结果之间的延迟。我们在额眼区 (FEF)(前额叶皮层的视觉运动区)神经元的运动后活动中寻找这种机制的存在。猴子执行了一项眼球运动赌博任务,在该任务中,它们将眼球移动到与动态变化的奖励结果相关的不同位置。行为数据显示,猴子会追踪奖励历史,并根据自己的风险偏好做出选择。与之前的研究一致,我们观察到 FEF 神经元的活动与目标出现前不同眼球运动的预期奖励值相关。而且,我们观察到,FEF 神经元的活动在运动完成后以及目标不再位于神经元反应场内时,继续发出眼球运动方向、预期奖励值及其相互作用的信号。此外,这种运动后信息也在局部场电位中观察到,特别是在低频段。这些结果表明,先前行动的神经信号和预期的奖励价值在这些行动与其经历的结果之间的延迟中持续存在。这些记忆痕迹可能在基于奖励的学习中发挥作用,在这种学习中,受试者需要学习预测延迟奖励的行为。以及在运动完成后以及当目标不再位于神经元反应场内时它们的相互作用。此外,这种运动后信息也在局部场电位中观察到,特别是在低频段。这些结果表明,先前行动的神经信号和预期的奖励价值在这些行动与其经历的结果之间的延迟中持续存在。这些记忆痕迹可能在基于奖励的学习中发挥作用,在这种学习中,受试者需要学习预测延迟奖励的行为。以及在运动完成后以及当目标不再位于神经元反应场内时它们的相互作用。此外,这种运动后信息也在局部场电位中观察到,特别是在低频段。这些结果表明,先前行动的神经信号和预期的奖励价值在这些行动与其经历的结果之间的延迟中持续存在。这些记忆痕迹可能在基于奖励的学习中发挥作用,在这种学习中,受试者需要学习预测延迟奖励的行为。

更新日期:2020-07-03
down
wechat
bug