首页> 外文期刊>Journal of Cognitive Neuroscience >Learning Contextual Reward Expectations for Value Adaptation
【24h】

Learning Contextual Reward Expectations for Value Adaptation

机译:学习情境奖励对价值适应的期望

获取原文
获取原文并翻译 | 示例

摘要

ara>Substantial evidence indicates that subjective value is adapted to the statistics of reward expected within a given temporal context. However, how these contextual expectations are learned is poorly understood. To examine such learning, we exploited a recent observation that participants performing a gambling task adjust their preferences as a function of context. We show that, in the absence of contextual cues providing reward information, an average reward expectation was learned from recent past experience. Learning dependent on contextual cues emerged when two contexts alternated at a fast rate, whereas both cue-independent and cue-dependent forms of learning were apparent when two contexts alternated at a slower rate. Motivated by these behavioral findings, we reanalyzed a previous fMRI data set to probe the neural substrates of learning contextual reward expectations. We observed a form of reward prediction error related to average reward such that, at option presentation, activity in ventral tegmental area/substantia nigra and ventral striatum correlated positively and negatively, respectively, with the actual and predicted value of options. Moreover, an inverse correlation between activity in ventral tegmental area/substantia nigra (but not striatum) and predicted option value was greater in participants showing enhanced choice adaptation to context. The findings help understanding the mechanisms underlying learning of contextual reward expectation.
机译:大量证据表明,主观价值适合于给定时间范围内预期的报酬统计。但是,如何了解这些上下文期望却知之甚少。为了检验这种学习,我们利用了最近的观察结果,即参加赌博任务的参与者根据情境调整自己的偏好。我们表明,在没有提供奖励信息的上下文提示的情况下,从最近的过去经验中可以获得平均奖励期望。当两个情境以很快的速度交替出现时,就会出现依赖于情境提示的学习,而当两个情境以较低的速度交替出现时,独立于提示和依赖于提示的学习形式就会出现。受这些行为调查结果的激励,我们重新分析了先前的fMRI数据集,以探究学习情境奖励期望的神经基础。我们观察到一种与平均奖励有关的奖励预测误差,这样,在期权呈报时,腹侧被盖区/黑质和腹侧纹状体的活动分别与期权的实际价值和预测价值呈正相关和负相关。此外,参与者的腹侧被盖区/黑质(而不是纹状体)的活动与预测的选项值之间的负相关性更大,表明参与者对背景的选择适应性增强。这些发现有助于理解情境奖励期望学习的潜在机制。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号