Journal: NeuroImage

Frontal theta links prediction errors to behavioral adaptation in reinforcement learning.

Abstract

Investigations into action monitoring have consistently detailed a frontocentral voltage deflection in the event-related potential (ERP) following the presentation of negatively valenced feedback, sometimes termed the feedback-related negativity (FRN). The FRN has been proposed to reflect a neural response to prediction errors during reinforcement learning, yet the single-trial relationship between neural activity and the quanta of expectation violation remains untested. Although ERP methods are not well suited to single-trial analyses, the FRN has been associated with theta band oscillatory perturbations in the medial prefrontal cortex. Mediofrontal theta oscillations have been previously associated with expectation violation and behavioral adaptation and are well suited to single-trial analysis. Here, we recorded EEG activity during a probabilistic reinforcement learning task and fit the performance data to an abstract computational model (Q-learning) for calculation of single-trial reward prediction errors. Single-trial theta oscillatory activities following feedback were investigated within the context of expectation (prediction error) and adaptation (subsequent reaction time change). Results indicate that interactive medial and lateral frontal theta activities reflect the degree of negative and positive reward prediction error in the service of behavioral adaptation. These different brain areas use prediction error calculations for different behavioral adaptations, with medial frontal theta reflecting the utilization of prediction errors for reaction time slowing (specifically following errors), but lateral frontal theta reflecting prediction errors leading to working memory-related reaction time speeding for the correct choice.
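
The abstract names Q-learning as the model used to derive single-trial reward prediction errors but does not give the update rule or the fitted parameters. The following is a minimal sketch of a standard Q-learning update, not the authors' implementation. The function name, the two-option task layout, the learning rate, and the initial value are all illustrative assumptions.

```python
from typing import List, Sequence, Tuple


def q_learning_prediction_errors(
    choices: Sequence[int],
    rewards: Sequence[float],
    n_options: int = 2,          # assumed number of stimuli; not stated in the abstract
    learning_rate: float = 0.1,  # assumed value; the study fits this to behavior
    initial_q: float = 0.5,      # assumed neutral starting expectation
) -> Tuple[List[float], List[float]]:
    """Return per-trial reward prediction errors and the final Q-values."""
    q = [initial_q] * n_options
    prediction_errors: List[float] = []
    for choice, reward in zip(choices, rewards):
        # Prediction error: obtained reward minus expected value of the chosen option.
        delta = reward - q[choice]
        # Move the expectation toward the outcome by a fraction of the error.
        q[choice] += learning_rate * delta
        prediction_errors.append(delta)
    return prediction_errors, q


if __name__ == "__main__":
    # Hypothetical three-trial sequence: option index chosen, then feedback (1 = win, 0 = loss).
    pes, final_q = q_learning_prediction_errors([0, 0, 1], [1.0, 0.0, 1.0], learning_rate=0.5)
    print(pes, final_q)
```

Under this sketch, a negative prediction error corresponds to feedback that is worse than expected and a positive one to feedback that is better than expected; these signed single-trial values are the kind of quantity the study relates to post-feedback theta activity.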
