...
首页> 外文期刊>NeuroImage >Principal components analysis of reward prediction errors in a reinforcement learning task
【24h】

Principal components analysis of reward prediction errors in a reinforcement learning task

机译:强化学习任务中奖励预测错误的主成分分析

获取原文
获取原文并翻译 | 示例
           

摘要

Models of reinforcement learning represent reward and punishment in terms of reward prediction errors (RPEs), quantitative signed terms describing the degree to which outcomes are better than expected (positive RPEs) or worse (negative RPEs). An electrophysiological component known as feedback related negativity (FRN) occurs at frontocentral sites 240-340 ms after feedback on whether a reward or punishment is obtained, and has been claimed to neurally encode an RPE. An outstanding question however, is whether the FRN is sensitive to the size of both positive RPEs and negative RPEs. Previous attempts to answer this question have examined the simple effects of RPE size for positive RPEs and negative RPEs separately. However, this methodology can be compromised by overlap from components coding for unsigned prediction error size, or "salience", which are sensitive to the absolute size of a prediction error but not its valence. In our study, positive and negative RPEs were parametrically modulated using both reward likelihood and magnitude, with principal components analysis used to separate out overlying components. This revealed a single RPE encoding component responsive to the size of positive RPEs, peaking at similar to 330ms, and occupying the delta frequency band. Other components responsive to unsigned prediction error size were shown, but no component sensitive to negative RPE size was found. (C) 2015 Elsevier Inc. All rights reserved.
机译:强化学习的模型以奖励预测错误(RPE)表示量化的奖励和惩罚,量化有符号的术语描述了结果好于预期(积极RPE)或更差(负面RPE)的程度。在获得奖励或惩罚的反馈后240-340 ms,额叶中央部位出现了一种称为反馈相关负电荷(FRN)的电生理成分,并据称可以对RPE进行神经编码。但是,一个悬而未决的问题是FRN是否对正RPE和负RPE的大小都敏感。先前回答该问题的尝试已经分别检查了RPE尺寸对正RPE和负RPE的简单影响。但是,此方法可能会因编码无符号预测误差大小或“显着性”的分量重叠而受到损害,这些分量对预测误差的绝对大小敏感,但对其价数不敏感。在我们的研究中,使用奖励可能性和幅度对正负RPE进行参数调制,并使用主成分分析来分离出重叠成分。这揭示了响应于正RPE大小的单个RPE编码组件,峰值类似于330ms,并占据了增量频带。显示了对无符号预测误差大小有响应的其他组件,但未发现对负RPE大小敏感的组件。 (C)2015 Elsevier Inc.保留所有权利。

著录项

  • 来源
    《NeuroImage》 |2016年第1期|共11页
  • 作者

  • 作者单位
  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号