Principal components analysis of reward prediction errors in a reinforcement learning task

首页> 外文期刊>NeuroImage >Principal components analysis of reward prediction errors in a reinforcement learning task

【24h】

Principal components analysis of reward prediction errors in a reinforcement learning task

机译：强化学习任务中奖励预测错误的主成分分析

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Models of reinforcement learning represent reward and punishment in terms of reward prediction errors (RPEs), quantitative signed terms describing the degree to which outcomes are better than expected (positive RPEs) or worse (negative RPEs). An electrophysiological component known as feedback related negativity (FRN) occurs at frontocentral sites 240-340 ms after feedback on whether a reward or punishment is obtained, and has been claimed to neurally encode an RPE. An outstanding question however, is whether the FRN is sensitive to the size of both positive RPEs and negative RPEs. Previous attempts to answer this question have examined the simple effects of RPE size for positive RPEs and negative RPEs separately. However, this methodology can be compromised by overlap from components coding for unsigned prediction error size, or "salience", which are sensitive to the absolute size of a prediction error but not its valence. In our study, positive and negative RPEs were parametrically modulated using both reward likelihood and magnitude, with principal components analysis used to separate out overlying components. This revealed a single RPE encoding component responsive to the size of positive RPEs, peaking at similar to 330ms, and occupying the delta frequency band. Other components responsive to unsigned prediction error size were shown, but no component sensitive to negative RPE size was found. (C) 2015 Elsevier Inc. All rights reserved.

机译：强化学习的模型以奖励预测错误（RPE）表示量化的奖励和惩罚，量化有符号的术语描述了结果好于预期（积极RPE）或更差（负面RPE）的程度。在获得奖励或惩罚的反馈后240-340 ms，额叶中央部位出现了一种称为反馈相关负电荷（FRN）的电生理成分，并据称可以对RPE进行神经编码。但是，一个悬而未决的问题是FRN是否对正RPE和负RPE的大小都敏感。先前回答该问题的尝试已经分别检查了RPE尺寸对正RPE和负RPE的简单影响。但是，此方法可能会因编码无符号预测误差大小或“显着性”的分量重叠而受到损害，这些分量对预测误差的绝对大小敏感，但对其价数不敏感。在我们的研究中，使用奖励可能性和幅度对正负RPE进行参数调制，并使用主成分分析来分离出重叠成分。这揭示了响应于正RPE大小的单个RPE编码组件，峰值类似于330ms，并占据了增量频带。显示了对无符号预测误差大小有响应的其他组件，但未发现对负RPE大小敏感的组件。（C）2015 Elsevier Inc.保留所有权利。

著录项

来源
《NeuroImage》 |2016年第1期|共11页
作者

展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Principal components analysis of reward prediction errors in a reinforcement learning task [J] . NeuroImage . 2016,第Pta1期

机译：强化学习任务中奖励预测错误的主成分分析
2. Subjective and model-estimated reward prediction: association with the feedback-related negativity (FRN) and reward prediction error in a reinforcement learning task. [J] . Ichikawa N, Siegle GJ, Dombrovski A, International journal of psychophysiology: official journal of the International Organization of Psychophysiology . 2010,第3期

机译：主观和模型估计的奖励预测：与强化学习任务中与反馈相关的负性（FRN）和奖励预测错误相关联。
3. Feedback delay impaired reinforcement learning: Principal components analysis of Reward Positivity [J] . Hang Yin, Yu Wang, Xukai Zhang, Neuroscience Letters: An International Multidisciplinary Journal Devoted to the Rapid Publication of Basic Research in the Brain Sciences . 2018,第期

机译：反馈延迟损害加固学习：奖励积极性的主要成分分析
4. Analogue-dynamical prediction of numerical model errors based on principal component analysis [C] . Wang Qiguang, Aixia Feng, Feng Guolin, 2011 Eighth International Conference on Fuzzy Systems and Knowledge Discovery . 2011

机译：基于主成分分析的数值模型误差的模拟动力学预测
5. Reward Prediction Errors Shape Memory during Reinforcement Learning [D] . Rouhani, Nina. 2020

机译：奖励预测错误在加固学习期间形状内存
6. Inferring reward prediction errors in patients with schizophrenia: a dynamic reward task for reinforcement learning [O] . Chia-Tzu Li, Wen-Sung Lai, Chih-Min Liu, 2014

机译：推断精神分裂症患者的奖励预测错误：强化学习的动态奖励任务
7. Principal components analysis of reward prediction errors in a reinforcement learning task. [O] . Sambrook TD, Goslin J 2016

机译：强化学习任务中奖励预测错误的主成分分析。

Principal components analysis of reward prediction errors in a reinforcement learning task

摘要

著录项

相似文献

相关主题

期刊订阅