Journal of Mathematical Psychology

The relation between reinforcement learning parameters and the influence of reinforcement history on choice behavior

Abstract

Reinforcement learning (RL) models have been widely used to analyze the choice behavior of humans and other animals in a broad range of fields, including psychology and neuroscience. Linear regression-based models that explicitly represent how reward and choice history influences future choices have also been used to model choice behavior. While both approaches have been used independently, the relation between the two models has not been explicitly described. The aim of the present study is to describe this relation and investigate how the parameters in the RL model mediate the effects of reward and choice history on future choices. To achieve these aims, we performed analytical calculations and numerical simulations. First, we describe a special case in which the RL and regression models can provide equivalent predictions of future choices. The general properties of the RL model are discussed as a departure from this special case. We clarify the role of the RL-model parameters, specifically, the learning rate, inverse temperature, and outcome value (also referred to as the reward value, reward sensitivity, or motivational value), in the formation of history dependence. (C) 2015 The Author. Published by Elsevier Inc. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
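The special case mentioned in the abstract can be illustrated with a short simulation. Below is a minimal sketch (hypothetical code, not taken from the paper) assuming a standard two-armed bandit Q-learning model with softmax choice: unrolling the value update shows that the learning rate alpha sets the exponential decay of the reward-history weights, while the inverse temperature beta and the outcome value kappa jointly scale them. Under these assumptions, the RL model's choice log-odds coincide with those of a logistic regression on reward history with exponentially decaying coefficients.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical two-armed bandit Q-learning agent with softmax choice.
alpha = 0.3    # learning rate: sets how fast reward-history weights decay
beta = 5.0     # inverse temperature: scales all history weights
kappa = 1.0    # outcome value (reward sensitivity)
n_trials = 1000
p_reward = np.array([0.7, 0.3])   # reward probability of each option

Q = np.zeros(2)
for t in range(n_trials):
    # Softmax over two options reduces to a logistic function of Q[1] - Q[0].
    p1 = 1.0 / (1.0 + np.exp(-beta * (Q[1] - Q[0])))
    c = int(rng.random() < p1)             # chosen option (0 or 1)
    r = float(rng.random() < p_reward[c])  # binary reward
    # Update the chosen option only; with Q_0 = 0 this unrolls to
    # Q_t(a) = sum_k alpha * (1 - alpha)**k * kappa * r_{t-k}(a).
    Q[c] += alpha * (kappa * r - Q[c])

# Implied regression weight on a reward received k choices back:
lags = np.arange(10)
print(beta * kappa * alpha * (1 - alpha) ** lags)
```

In this sketch, a logistic regression on reward history whose coefficients are constrained to this exponential form would predict the same choices as the RL model; departures from this constrained form correspond to the more general properties of the RL model that the paper discusses.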
