Generalization of value in reinforcement learning by humans

机译：人类加强学习价值的概括

代理获取

本网站仅为用户提供外文OA文献查询和代理获取服务，本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文，但由于OA文献来源多样且变更频繁，仍可能出现获取不到、文献不完整或与标题不符等情况，如果获取不到我们将提供退款服务。请知悉。

页面导航

摘要
著录项
相似文献
相关主题

摘要

Research in decision making has focused on the role of dopamine and its striatal targets in guiding choices via learned stimulus-reward or stimulus-response associations, behavior that is well-described by reinforcement learning (RL) theories. However, basic RL is relatively limited in scope and does not explain how learning about stimulus regularities or relations may guide decision making. A candidate mechanism for this type of learning comes from the domain of memory, which has highlighted a role for the hippocampus in learning of stimulus-stimulus relations, typically dissociated from the role of the striatum in stimulus-response learning. Here, we used fMRI and computational model-based analyses to examine the joint contributions of these mechanisms to RL. Humans performed an RL task with added relational structure, modeled after tasks used to isolate hippocampal contributions to memory. On each trial participants chose one of four options, but the reward probabilities for pairs of options were correlated across trials. This (uninstructed) relationship between pairs of options potentially enabled an observer to learn about options’ values based on experience with the other options and to generalize across them. We observed BOLD activity related to learning in the striatum and also in the hippocampus. By comparing a basic RL model to one augmented to allow feedback to generalize between correlated options, we tested whether choice behavior and BOLD activity were influenced by the opportunity to generalize across correlated options. Although such generalization goes beyond standard computational accounts of RL and striatal BOLD, both choices and striatal BOLD were better explained by the augmented model. Consistent with the hypothesized role for the hippocampus in this generalization, functional connectivity between the ventral striatum and hippocampus was modulated, across participants, by the ability of the augmented model to capture participants’ choice. Our results thus point toward an interactive model in which striatal RL systems may employ relational representations typically associated with the hippocampus.

机译：决策的研究侧重于多巴胺及其纹状体目标通过学习刺激奖励或刺激 - 响应协会，由加强学习（RL）理论良好描述的行为的指导选择。然而，基本RL的范围相对较为有限，并且没有解释关于刺激规律或关系的学习如何指导决策。这种学习的候选机制来自记忆域，这凸显了海马在学习刺激刺激关系方面的作用，通常与刺激反应学习中的纹章中的作用解离。在这里，我们使用FMRI和基于计算模型的分析来检查这些机制对RL的联合贡献。人类使用添加关系结构进行了RL任务，在用于将海马贡献隔离到内存的任务后建模。在每次试验中，参与者选择了四种选项中的一个，但在试验中，奖励概率与对的奖励概率相关联。对选项对之间的这种（无解释的）关系可能使观察者基于具有其他选项的体验和概括地概括了观察者来了解选项的值。我们观察到与纹状体中的学习和海马有关的大胆活动。通过将基本RL模型与一个增强进行比较以允许反馈来概括相关选项之间的概括，我们测试了选择行为和大胆活动是否受到跨相关选择概括的机会的影响。虽然这种概括超出了RL的标准计算账户和纹纹粗体，但增强模型更好地解释了两种选择和纹身大胆。与海马的假设作用一致，通过增强模型捕获参与者选择的能力，调制腹部纹状体和海马之间的功能性连通性。因此，我们的结果指出了一种交互式模型，其中纹状体RL系统可以采用通常与海马相关联的关系表示。

著录项

期刊名称 other
作者
G. Elliott Wimmer; Nathaniel D. Daw; Daphna Shohamy;
展开▼
作者单位

展开▼
年(卷),期 -1(35),7
年度 -1
页码 1092–1104
总页数 28
原文格式 PDF
正文语种
中图分类
关键词
hippocampus ventral striatum reward memory computational model;

机译：海马;腹侧纹状体;奖励;记忆;计算模型;

相似文献

外文文献
中文文献
专利

1. Generalization of value in reinforcement learning by humans [J] . WimmerG.E., DawN.D., ShohamyD. The European Journal of Neuroscience . 2012,第7a8期

机译：强化学习中的价值概括
2. Time Horizon Generalization in Reinforcement Learning: Generalizing Multiple Q-Tables in Q-Learning Agents [J] . Yasuyo Hatcho, Kiyohiko Hattori, Keiki Takadama Journal of Advanced Computatioanl Intelligence and Intelligent Informatics . 2009,第6a72期

机译：强化学习中的时间范围泛化：Q学习代理中的多个Q表泛化
3. Multi-Agent Deep Reinforcement Learning-Based Algorithm For Fast Generalization On Routing Problems [J] . Ibraheem Barbahan, Vladimir Baikalov, Valeriy Vyatkin, Procedia Computer Science . 2021,第a期

机译：基于多功能深度加强学习算法，用于快速概括对路由问题的影响
4. Learning and Generalization of Dynamic Movement Primitives by Hierarchical Deep Reinforcement Learning from Demonstration [C] . Wonchul Kim, Chungkeun Lee, H. Jin Kim IEEE/RSJ International Conference on Intelligent Robots and Systems . 2018

机译：通过演示的分层深度强化学习对动态动作基元进行学习和泛化
5. On Deep Reinforcement Learning for Games: Generalization of Deep Q-Learning with Multiple Policy Heads [D] . Boucher, Mathieu. 2020

机译：关于游戏的深度加固学习：多重政策头部深度Q学的泛化
6. Novelty and Inductive Generalization in Human Reinforcement Learning [O] . Samuel J. Gershman, Yael Niv -1

机译：强化学习中的新颖性和归纳概括
7. Generalization of value in reinforcement learning by humans [O] . G. Elliott Wimmer, Nathaniel D. Daw, Daphna Shohamy 2012

机译：人类加强学习价值的概括

Generalization of value in reinforcement learning by humans

摘要

著录项

相似文献

相关主题

期刊订阅