Gaze data reveal distinct choice processes underlying model-based and model-free reinforcement learning

Arkady Konovalov; Ian Krajbich

首页> 外文期刊>Nature Communications >Gaze data reveal distinct choice processes underlying model-based and model-free reinforcement learning

【24h】

Gaze data reveal distinct choice processes underlying model-based and model-free reinforcement learning

机译：凝视数据揭示了基于模型和无模型强化学习的独特选择过程

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Organisms appear to learn and make decisions using different strategies known as model-free and model-based learning; the former is mere reinforcement of previously rewarded actions and the latter is a forward-looking strategy that involves evaluation of action-state transition probabilities. Prior work has used neural data to argue that both model-based and model-free learners implement a value comparison process at trial onset, but model-based learners assign more weight to forward-looking computations. Here using eye-tracking, we report evidence for a different interpretation of prior results: model-based subjects make their choices prior to trial onset. In contrast, model-free subjects tend to ignore model-based aspects of the task and instead seem to treat the decision problem as a simple comparison process between two differentially valued items, consistent with previous work on sequential-sampling models of decision making. These findings illustrate a problem with assuming that experimental subjects make their decisions at the same prescribed time.

机译：有机体似乎使用称为无模型和基于模型的学习的不同策略来学习和决策。前者仅仅是对先前奖励行动的加强，而后者是一种前瞻性战略，涉及对行动状态转变概率的评估。先前的工作已经使用神经数据论证了基于模型的学习者和无模型的学习者都在试验开始时就实施了价值比较过程，但是基于模型的学习者将更多的权重分配给前瞻性计算。在这里，通过眼动追踪，我们报告了对先前结果的不同解释的证据：基于模型的受试者在试验开始之前就做出了选择。相反，无模型主体倾向于忽略任务的基于模型的方面，而是将决策问题视为两个具有不同价值的项目之间的简单比较过程，这与先前对决策制定的顺序抽样模型所做的工作一致。这些发现说明了假设实验对象在相同的规定时间做出决定的问题。

著录项

来源
《Nature Communications》 |2016年第1期|共页
作者
Arkady Konovalov; Ian Krajbich;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类自然科学总论;
关键词

相似文献

外文文献
中文文献
专利

1. States versus rewards: dissociable neural prediction error signals underlying model-based and model-free reinforcement learning. [J] . Glascher J, Daw N, Dayan P, Neuron . 2010,第4期

机译：状态与回报：基于模型和无模型的强化学习背后的可分离的神经预测错误信号。
2. Neurostimulation Reveals Context-Dependent Arbitration Between Model-Based and Model-Free Reinforcement Learning [J] . Weissengruber Sebastian, Lee Sang Wan, ODoherty John P., Cerebral cortex . 2019,第11期

机译：Neurostimulation揭示了基于模型和无模型加强学习之间的背景相关的仲裁
3. Multifidelity Reinforcement Learning With Gaussian Processes: Model-Based and Model-Free Algorithms [J] . Suryan Varun, Gondhalekar Nahush, Tokekar Pratap IEEE Robotics & Automation Magazine . 2020,第2期

机译：高斯工艺的多程度强化学习：基于模型和无模型算法
4. EEG-based classification of learning strategies : Model-based and model-free reinforcement learning [C] . Dongjae Kim, Charles Weston, Sang Wan Lee 2018 6th International Conference on Brain-Computer Interface . 2018

机译：基于脑电图的学习策略分类：基于模型和无模型的强化学习
5. Understanding Model-Based Reinforcement Learning and its Application in Safe Reinforcement Learning [D] . Hu, Dingcheng . 2019

机译：了解基于模型的强化学习及其在安全强化学习中的应用
6. Gaze data reveal distinct choice processes underlying model-based and model-free reinforcement learning [O] . Arkady Konovalov, Ian Krajbich -1

机译：凝视数据揭示了基于模型和无模型的强化学习背后的不同选择过程
7. States versus Rewards: Dissociable Neural Prediction Error Signals Underlying Model-Based and Model-Free Reinforcement Learning [O] . Gläscher Jan, Daw Nathaniel, Dayan Peter, 2010

机译：状态与奖励：基于模型和免费模型的强化学习背后的可分离神经预测误差信号

Gaze data reveal distinct choice processes underlying model-based and model-free reinforcement learning

摘要

著录项

相似文献

相关主题

期刊订阅