...
首页> 外文期刊>Nature Communications >Gaze data reveal distinct choice processes underlying model-based and model-free reinforcement learning
【24h】

Gaze data reveal distinct choice processes underlying model-based and model-free reinforcement learning

机译:凝视数据揭示了基于模型和无模型强化学习的独特选择过程

获取原文
           

摘要

Organisms appear to learn and make decisions using different strategies known as model-free and model-based learning; the former is mere reinforcement of previously rewarded actions and the latter is a forward-looking strategy that involves evaluation of action-state transition probabilities. Prior work has used neural data to argue that both model-based and model-free learners implement a value comparison process at trial onset, but model-based learners assign more weight to forward-looking computations. Here using eye-tracking, we report evidence for a different interpretation of prior results: model-based subjects make their choices prior to trial onset. In contrast, model-free subjects tend to ignore model-based aspects of the task and instead seem to treat the decision problem as a simple comparison process between two differentially valued items, consistent with previous work on sequential-sampling models of decision making. These findings illustrate a problem with assuming that experimental subjects make their decisions at the same prescribed time.
机译:有机体似乎使用称为无模型和基于模型的学习的不同策略来学习和决策。前者仅仅是对先前奖励行动的加强,而后者是一种前瞻性战略,涉及对行动状态转变概率的评估。先前的工作已经使用神经数据论证了基于模型的学习者和无模型的学习者都在试验开始时就实施了价值比较过程,但是基于模型的学习者将更多的权重分配给前瞻性计算。在这里,通过眼动追踪,我们报告了对先前结果的不同解释的证据:基于模型的受试者在试验开始之前就做出了选择。相反,无模型主体倾向于忽略任务的基于模型的方面,而是将决策问题视为两个具有不同价值的项目之间的简单比较过程,这与先前对决策制定的顺序抽样模型所做的工作一致。这些发现说明了假设实验对象在相同的规定时间做出决定的问题。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号