首页> 美国卫生研究院文献>other >Model-Based and Model-Free Pavlovian Reward Learning: Revaluation Revision and Revelation

【2h】

Model-Based and Model-Free Pavlovian Reward Learning: Revaluation Revision and Revelation

机译：基于模型和免费模型的巴甫洛夫奖赏学习：重估修订和启示

代理获取

本网站仅为用户提供外文OA文献查询和代理获取服务，本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文，但由于OA文献来源多样且变更频繁，仍可能出现获取不到、文献不完整或与标题不符等情况，如果获取不到我们将提供退款服务。请知悉。

页面导航

摘要
著录项
相似文献
相关主题

摘要

Evidence supports at least two methods for learning about reward and punishment and making predictions for guiding actions. One method, called model-free, progressively acquires cached estimates of the long-run values of circumstances and actions from retrospective experience. The other method, called model-based, uses representations of the environment, expectations and prospective calculations to make cognitive predictions of future value. Extensive attention has been paid to both methods in computational analyses of instrumental learning. By contrast, although a full computational analysis has been lacking, Pavlovian learning and prediction has typically been presumed to be solely model-free. Here, we revise that presumption and review compelling evidence from Pavlovian revaluation experiments showing that Pavlovian predictions can involve their own form of model-based evaluation. In model-based Pavlovian evaluation, prevailing states of the body and brain influence value computations, and thereby produce powerful incentive motivations that can sometimes be quite new. We consider the consequences of this revised Pavlovian view for the computational landscape of prediction, response and choice. We also revisit differences between Pavlovian and instrumental learning in the control of incentive motivation.

机译：证据至少支持两种方法来学习奖励和惩罚，并为指导行动做出预测。一种方法称为无模型，它从追溯经验中逐步获取对环境和操作的长期价值的缓存估计。另一种方法称为基于模型的方法，它使用环境的表示形式，期望值和前瞻性计算来对未来价值进行认知预测。在工具学习的计算分析中已经广泛关注这两种方法。相比之下，尽管缺乏完整的计算分析，但通常假定巴甫洛夫式的学习和预测是完全无模型的。在这里，我们修改了推定，并回顾了巴甫洛夫重估实验的有力证据，这些证据表明巴甫洛夫的预测可能涉及其自身的基于模型的评估形式。在基于模型的巴甫洛夫式评估中，身体和大脑的普遍状态影响价值计算，从而产生有时可能是相当新的强大激励动机。我们考虑了这种修改后的巴甫洛夫观点对预测，响应和选择的计算前景的影响。我们还回顾了在激励动机控制方面，巴甫洛夫学习法和工具学习法之间的差异。

著录项

期刊名称 other
作者
Peter Dayan; Kent C. Berridge;
展开▼
作者单位

展开▼
年(卷),期 -1(14),2
年度 -1
页码 473–492
总页数 33
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Model-based and model-free Pavlovian reward learning: Revaluation, revision, and revelation [J] . Cognitive, affective & behavioral neuroscience . 2014,第2期

机译：基于模型和无模型的Pavlovian奖励学习：重估，修订和启示
2. The involvement of model-based but not model-free learning signals during observational reward learning in the absence of choice [J] . Simon Dunne, Arun DSouza, John P. ODoherty Journal of Neurophysiology . 2016,第6期

机译：在没有选择的情况下，在观察性奖励学习过程中涉及基于模型的学习信号，而不是没有模型的学习信号
3. States versus rewards: dissociable neural prediction error signals underlying model-based and model-free reinforcement learning. [J] . Glascher J, Daw N, Dayan P, Neuron . 2010,第4期

机译：状态与回报：基于模型和无模型的强化学习背后的可分离的神经预测错误信号。
4. Model-free Reinforcement Learning in Infinite-horizon Average-reward Markov Decision Processes [C] . Chen-Yu Wei, Mehdi Jafarnia-Jahromi, Haipeng Luo, International Conference on Machine Learning . 2021

机译：无限地平线平均奖励马尔可夫决策过程的无模型加强学习
5. Dopamine’s Role in Learning Pavlovian Cues Associated with Different Reward Sizes [D] . Dejeux, Mariana Isabelle Hellen. 2021

机译：多巴胺在学习与不同奖励大小相关的Pavlovian线索中的角色
6. The involvement of model-based but not model-free learning signals during observational reward learning in the absence of choice [O] . Simon Dunne, Arun DSouza, John P. ODoherty -1

机译：在没有选择的情况下基于模型的学习信号而不是非模型的学习信号在观察性奖励学习中的参与
7. Model-based and model-free Pavlovian reward learning: Revaluation, revision, and revelation [O] . Peter Dayan, Kent C. Berridge 2015

机译：基于模型和模型的巴甫洛夫奖励学习：重估，修订和启示

Model-Based and Model-Free Pavlovian Reward Learning: Revaluation Revision and Revelation

摘要

著录项

相似文献

相关主题

期刊订阅