Current Opinion in Neurobiology

The ubiquity of model-based reinforcement learning.



Abstract

The reward prediction error (RPE) theory of dopamine (DA) function has enjoyed great success in the neuroscience of learning and decision-making. This theory is derived from model-free reinforcement learning (RL), in which choices are made simply on the basis of previously realized rewards. Recently, attention has turned to correlates of more flexible, albeit computationally complex, model-based methods in the brain. These methods are distinguished from model-free learning by their evaluation of candidate actions using expected future outcomes according to a world model. Puzzlingly, signatures from these computations seem to be pervasive in the very same regions previously thought to support model-free learning. Here, we review recent behavioral and neural evidence about these two systems, in an attempt to reconcile their enigmatic cohabitation in the brain.
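To make the distinction in the abstract concrete, here is a minimal, hypothetical sketch (not from the paper): a model-free Q-learner that updates values from realized rewards via a reward prediction error (RPE), alongside a model-based evaluator that scores candidate actions by propagating expected future outcomes through a world model. The toy MDP, parameter values, and all names are assumptions for illustration only.

```python
import numpy as np

# Toy MDP (invented for illustration): 3 states, 2 actions.
n_states, n_actions = 3, 2
rng = np.random.default_rng(0)

# A hypothetical world model: transition probabilities P[s, a, s']
# and expected rewards R[s, a].
P = rng.dirichlet(np.ones(n_states), size=(n_states, n_actions))
R = rng.uniform(size=(n_states, n_actions))

gamma, alpha = 0.9, 0.1  # discount factor, learning rate (assumed)

def sample_step(s, a):
    """Sample a next state and reward from the environment."""
    s_next = rng.choice(n_states, p=P[s, a])
    return s_next, R[s, a]

# Model-free learning: cache action values, updating them from
# previously realized rewards via the RPE (the quantity the DA
# theory identifies with phasic dopamine signals).
Q_mf = np.zeros((n_states, n_actions))
s = 0
for _ in range(5000):
    a = rng.integers(n_actions)              # random exploration
    s_next, r = sample_step(s, a)
    rpe = r + gamma * Q_mf[s_next].max() - Q_mf[s, a]
    Q_mf[s, a] += alpha * rpe
    s = s_next

# Model-based evaluation: compute action values directly from the
# world model by iterating the Bellman backup to a fixed point,
# i.e., evaluating candidate actions by expected future outcomes.
V = np.zeros(n_states)
for _ in range(200):
    Q_mb = R + gamma * P @ V    # expected value of each (s, a)
    V = Q_mb.max(axis=1)

print("model-free Q:\n", Q_mf.round(2))
print("model-based Q:\n", Q_mb.round(2))
```

Both routes converge to similar action values here; the point of the contrast is how they get there: the model-free learner needs only stored values and an RPE at each step, while the model-based evaluator needs the full transition and reward model but can revalue actions instantly when that model changes.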

