Annual Conference on Neural Information Processing Systems

Efficient Bayes-Adaptive Reinforcement Learning using Sample-Based Search



Abstract

Bayesian model-based reinforcement learning is a formally elegant approach to learning optimal behaviour under model uncertainty, trading off exploration and exploitation in an ideal way. Unfortunately, finding the resulting Bayes-optimal policies is notoriously taxing, since the search space becomes enormous. In this paper we introduce a tractable, sample-based method for approximate Bayes-optimal planning which exploits Monte-Carlo tree search. Our approach outperformed prior Bayesian model-based RL algorithms by a significant margin on several well-known benchmark problems, because it avoids expensive applications of Bayes' rule within the search tree by lazily sampling models from the current beliefs. We illustrate the advantages of our approach by showing that it works in an infinite state space domain which is qualitatively out of reach of almost all previous work in Bayesian exploration.
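The core idea the abstract describes (drawing a complete model from the current posterior once per simulation at the root, then running an ordinary tree search through that sampled model, so no Bayes'-rule update is ever performed inside the tree) can be sketched in Python. This is a heavily simplified illustration, not the paper's algorithm: the two-armed Bernoulli bandit with Beta posteriors, the function names, and all parameters are assumptions chosen to keep the example self-contained.

```python
import math
import random
from collections import defaultdict

def bamcp_plan(alpha, beta, horizon=3, n_sims=2000, c=1.0, rng=None):
    """Approximate Bayes-adaptive planning for a 2-armed Bernoulli bandit
    via root sampling + UCT (a sketch of the lazy-sampling idea).

    alpha[i], beta[i]: Beta posterior parameters for arm i's success rate.
    Returns the greedy action at the root after n_sims simulations."""
    rng = rng or random.Random(0)
    N = defaultdict(int)     # visit count per (history, action)
    Nh = defaultdict(int)    # visit count per history
    Q = defaultdict(float)   # running mean return per (history, action)

    for _ in range(n_sims):
        # Root sampling: draw one complete MDP (here, arm means) from the
        # posterior. The whole simulation then uses this fixed sample, so
        # no posterior update is needed inside the search tree.
        p = [rng.betavariate(a, b) for a, b in zip(alpha, beta)]
        _simulate((), p, horizon, N, Nh, Q, c, rng)

    return max(range(len(alpha)), key=lambda a: Q[((), a)])

def _simulate(h, p, depth, N, Nh, Q, c, rng):
    """One UCT rollout through the history-indexed tree."""
    if depth == 0:
        return 0.0

    def ucb(a):
        if N[(h, a)] == 0:
            return float('inf')  # force one try of each untried action
        return Q[(h, a)] + c * math.sqrt(math.log(Nh[h] + 1) / N[(h, a)])

    a = max(range(len(p)), key=ucb)
    r = 1.0 if rng.random() < p[a] else 0.0
    ret = r + _simulate(h + ((a, r),), p, depth - 1, N, Nh, Q, c, rng)

    # Back up the return along the visited (history, action) edge.
    Nh[h] += 1
    N[(h, a)] += 1
    Q[(h, a)] += (ret - Q[(h, a)]) / N[(h, a)]
    return ret
```

With a posterior that strongly favours arm 1, e.g. `bamcp_plan([1, 5], [5, 1])` (arm 0 ~ Beta(1,5), arm 1 ~ Beta(5,1)), the planner's root action concentrates on arm 1. The key design point mirrored from the abstract is that the posterior is touched only once per simulation, at the root, rather than at every node of the tree.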


