Conference: Uncertainty in Artificial Intelligence

PAC-Bayesian Policy Evaluation for Reinforcement Learning


Abstract

Bayesian priors offer a compact yet general means of incorporating domain knowledge into many learning tasks. The correctness of the Bayesian analysis and inference, however, largely depends on the accuracy and correctness of these priors. PAC-Bayesian methods overcome this problem by providing bounds that hold regardless of the correctness of the prior distribution. This paper introduces the first PAC-Bayesian bound for the batch reinforcement learning problem with function approximation. We show how this bound can be used to perform model selection in a transfer learning scenario. Our empirical results confirm that PAC-Bayesian policy evaluation is able to leverage prior distributions when they are informative and, unlike standard Bayesian RL approaches, ignore them when they are misleading.
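
For context, PAC-Bayesian guarantees of the kind referenced here follow a common template; the paper's RL-specific bound is not reproduced on this page, so the sketch below shows one standard supervised-learning form (the Maurer refinement of McAllester's bound). With probability at least $1-\delta$ over an i.i.d. sample of size $m$, simultaneously for every posterior $Q$ over hypotheses and for any fixed prior $P$,

\mathbb{E}_{h \sim Q}\!\left[R(h)\right] \;\le\; \mathbb{E}_{h \sim Q}\!\left[\hat{R}(h)\right] \;+\; \sqrt{\frac{\mathrm{KL}(Q \,\|\, P) + \ln\!\left(2\sqrt{m}/\delta\right)}{2m}},

where $R(h)$ is the true risk and $\hat{R}(h)$ the empirical risk. The bound holds no matter how poorly $P$ is chosen; a misleading prior only inflates the $\mathrm{KL}(Q \,\|\, P)$ penalty, which is exactly the property the abstract appeals to when contrasting PAC-Bayesian policy evaluation with standard Bayesian RL.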