Bayesian Reinforcement Learning with Exploration

Abstract

We consider a general reinforcement learning problem and show that carefully combining the Bayesian optimal policy and an exploring policy leads to minimax sample-complexity bounds in a very general class of (history-based) environments. We also prove lower bounds and show that the new algorithm displays adaptive behaviour when the environment is easier than worst-case.

Bibliographic details

Source: International Conference on Algorithmic Learning Theory, 2014, 15 pages
Authors: Tor Lattimore; Marcus Hutter
Original format: PDF
Chinese Library Classification: TP301.6-53

Similar literature

1. Bayesian Reinforcement Learning and Bayesian Deep Learning for Blockchains With Mobile Edge Computing [J]. Asheralieva Alia, Niyato Dusit. IEEE Transactions on Cognitive Communications and Networking. 2021, No. 1
2. Rapidly Learning Bayesian Networks for Complex System Diagnosis: A Reinforcement Learning Directed Greedy Search Approach [J]. Zhang Wenfeng, Feng Wenquan, Zhao Hongbo. Quality Control, Transactions. 2020
3. Combination of learning from non-optimal demonstrations and feedbacks using inverse reinforcement learning and Bayesian policy improvement [J]. Ali Ezzeddine, Nafee Mourad, Babak Nadjar Araabi. Expert Systems with Applications. 2018, No. DECa
4. BSE-MAML: Model Agnostic Meta-Reinforcement Learning via Bayesian Structured Exploration [C]. Haonan Wang, Yiyun Zhang, Dawei Feng. IEEE International Conference on Services Computing. 2020
5. Bayesian Nonparametric Reinforcement Learning in LTE and Wi-Fi Coexistence [D]. Shih, Po-Kan. 2021
6. An exploration strategy improves the diversity of de novo ligands using deep reinforcement learning: a case for the adenosine A2A receptor [O]. Xuhan Liu, Kai Ye, Herman W. T. van Vlijmen. 2019
7. Learning Transferable Domain Priors for Safe Exploration in Reinforcement Learning [O]. Thommen George Karimpanal, Santu Rana, Sunil Gupta. 2020
