International Joint Conference on Artificial Intelligence

Increasingly Cautious Optimism for Practical PAC-MDP Exploration

Abstract

The exploration strategy is an essential part of a learning agent in model-based Reinforcement Learning. R-MAX and V-MAX are PAC-MDP strategies proven to have polynomial sample complexity; yet their exploration behavior tends to be overly cautious in practice. We propose the principle of Increasingly Cautious Optimism (ICO) to automatically cut off unnecessarily cautious exploration, and apply ICO to R-MAX and V-MAX, yielding two new strategies: Increasingly Cautious R-MAX (ICR) and Increasingly Cautious V-MAX (ICV). We prove that both ICR and ICV are PAC-MDP, and show that their improvement is guaranteed by a tighter upper bound on sample complexity. We then demonstrate their significantly improved performance through empirical results.
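The abstract does not spell out the ICO mechanism, but the R-MAX idea it builds on is standard: state-action pairs visited fewer than m times are treated as "unknown" and assigned the optimistic value R_max/(1-γ) during planning. The sketch below is a minimal, illustrative R-MAX-style agent in which a hypothetical decaying known-state threshold stands in for the ICO principle; the names (RMaxLikeAgent, known_threshold, the m0/decay schedule) are our own assumptions for illustration, not the actual ICR/ICV rules from the paper.

```python
# Minimal R-MAX-style exploration sketch on a small tabular MDP.
# The decaying known_threshold() schedule is only an illustration of the
# increasingly-cautious-optimism idea; the exact ICR/ICV constructions
# are defined in the paper and differ from this toy schedule.
import numpy as np

class RMaxLikeAgent:
    def __init__(self, n_states, n_actions, r_max=1.0, gamma=0.95,
                 m0=20, decay=0.99):
        self.nS, self.nA = n_states, n_actions
        self.r_max, self.gamma = r_max, gamma
        self.m0, self.decay = m0, decay  # initial threshold and decay rate (assumed)
        self.counts = np.zeros((n_states, n_actions))            # visit counts N(s,a)
        self.r_sum = np.zeros((n_states, n_actions))             # accumulated rewards
        self.t_counts = np.zeros((n_states, n_actions, n_states))  # transition counts
        self.steps = 0

    def known_threshold(self):
        # Hypothetical ICO-style schedule: the agent demands fewer samples
        # before trusting its empirical model as experience accumulates.
        return max(1, int(self.m0 * self.decay ** self.steps))

    def update(self, s, a, r, s2):
        self.counts[s, a] += 1
        self.r_sum[s, a] += r
        self.t_counts[s, a, s2] += 1
        self.steps += 1

    def plan(self, n_iters=200):
        # Value iteration on the optimistic empirical model: unknown (s,a)
        # pairs get the maximal achievable value R_max / (1 - gamma).
        m = self.known_threshold()
        q = np.zeros((self.nS, self.nA))
        v_opt = self.r_max / (1.0 - self.gamma)
        for _ in range(n_iters):
            v = q.max(axis=1)
            for s in range(self.nS):
                for a in range(self.nA):
                    n = self.counts[s, a]
                    if n < m:
                        q[s, a] = v_opt  # optimism drives exploration of unknown pairs
                    else:
                        r_hat = self.r_sum[s, a] / n
                        p_hat = self.t_counts[s, a] / n
                        q[s, a] = r_hat + self.gamma * p_hat @ v
        return q

    def act(self, s):
        return int(self.plan().argmax(axis=1)[s])
```

The only ICO-flavored element here is known_threshold(): instead of the fixed threshold m of plain R-MAX, the required sample count shrinks as experience accumulates, so unnecessarily cautious exploration is cut off earlier. The actual ICR/ICV schedules, and the tighter sample complexity bound that justifies them, are derived in the paper.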