Generalization and Exploration via Randomized Value Functions

机译：随机价值函数的泛化与探索

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We propose randomized least-squares value iteration (RLSVI) - a new reinforcement learning algorithm designed to explore and generalize efficiently via linearly parameterized value functions. We explain why versions of least-squares value iteration that use Boltzmann or ∈-greedy exploration can be highly inefficient, and we present computational results that demonstrate dramatic efficiency gains enjoyed by RLSVI. Further, we establish an upper bound on the expected regret of RLSVI that demonstrates nearoptimality in a tabula rasa learning context. More broadly, our results suggest that randomized value functions offer a promising approach to tackling a critical challenge in reinforcement learning: synthesizing efficient exploration and effective generalization.

机译：我们提出随机的最小二乘值迭代（RLSVI） - 一种新的加强学习算法，旨在通过线性参数化值函数有效地探索和概括。我们解释为什么使用Boltzmann或贪婪探索的最小二乘价值迭代的版本可能是高效的，我们呈现了展示RLSVI享有的戏剧性效率的计算结果。此外，我们在RLSVI的预期遗憾中建立了一个上限，证明了塔杜拉RAS学习环境中的内容。更广泛地，我们的结果表明随机价值职能提供了一个有希望的方法来解决强化学习中的危急挑战：综合有效的探索和有效泛化。

著录项

来源
《International Conference on Machine Learning》|2016年|2978-3724p|共22页
会议地点
作者
Ian Osband; Benjamin Van Roy; Zheng Wen;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP181-53;
关键词

相似文献

外文文献
中文文献
专利

1. A generalization of Schur functions: Applications to Nevanlinna functions, orthogonal polynomials, random walks and unitary and open quantum walks [J] . Grunbaum F. A., Velazquez L. Advances in Mathematics . 2018,第期

机译：Schur函数的概念：对Nevanlinna功能，正交多项式，随机散步和酉和开放量子播放的应用
2. Deep Exploration via Randomized Value Functions [J] . Ian Osband, Benjamin Van Roy, Daniel J. Russo, Journal of machine learning research . 2019,第a期

机译：通过随机价值函数进行深度探索
3. Generalization Bounds and Complexities Based on Sparsity and Clustering for Convex Combinations of Functions from Random Classes [J] . Jaeger Savina Andonova Journal of machine learning research . 2005,第Mar期

机译：基于稀疏和聚类的随机类函数凸组合的广义界和复杂度
4. Generalization and Exploration via Randomized Value Functions [C] . Ian Osband, Benjamin Van Roy, Zheng Wen International Conference on Machine Learning . 2016

机译：随机价值函数的泛化与探索
5. Exponential Random Graphs and a Generalization of Parking Functions [D] . DeMuse, Ryan. 2021

机译：指数随机图和停车功能的概括
6. Multifractality of random eigenfunctions and generalization of Jarzynski equality [O] . I.M. Khaymovich, J.V. Koski, O.-P. Saira, -1

机译：随机本征函数的多重性和Jarzynski等式的推广
7. Generalization and Exploration via Randomized Value Functions [O] . Osband, Ian, Van Roy, Benjamin, Wen, Zheng 2016

机译：随机值函数的推广与探索

Generalization and Exploration via Randomized Value Functions

摘要

著录项

相似文献

相关主题

期刊订阅