A Geometric Perspective on Optimal Representations for Reinforcement Learning

机译：关于加固学习最优表示的几何视角

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We propose a new perspective on representation learning in reinforcement learning based on geometric properties of the space of value functions. We leverage this perspective to provide formal evidence regarding the usefulness of value functions as auxiliary tasks. Our formulation considers adapting the representation to minimize the (linear) approximation of the value function of all stationary policies for a given environment. We show that this optimization reduces to making accurate predictions regarding a special class of value functions which we call adversarial value functions (AVFs). We demonstrate that using value functions as auxiliary tasks corresponds to an expected-error relaxation of our formulation, with AVFs a natural candidate, and identify a close relationship with proto-value functions (Mahadevan, 2005). We highlight characteristics of AVFs and their usefulness as auxiliary tasks in a series of experiments on the four-room domain.

机译：我们提出了一种基于价值函数空间的几何特性的加固学习的代表学习的新视角。我们利用此视角来为有关辅助任务的有用函数的有用性提供正式的证据。我们的配方考虑适应表示，以最小化给定环境所有静止策略的值函数的（线性）近似。我们表明，这种优化减少了关于我们调用逆势价值函数（AVFS）的特殊价值函数的准确预测。我们证明，使用价值函数作为辅助任务对应于我们的配方的预期误差放松，以及AVFS一种自然候选，并识别与原型函数的密切关系（MahadeVan，2005）。我们在四室域的一系列实验中突出了AVFS及其用途作为辅助任务的特点。

著录项

来源
《Conference on Neural Information Processing Systems》|2020年|p3969-4770|共12页
会议地点
作者
Marc G. Bellemare; Will Dabney; Robert Dadashi; Adrien Ali Taiga; Pablo Samuel Castro; Nicolas Le Roux; Dale Schuurmans; Tor Lattimore; Clare Lyle;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类计量学;
关键词
入库时间 2022-08-21 10:47:57

相似文献

外文文献
中文文献
专利

1. Autonomous Learning of State Representations for Control: An Emerging Field Aims to Autonomously Learn State Representations for Reinforcement Learning Agents from Their Real-World Sensor Observations [J] . Wendelin Bohmer, Jost Tobias Springenberg, Joschka Boedecker, Kunstliche Intelligenz . 2015,第4期

机译：控制状态表示的自主学习：一个新兴领域旨在从现实世界的传感器观察中自主学习强化学习代理的状态表示
2. Hierarchical Optimal Synchronization for Linear Systems via Reinforcement Learning: A Stackelberg–Nash Game Perspective [J] . Li Man, Qin Jiahu, Ma Qichao, Neural Networks and Learning Systems, IEEE Transactions on . 2021,第4期

机译：通过加固学习的线性系统的分层最优同步：Stackelberg-Nash游戏视角
3. Reinforcement Learning Toolbox: Reinforcement Learning for Optimal Control Tasks Institute for Theoretical Computer Science TU-GRAZ [J] . Gerhard Neumann OGAI Journal . 2007,第3期

机译：强化学习工具箱：针对最优控制任务的强化学习理论计算机科学研究院TU-GRAZ
4. A Geometric Perspective on Optimal Representations for Reinforcement Learning [C] . Marc G. Bellemare, Will Dabney, Robert Dadashi, Conference on Neural Information Processing Systems . 2020

机译：关于加固学习最优表示的几何视角
5. Scaling up reinforcement learning without sacrificing optimality by constraining exploration. [D] . Mann, Timothy Arthur. 2012

机译：通过限制探索，在不牺牲最优性的情况下扩大强化学习。
6. The Outcome-Representation Learning model: a novel reinforcement learning model of the Iowa Gambling Task [O] . Nathaniel Haines, Jasmin Vassileva, Woo-Young Ahn -1

机译：结果表征学习模型：爱荷华州赌博任务的新型强化学习模型
7. Selecting Near-Optimal Approximate State Representations in Reinforcement Learning [O] . Ronald Ortner, Odalric-ambrym Maillard, Daniil Ryabko 2016

机译：在强化学习中选择近似最佳近似状态表示

A Geometric Perspective on Optimal Representations for Reinforcement Learning

摘要

著录项

相似文献

相关主题

期刊订阅