Selecting Near-Optimal Approximate State Representations in Reinforcement Learning

Abstract

We consider a reinforcement learning setting introduced in [5] where the learner does not have explicit access to the states of the underlying Markov decision process (MDP). Instead, she has access to several models that map histories of past interactions to states. Here we improve over known regret bounds in this setting, and more importantly generalize to the case where the models given to the learner do not contain a true model resulting in an MDP representation, but only approximations of it. We also give improved error bounds for state aggregation.
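
To make the setting concrete, the following is a minimal Python sketch of what such history-to-state models might look like. It is purely illustrative and not taken from the paper: the type aliases and the two toy candidate models are assumptions introduced here for illustration only.

```python
# A minimal sketch (not the paper's algorithm) of the setting described above:
# the learner never observes the MDP state directly, only a history of past
# interactions, and is given several candidate models that each map such a
# history to a state. All names below are illustrative assumptions.

from typing import Callable, Hashable, List, Tuple

# A history is a sequence of (observation, action, reward) triples.
History = List[Tuple[Hashable, Hashable, float]]

# A state-representation model maps a history to a (discrete) state.
StateModel = Callable[[History], Hashable]

def last_observation_model(history: History) -> Hashable:
    """Candidate model 1: the state is simply the most recent observation."""
    return history[-1][0] if history else None

def last_two_observations_model(history: History) -> Hashable:
    """Candidate model 2: the state is the pair of the two most recent observations."""
    return tuple(obs for (obs, _, _) in history[-2:])

# The learner is handed a finite set of such models; the paper studies how to
# act with small regret when, at best, some of these induce only an
# *approximate* MDP over their states rather than a true one.
candidate_models: List[StateModel] = [
    last_observation_model,
    last_two_observations_model,
]

if __name__ == "__main__":
    # Example history: observations 'a', 'b', actions 0/1, rewards 0.0/1.0.
    h: History = [("a", 0, 0.0), ("b", 1, 1.0)]
    for phi in candidate_models:
        print(phi.__name__, "->", phi(h))
```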