Composition of Feature Space and State Space Dynamics Models for Model-based Reinforcement Learning

Akihiko YAMAGUCHI; Jun TAKAMATSU; Tsukasa OGASAWARA

Abstract

Learning a dynamics model and a reward model during reinforcement learning is useful because the agent can also use the models to update its value function. In this paper, we propose a general dynamics model that composes a feature space dynamics model with a state space dynamics model. Because the state space dynamics are linear, this composition generalizes well from a small number of samples without losing accuracy. We compare several dynamics models in simulation, each used together with a Dyna algorithm.
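
The Dyna setting mentioned in the abstract can be illustrated with a minimal tabular Dyna-Q sketch. This is not the authors' composed feature space and state space model: the chain environment, the function name `dyna_q`, and all parameter values are assumptions chosen for illustration, and the learned model here is a plain lookup table. It only shows how a learned dynamics and reward model can supply extra value-function updates alongside real experience.

```python
import random


def dyna_q(n_states=6, episodes=30, planning_steps=10,
           alpha=0.5, gamma=0.95, epsilon=0.1, seed=0):
    """Tabular Dyna-Q on a hypothetical deterministic chain (goal at the right end)."""
    rng = random.Random(seed)
    actions = (-1, +1)  # move left / move right
    goal = n_states - 1
    q = {(s, a): 0.0 for s in range(n_states) for a in actions}
    model = {}  # learned dynamics and reward model: (s, a) -> (next_state, reward)

    def step(s, a):
        # Assumed environment: walls at both ends, reward 1 on reaching the goal.
        s2 = min(max(s + a, 0), goal)
        return s2, (1.0 if s2 == goal else 0.0)

    def greedy(s):
        best = max(q[(s, a)] for a in actions)
        return rng.choice([a for a in actions if q[(s, a)] == best])

    def update(s, a, r, s2):
        # One-step Q-learning backup; the terminal state has value 0.
        future = 0.0 if s2 == goal else gamma * max(q[(s2, b)] for b in actions)
        q[(s, a)] += alpha * (r + future - q[(s, a)])

    for _ in range(episodes):
        s = 0
        while s != goal:
            a = rng.choice(actions) if rng.random() < epsilon else greedy(s)
            s2, r = step(s, a)
            update(s, a, r, s2)       # learn from the real sample
            model[(s, a)] = (s2, r)   # refine the learned model
            for _ in range(planning_steps):
                ps, pa = rng.choice(list(model))  # replay a previously seen pair
                ps2, pr = model[(ps, pa)]
                update(ps, pa, pr, ps2)  # learn from a simulated sample
            s = s2
    return q


if __name__ == "__main__":
    q = dyna_q()
    for s in range(5):
        print(s, round(q[(s, -1)], 3), round(q[(s, +1)], 3))
```

Replacing the lookup table `model` with a parametric dynamics model is where a generalizing model such as the one proposed in the paper would plug in; that substitution is not shown here.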

Bibliographic record

Source
電子情報通信学会技術研究報告. 非線形問題. Nonlinear Problems, 2009, No. 124, 6 pages
Authors
Akihiko YAMAGUCHI; Jun TAKAMATSU; Tsukasa OGASAWARA
Original format: PDF
Language of text: English
Chinese Library Classification: Communications
Keywords
Model-based reinforcement learning; Dyna-style planning; Prioritized sweeping; Dynamics model
Date added to database: 2022-08-19 06:29:18

Similar literature

1. Composition of Feature Space and State Space Dynamics Models for Model-based Reinforcement Learning [J]. Akihiko YAMAGUCHI, Jun TAKAMATSU, Tsukasa OGASAWARA. 電子情報通信学会技術研究報告. 非線形問題. Nonlinear Problems. 2009, No. 124
2. Composition of Feature Space and State Space Dynamics Models for Model-based Reinforcement Learning [J]. Akihiko YAMAGUCHI, Jun TAKAMATSU, Tsukasa OGASAWARA. 電子情報通信学会技術研究報告. ニュ-ロコンピュ-ティング. Neurocomputing. 2009, No. 125
3. Composition of Feature Space and State Space Dynamics Models for Model-based Reinforcement Learning [J]. Akihiko YAMAGUCHI, Jun TAKAMATSU, Tsukasa OGASAWARA. 電子情報通信学会技術研究報告. 2009, No. 124
4. Graphical Model-Based Learning in High Dimensional Feature Spaces [C]. Zhao Song, Yuke Zhu. AAAI conference on artificial intelligence; Innovative applications of artificial intelligence conference; Symposium on educational advances in artificial intelligence. 2013
5. Understanding Model-Based Reinforcement Learning and its Application in Safe Reinforcement Learning [D]. Hu, Dingcheng. 2019
6. Data-Driven Living Spaces’ Heating Dynamics Modeling in Smart Buildings using Machine Learning-Based Identification [O]. Roozbeh Sadeghian Broujeny, Kurosh Madani, Abdennasser Chebira. 2020
7. Task complexity interacts with state-space uncertainty in the arbitration process between model-based and model-free reinforcement-learning at both behavioral and neural levels [O]. Dongjae Kim, Geon Yeong Park, John P. O’Doherty. 2018
