Artificial life and robotics

Developing reinforcement learning for adaptive co-construction of continuous high-dimensional state and action spaces



Abstract

Engineers and researchers are paying increasing attention to reinforcement learning (RL) as a key technique for realizing adaptive and autonomous decentralized systems. In general, however, putting RL to practical use is not easy; our approach addresses the problem of designing state and action spaces. Previously, an adaptive state-space construction method called a "state space filter" and an adaptive action-space construction method called "switching RL" were proposed, each under the assumption that the other space was fixed. We have since unified these two construction methods into a single method that mimics an infant's perceptual and motor development, guided by an introduced "entropy" measure. In this paper, a computational experiment is conducted on a "robot navigation problem" with a three-dimensional continuous state space and a two-dimensional continuous action space, which is more complicated than the usual "path planning problem". The results confirm the validity of the proposed method.
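As a rough illustration of the entropy-based criterion described in the abstract, the sketch below refines (splits) a state region when the agent's action preferences there remain near-uniform, i.e. when the entropy of its action-selection distribution stays high. The softmax policy, the normalized-entropy threshold, and all function names are illustrative assumptions, not the authors' actual formulation.

```python
import math

def action_entropy(q_values, tau=1.0):
    """Shannon entropy (in nats) of a softmax policy over Q-values.

    High entropy means the agent has no clear action preference in
    this state region -- a hint that the region may be too coarse.
    """
    m = max(q_values)  # subtract max for numerical stability
    exps = [math.exp((q - m) / tau) for q in q_values]
    z = sum(exps)
    probs = [e / z for e in exps]
    return -sum(p * math.log(p) for p in probs if p > 0.0)

def should_refine(q_values, threshold=0.9, tau=1.0):
    """Split a state region when normalized action entropy exceeds
    a threshold (hypothetical criterion for illustration only)."""
    max_entropy = math.log(len(q_values))  # entropy of uniform policy
    return action_entropy(q_values, tau) / max_entropy > threshold
```

With identical Q-values the policy is uniform, normalized entropy is 1, and the region would be split; with one strongly dominant Q-value the entropy is near zero and the region is left as-is.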
