Journal: Artificial Life and Robotics

Reinforcement learning for dynamic environment: a classification of dynamic environments and a detection method of environmental changes


Abstract

Engineers and researchers are paying increasing attention to reinforcement learning (RL) as a key technique for realizing computational intelligence, such as adaptive and autonomous decentralized systems. In general, however, it is not easy to put RL into practical use. Our prior research mainly dealt with the problem of designing state and action spaces, for which we proposed an adaptive co-construction method. Designing state and action spaces is more difficult in dynamic environments than in static ones, so an adaptive co-construction method is even more useful there. This paper mainly deals with the problem of adaptation in dynamic environments. First, we classify dynamic-environment tasks and propose a method for detecting environmental changes so that the learner can adapt to them. Next, we conduct computational experiments on a so-called "path planning problem" in a slowly changing environment that is assumed to reflect the aging of the system. The experiments confirm the performance of a conventional RL method and of the proposed detection method.
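
The abstract does not describe how the proposed detection method works, so the following is only a generic illustration of what detecting an environmental change during RL can look like, not the authors' method. It assumes, purely for illustration, that a change shows up as a rise in the magnitude of the temporal-difference (TD) error. The class name `TDErrorChangeDetector` and the parameters `window`, `ratio`, and `floor` are hypothetical.

```python
from collections import deque


class TDErrorChangeDetector:
    """Hypothetical sketch: flag a possible environmental change when the
    recent mean |TD error| rises well above a slowly tracked baseline.
    This is an assumed heuristic, not the method proposed in the paper."""

    def __init__(self, window=20, ratio=3.0, floor=1e-3):
        self.window = window          # number of recent errors to average
        self.ratio = ratio            # how far above baseline counts as a change
        self.floor = floor            # lower bound so a zero baseline cannot trigger
        self.recent = deque(maxlen=window)
        self.baseline = None          # set once the first full window is seen

    def update(self, td_error):
        """Record one TD error; return True if a change is suspected."""
        self.recent.append(abs(td_error))
        if len(self.recent) < self.window:
            return False              # not enough data yet
        recent_mean = sum(self.recent) / len(self.recent)
        if self.baseline is None:
            self.baseline = recent_mean
            return False
        changed = recent_mean > self.ratio * max(self.baseline, self.floor)
        if not changed:
            # Track the baseline slowly while the environment looks stable.
            self.baseline = 0.99 * self.baseline + 0.01 * recent_mean
        return changed


def detect_changes(td_errors, **kwargs):
    """Run the detector over a sequence and return one flag per step."""
    detector = TDErrorChangeDetector(**kwargs)
    return [detector.update(e) for e in td_errors]
```

In use, an agent would call `update` once per learning step with its current TD error and, on a `True` result, respond in some way such as raising its exploration rate. Because the baseline itself is tracked, a very gradual drift of the kind the abstract mentions could be absorbed without ever triggering; a real detector for slowly changing environments would need to account for that.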
