首页> 外文期刊>Mechatronics, IEEE/ASME Transactions on >Incremental Reinforcement Learning With Prioritized Sweeping for Dynamic Environments
【24h】

Incremental Reinforcement Learning With Prioritized Sweeping for Dynamic Environments

机译:动态环境中优先扫描的增量强化学习

获取原文
获取原文并翻译 | 示例
           

摘要

In this paper, a novel incremental learning algorithm is presented for reinforcement learning (RL) in dynamic environments, where the rewards of state-action pairs may change over time. The proposed incremental RL (IRL) algorithm learns from the dynamic environments without making any assumptions or having any prior knowledge about the ever-changing environment. First, IRL generates a detector-agent to detect the changed part of the environment (drift environment) by executing a virtual RL process. Then, the agent gives priority to the drift environment and its neighbor environment for iteratively updating their state-action value functions using new rewards by dynamic programming. After the prioritized sweeping process, IRL restarts a canonical learning process to obtain a new optimal policy adapting to the new environment. The novelty is that IRL fuses the new information into the existing knowledge system incrementally as well as weakening the conflict between them. The IRL algorithm is compared to two direct approaches and various state-of-the-art transfer learning methods for classical maze navigation problems and an intelligent warehouse with multiple robots. The experimental results verify that IRL can effectively improve the adaptability and efficiency of RL algorithms in dynamic environments.
机译:在本文中,提出了一种新颖的增量学习算法,用于动态环境中的状态学习对的奖励可能随时间变化的强化学习(RL)。所提出的增量RL(IRL)算法可从动态环境中学习,而无需进行任何假设或具有有关不断变化的环境的任何先验知识。首先,IRL生成检测器代理程序,以通过执行虚拟RL流程来检测环境的变化部分(漂移环境)。然后,代理优先考虑漂移环境及其邻近环境,以便通过动态编程使用新的奖励来迭代地更新其状态作用值函数。在确定优先级的清除过程之后,IRL重新启动规范的学习过程,以获得适应新环境的新的最佳策略。新颖之处在于,IRL将新信息逐渐融合到现有的知识系统中,并削弱了它们之间的冲突。将IRL算法与两种直接方法以及针对经典迷宫导航问题和具有多个机器人的智能仓库的各种最新转移学习方法进行了比较。实验结果证明,IRL可以有效提高动态环境中RL算法的适应性和效率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号