...
首页> 外文期刊>Engineering Applications of Artificial Intelligence >A novel modular Q-learning architecture to improve performance under incomplete learning in a grid soccer game
【24h】

A novel modular Q-learning architecture to improve performance under incomplete learning in a grid soccer game

机译:一种新颖的模块化Q学习架构,可在网格足球游戏的不完全学习情况下提高性能

获取原文
获取原文并翻译 | 示例
           

摘要

Multi-agent reinforcement learning methods suffer from several deficiencies that are rooted in the large state space of multi-agent environments. This paper tackles two deficiencies of multi-agent reinforcement learning methods: their slow learning rate, and low quality decision-making in early stages of learning. The proposed methods are applied in a grid-world soccer game. In the proposed approach, modular reinforcement learning is applied to reduce the state space of the learning agents from exponential to linear in terms of the number of agents. The modular model proposed here includes two new modules, a partial-module and a single-module. These two new modules are effective for increasing the speed of learning in a soccer game. We also apply the instance-based learning concepts, to choose proper actions in states that are not experienced adequately during learning. The key idea is to use neighbouring states that have been explored sufficiently during the learning phase. The results of experiments in a grid-soccer game environment show that our proposed methods produce a higher average reward compared to the situation where the proposed method is not applied to the modular structure.
机译:多主体强化学习方法存在许多缺陷,这些缺陷源于多主体环境的大型状态空间。本文解决了多主体强化学习方法的两个缺陷:学习速度慢和学习初期的低质量决策。所提出的方法被应用于网格世界足球比赛中。在提出的方法中,应用模块化强化学习以将学习代理的状态空间从代理数量方面从指数减小为线性。这里提出的模块化模型包括两个新模块,一个局部模块和一个单一模块。这两个新模块可有效提高足球比赛中的学习速度。我们还应用基于实例的学习概念,以选择在学习过程中未充分体验的状态下的适当动作。关键思想是使用在学习阶段已充分探究的相邻状态。在网格足球游戏环境中的实验结果表明,与未将其应用到模块化结构的情况相比,我们的方法产生了更高的平均奖励。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号