A novel modular Q-learning architecture to improve performance under incomplete learning in a grid soccer game

Sahar Araghi; Abbas Khosravi; Michael Johnstone; Douglas Creighton


Abstract

Multi-agent reinforcement learning methods suffer from several deficiencies that are rooted in the large state space of multi-agent environments. This paper tackles two of them: slow learning, and low-quality decision-making in the early stages of learning. The proposed methods are applied in a grid-world soccer game. Modular reinforcement learning is used to reduce the state space of the learning agents from exponential to linear in the number of agents. The modular model proposed here includes two new modules, a partial-module and a single-module, which are effective for increasing the speed of learning in a soccer game. We also apply instance-based learning concepts to choose proper actions in states that were not experienced adequately during learning. The key idea is to use neighbouring states that have been explored sufficiently during the learning phase. Experiments in a grid-soccer game environment show that the modular structure produces a higher average reward with the proposed methods than without them.
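The two ideas in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not define the partial-module or single-module, so the sketch assumes a generic decomposition with one pairwise module per other agent, and a neighbourhood of states that differ by one grid step in one coordinate. The names `ACTIONS`, `monolithic_states`, `modular_states`, `neighbours`, `choose_action`, and the threshold `min_visits` are all hypothetical.

```python
# Hypothetical action set for a grid-world player (assumption, not from the paper).
ACTIONS = ["up", "down", "left", "right", "stay"]


def monolithic_states(cells: int, agents: int) -> int:
    """State count when a single table tracks the cell of every agent."""
    return cells ** agents


def modular_states(cells: int, agents: int) -> int:
    """Total state count with one pairwise module per other agent (assumed design).

    Each module sees only the learner and one other agent, so it has
    cells**2 states, and there are (agents - 1) such modules.
    """
    return (agents - 1) * cells ** 2


def neighbours(state):
    """States that differ from `state` by one step in exactly one coordinate."""
    result = []
    for i in range(len(state)):
        for delta in (-1, 1):
            shifted = list(state)
            shifted[i] += delta
            result.append(tuple(shifted))
    return result


def choose_action(state, q, visits, min_visits=5):
    """Pick a greedy action, borrowing from well-explored neighbours if needed.

    `q` maps state -> {action: value}; `visits` maps state -> visit count.
    If `state` was visited fewer than `min_visits` times, the Q-values of
    sufficiently visited neighbouring states are averaged instead.
    """
    zeros = {a: 0.0 for a in ACTIONS}
    if visits.get(state, 0) >= min_visits:
        values = q.get(state, zeros)
    else:
        trusted = [s for s in neighbours(state) if visits.get(s, 0) >= min_visits]
        if trusted:
            values = {
                a: sum(q.get(s, zeros)[a] for s in trusted) / len(trusted)
                for a in ACTIONS
            }
        else:
            # No reliable neighbour either: fall back to the state's own values.
            values = q.get(state, zeros)
    return max(ACTIONS, key=lambda a: values[a])
```

For a hypothetical 20-cell grid with four agents, `monolithic_states(20, 4)` gives 160000 states, while `modular_states(20, 4)` gives 1200, which grows linearly as agents are added. Averaging over neighbours is only one possible way to reuse explored states; a distance-weighted vote would fit the same interface.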


Bibliographic details

Source
Engineering Applications of Artificial Intelligence | 2013, issue 9 | pp. 2164-2171 | 8 pages
Authors
Sahar Araghi; Abbas Khosravi; Michael Johnstone; Douglas Creighton
Affiliation
Centre for Intelligent Systems Research (CISR), Deakin University, Victoria 3217, Australia (listed for each of the four authors)
Original format: PDF
Language: English
Keywords
Multi-agent systems; Machine learning; Modular reinforcement learning; Q-learning

