Machine learning enables multi-robot systems to carry out desired tasks in unknown dynamic environments. In this paper, we extend the single-agent Q-learning algorithm to a multi-robot box-pushing system operating in an unknown dynamic environment with randomly distributed obstacles. Two kinds of extension are available: directly extending MDP-based (Markov Decision Process based) Q-learning to the multi-robot domain, and SG-based (Stochastic Game based) Q-learning. We select the first kind of extension because of its simplicity. The learning space, the box dynamics, and the reward function are presented in this paper. Furthermore, a simulation system is developed, and its results demonstrate the effectiveness, robustness, and adaptability of this learning-based multi-robot system. Statistical analysis of the results also shows that the robots learn a correct cooperative strategy even in a dynamic environment.
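The direct MDP-based extension mentioned above reuses the standard single-agent tabular Q-learning update in each robot. The following is a minimal sketch of that update rule; the function name, the dictionary-based Q-table, and the parameter values are illustrative assumptions, not details taken from the paper.

```python
def q_update(Q, state, action, reward, next_state, actions,
             alpha=0.1, gamma=0.9):
    """Watkins' Q-learning update:
    Q(s,a) <- Q(s,a) + alpha * (r + gamma * max_a' Q(s',a') - Q(s,a)).
    Q is a dict mapping (state, action) pairs to values; unseen pairs
    default to 0.0."""
    best_next = max(Q.get((next_state, a), 0.0) for a in actions)
    old = Q.get((state, action), 0.0)
    Q[(state, action)] = old + alpha * (reward + gamma * best_next - old)
    return Q[(state, action)]

# In a direct multi-robot extension, each robot would keep its own Q-table
# and apply this same update to its local state observation and reward.
Q = {}
v = q_update(Q, state=0, action=1, reward=1.0, next_state=2, actions=[0, 1])
```

With an empty table, the update above yields `alpha * reward = 0.1`, after which the Q-table holds one entry for the visited state-action pair.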