Multiagent Reinforcement Learning with Regret Matching for Robot Soccer


Abstract

This paper proposes a novel multiagent reinforcement learning (MARL) algorithm, Nash-Q learning with regret matching, in which regret matching is used to speed up the well-known MARL algorithm Nash-Q learning. Choosing a suitable action-selection strategy that balances exploration and exploitation is critical to improving the online learning ability of Nash-Q learning. In a Markov game, the joint action of agents that adopt the regret matching algorithm can converge to a set of no-regret points, which can be viewed as a coarse correlated equilibrium, a class that in essence includes Nash equilibrium. It can be inferred that regret matching can guide exploration of the state-action space, so that the convergence rate of the Nash-Q learning algorithm increases. Simulation results on robot soccer show that, compared with the original Nash-Q learning algorithm, using regret matching during the learning phase of Nash-Q learning yields excellent online learning ability and significant performance gains in terms of scores, average reward, and policy convergence.
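The regret matching rule that the abstract refers to can be illustrated with a minimal sketch. This shows only the standard single-agent rule (play each action with probability proportional to its positive cumulative regret), not the paper's integration with Nash-Q learning, which the abstract does not specify. The function names `regret_matching_strategy` and `update_regret` are hypothetical, and the payoff vector is assumed to be observable for every action.

```python
def regret_matching_strategy(cumulative_regret):
    """Return action probabilities proportional to positive cumulative regret.

    Assumption: if no action has positive regret, fall back to uniform play.
    """
    positive = [max(r, 0.0) for r in cumulative_regret]
    total = sum(positive)
    n = len(cumulative_regret)
    if total <= 0.0:
        return [1.0 / n] * n
    return [p / total for p in positive]


def update_regret(cumulative_regret, payoffs, played):
    """Accumulate, per action, how much better it would have done than `played`.

    `payoffs[a]` is the (assumed known) payoff action `a` would have earned
    this round against the other agents' actual choices.
    """
    baseline = payoffs[played]
    return [r + (u - baseline) for r, u in zip(cumulative_regret, payoffs)]


# Usage: one round with two actions, where action 0 was played but action 1
# would have paid more, so all probability mass shifts to action 1.
regret = update_regret([0.0, 0.0], payoffs=[1.0, 3.0], played=0)
strategy = regret_matching_strategy(regret)
```

Under this rule, actions that would have performed better in hindsight are explored more often, which is the mechanism the abstract credits for guiding exploration of the state-action space.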

Bibliographic details

  • Source
    Mathematical Problems in Engineering | 2013, Issue 9 | 926267.1-926267.8 | 8 pages
  • Authors

    Qiang Liu; Jiachen Ma; Wei Xie;

  • Affiliations

    School of Astronautics, Harbin Institute of Technology, Harbin 150001, China, School of Information and Electrical Engineering, Harbin Institute of Technology (Weihai), Weihai 264209, China;

    School of Astronautics, Harbin Institute of Technology, Harbin 150001, China, School of Information and Electrical Engineering, Harbin Institute of Technology (Weihai), Weihai 264209, China;

    School of Astronautics, Harbin Institute of Technology, Harbin 150001, China, School of Information and Electrical Engineering, Harbin Institute of Technology (Weihai), Weihai 264209, China;

  • Original format: PDF
  • Language: English
