Conference: IEEE Symposium Series on Computational Intelligence

Deep Reinforcement Learning Based Intelligent Decision Making for Two-player Sequential Game with Uncertain Irrational Player


Abstract

This paper investigates a two-player sequential game with an unknown, non-stationary irrational player, motivated by decision-making applications in cooperative autonomous robots. In practice, an agent's irrationality can seriously degrade the effectiveness of decision making, especially in distributed cooperative tasks such as those in multi-robot systems. Such irrationality can be caused, for example, by a mechanical failure or sensor flaw in the cooperating agent. To handle this issue, a novel dynamic evaluation system is first designed to efficiently quantify the player's level of cooperation or competition; it includes two important parameters, a cooperation index and a competitive flag. Then, a continuous deep Q network space is proposed to predict the action value with respect to a continuous cooperation index. Inspired by the framework of the "Friend or Foe" algorithm, a novel hybrid online multi-agent deep reinforcement learning algorithm is proposed. The designed algorithm can evaluate the cooperator's level of cooperation as well as maximize the total payoff by learning in the continuous deep Q network space. Finally, numerical simulations and experimental tests are provided to demonstrate the effectiveness of the designed algorithm.
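
The abstract does not give the algorithm's details, so the following is only a minimal sketch of the "Friend or Foe" idea it builds on, not the paper's method. It uses a small tabular payoff for a single state in place of a deep Q network. The names q_table, friend_value, foe_value and blended_value are hypothetical, the foe value is simplified to pure strategies (the original Foe-Q formulation optimizes over mixed strategies), and the linear blend by a cooperation index in [0, 1] is an assumption made purely for illustration.

```python
# Illustrative sketch only; NOT the algorithm from the paper.
# q[a][b] is a hypothetical table of action values for one state,
# indexed by our action a and the other player's action b.


def friend_value(q):
    """Friend-style value: assume the other player helps pick the best cell."""
    return max(max(row) for row in q)


def foe_value(q):
    """Foe-style value, simplified to pure strategies (maximin over rows)."""
    return max(min(row) for row in q)


def blended_value(q, cooperation_index):
    """Assumed linear blend: 1.0 is fully cooperative, 0.0 fully adversarial."""
    if not 0.0 <= cooperation_index <= 1.0:
        raise ValueError("cooperation_index must lie in [0, 1]")
    friend = friend_value(q)
    foe = foe_value(q)
    return cooperation_index * friend + (1.0 - cooperation_index) * foe


# Hypothetical 2x2 payoff table used only to exercise the functions.
q_table = [
    [4.0, 0.0],
    [2.0, 1.0],
]

if __name__ == "__main__":
    for index in (0.0, 0.5, 1.0):
        print(index, blended_value(q_table, index))
```

With this table, a fully cooperative partner justifies aiming for the best joint outcome, while a fully adversarial one makes the safer second row preferable; intermediate values of the assumed index interpolate between the two estimates.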
