首页> 外文会议>IEEE Symposium Series on Computational Intelligence >Deep Reinforcement Learning Based Intelligent Decision Making for Two-player Sequential Game with Uncertain Irrational Player
【24h】

Deep Reinforcement Learning Based Intelligent Decision Making for Two-player Sequential Game with Uncertain Irrational Player

机译:基于深度强化学习的不确定性两人序列游戏的智能决策

获取原文

摘要

In this paper, two player sequential game with an unknown non-stationary irrational player is investigated for cooperative autonomous robots decision making applications. In practice, the irrationality of agent can seriously degrade the effectiveness of decision making especially for distributed cooperative tasks with applications to multi-robot systems. Specifically, The irrationality can be caused by the cooperation agent’s mechanical failure or sensor flaw. To handle this issue, a novel dynamic evaluation system, which includes two important parameters, i.e. cooperation index and competitive flag, is designed to efficiently quantify the player’s level of cooperation or competition firstly. Then, the continuous deep Q network space is proposed to predict the action value with respect to a continuous cooperation index. Inspired from the framework of "Friend or Foe" algorithm, a novel hybrid online multi-agent deep reinforcement learning algorithm is proposed. The designed algorithm can evaluate the cooperator’s cooperative level as well as maximize the total payoff by learning in a continuous deep Q network space. Eventually, numerical simulation and experimental tests are provided to demonstrate the effectiveness of the designed algorithm.
机译:本文研究了具有未知非平稳非理性玩家的两人顺序博弈,以用于协作式自主机器人决策应用。实际上,代理的不合理性会严重降低决策的有效性,尤其是对于分布式协作任务以及应用于多机器人系统的决策。具体来说,不合理性可能是由合作代理的机械故障或传感器缺陷引起的。为了解决这个问题,设计了一种新颖的动态评估系统,该系统包括两个重要参数,即合作指数和竞争标志,以首先有效地量化参与者的合作或竞争水平。然后,提出了连续的深层Q网络空间,以预测关于连续合作指标的作用值。在“朋友或敌人”算法框架的启发下,提出了一种新颖的混合在线多主体深度强化学习算法。设计的算法可以评估合作者的合作水平,并通过在连续的深层Q网络空间中学习来最大化总收益。最终,通过数值模拟和实验测试证明了所设计算法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号