首页> 外文会议>IEEE Conference on Decision and Control >Action Dependent Dual Heuristic Programming Solution for the Dynamic Graphical Games
【24h】

Action Dependent Dual Heuristic Programming Solution for the Dynamic Graphical Games

机译:动态图形游戏的动作相关双重启发式编程解决方案

获取原文
获取外文期刊封面目录资料

摘要

The context of graphical games is employed to solve the cooperative control problem for multi-agent systems interacting on graphs. Together with the need to have faster solution mechanisms urged for new approaches that employ the Dual Heuristic and Action Dependent Dual Heuristic Programming. This class of gradient-based solutions undergoes two main challenges. First, they have to use complex update expressions for the solving gradient-based structures. Second, they may overlook the local neighborhood information, if simpler costate expressions are enforced. A novel approach based on Action Dependent Dual Heuristic Programming is developed to solve the dynamic graphical games and to handle the aforementioned concerns. This adaptive learning approach is implemented online using means of value iteration and neural networks. The approximation of the optimal policy does not have priori knowledge about the agents' dynamics, while the value function gradient approximation is shown to depend only on the drift dynamics of the agents. The convergence results of the adaptive learning approach are highlighted by simulation example.
机译:图形游戏的上下文用于解决在图上交互的多智能体系统的协作控制问题。迫切需要有更快的解决方案机制,以寻求采用双重启发式和依赖于动作的双重启发式编程的新方法。此类基于梯度的解决方案面临两个主要挑战。首先,他们必须使用复杂的更新表达式来求解基于梯度的结构。其次,如果强制使用更简单的costate表达式,他们可能会忽略本地邻居信息。开发了一种基于动作依赖双重启发式编程的新颖方法来解决动态图形游戏并解决上述问题。这种自适应学习方法是使用价值迭代和神经网络在线实现的。最优策略的近似不具有关于代理动态的先验知识,而值函数梯度近似仅显示为依赖于代理的漂移动态。仿真实例突出了自适应学习方法的收敛结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号