Action Dependent Dual Heuristic Programming Solution for the Dynamic Graphical Games

Abstract

Graphical games are used to solve the cooperative control problem for multi-agent systems interacting on graphs. The need for faster solution mechanisms has motivated new approaches that employ Dual Heuristic Programming and Action Dependent Dual Heuristic Programming. This class of gradient-based solutions faces two main challenges. First, they require complex update expressions for the gradient-based solving structures. Second, they may overlook local neighborhood information if simpler costate expressions are enforced. A novel approach based on Action Dependent Dual Heuristic Programming is developed to solve the dynamic graphical games and to address these concerns. This adaptive learning approach is implemented online using value iteration and neural networks. The approximation of the optimal policy does not require a priori knowledge of the agents' dynamics, while the value function gradient approximation is shown to depend only on the drift dynamics of the agents. The convergence of the adaptive learning approach is illustrated by a simulation example.


Bibliographic record

Source
IEEE Conference on Decision and Control | 2018 | pp. 2741-2746 | 6 pages total
Conference location
Authors
Mohammed I. Abouheaf; Frank L. Lewis; Magdi S. Mahmoud;
Author affiliations
Conference organizer
Original format: PDF
Text language
Chinese Library Classification
Keywords
Dynamic programming; Games; Mathematical model; Neural networks; Optimal control; Synchronization; Programming;


Similar literature

1. QR-Tuning and Approximate-LS Solutions of the HJB Equation for Online DLQR Design via State and Action-Dependent Heuristic Dynamic Programming [J]. Joao Viana Da Fonseca Neto, Patricia Helena Moraes Rego. International Journal of Innovative Computing Information and Control. 2014, No. 3
2. The dynamics of new dual-mode Kawahara equation: interaction of dual-waves solutions and graphical analysis [J]. Physica Scripta: An International Journal for Experimental and Theoretical Physics. 2020, No. 4
3. Online discrete-time LQR controller design with integral action for bulk Bucket Wheel Reclaimer operational processes via Action-Dependent Heuristic Dynamic Programming [J]. de Moura Jose Pinheiro, Moraes Rego Patricia Helena, da Fonseca Neto Joao Viana. ISA Transactions. 2019
4. Action Dependent Dual Heuristic Programming Solution for the Dynamic Graphical Games [C]. Mohammed I. Abouheaf, Frank L. Lewis, Magdi S. Mahmoud. IEEE Annual Conference on Decision and Control. 2018
5. A bi-level programming formulation and heuristic solution approach for traffic control optimization in networks with dynamic demand and stochastic route choice [D]. Sun, Dazhi. 2005
6. Statistical measures for defining an individual's degree of independence within state-dependent dynamic games [O]. Sean A Rands, Rufus A Johnstone. 2006
7. Comparison of a Heuristic Dynamic Programming and a Dual Heuristic Programming Based Adaptive Critics Neurocontroller for a Turbogenerator [O]. Ganesh K Venayagamoorthy, Ronald G Harley, Donald C Wunsch II. 2013
