Adaptive Dynamic Programming algorithm for finding online the equilibrium solution of the two-player zero-sum differential game

机译：在线寻找两人零和差分博弈均衡解的自适应动态规划算法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper will present an Approximate/Adaptive Dynamic Programming (ADP) algorithm for determining online the Nash equilibrium solution for the two-player zero-sum differential game with linear dynamics and infinite horizon quadratic cost. The algorithm is built around an iterative method that has been developed in the control engineering community for solving the continuous-time game algebraic Riccati equation (CT-GARE) that is underlying the game problem. We here show how the ADP techniques will enhance the capabilities of the offline method allowing an online solution without the requirement of complete knowledge of the system dynamics.

机译：本文将提出一种近似/自适应动态规划（ADP）算法，用于在线确定具有线性动力学和无限地平线二次成本的两人零和差分游戏的纳什均衡解。该算法是围绕控制工程界开发的一种迭代方法构建的，该迭代方法用于解决游戏问题背后的连续时间游戏代数Riccati方程（CT-GARE）。我们在这里展示了ADP技术将如何增强离线方法的功能，从而允许在线解决方案，而无需完全了解系统动力学。

著录项

来源
《The 2010 International Joint Conference on Neural Networks》|2010年|p.1-8|共8页
会议地点
作者
Vrabie Draguna; Lewis Frank;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类人工神经网络与计算;
关键词

相似文献

外文文献
中文文献
专利

1. Adaptive dynamic programming for online solution of a zero-sum differential game [J] . Draguna VRABIE, Frank LEWIS 控制理论与应用（英文版） . 2011,第003期

机译：零和差分游戏在线解的自适应动态规划
2. Stable value iteration for two-player zero-sum game of discrete-time nonlinear systems based on adaptive dynamic programming [J] . Song Ruizhuo, Zhu Liao Neurocomputing . 2019,第MAYa7期

机译：基于自适应动态规划的离散非线性系统两人零和游戏的稳定值迭代
3. Robust Adaptive Dynamic Programming of Two-Player Zero-Sum Games for Continuous-Time Linear Systems [J] . Fu Yue, Fu Jun, Chai Tianyou Neural Networks and Learning Systems, IEEE Transactions on . 2015,第12期

机译：连续时间线性系统两层零和博弈的鲁棒自适应动态规划
4. Adaptive Dynamic Programming algorithm for finding online the equilibrium solution of the two-player zero-sum differential game [C] . Vrabie Draguna, Lewis Frank The 2010 International Joint Conference on Neural Networks . 2010

机译：在线寻找两人零和差分博弈均衡解的自适应动态规划算法
5. Deception in two-player zero-sum stochastic games: Theory and application to warfare games. [D] . Singh, Rajdeep. 2006

机译：两人零和随机游戏中的欺骗：理论和在战争游戏中的应用。
6. A Differential Evolution Algorithm Based on Nikaido-Isoda Function for Solving Nash Equilibrium in Nonlinear Continuous Games [O] . Feng He, Wei Zhang, Guoqiang Zhang -1

机译：基于Nikaido-Isoda函数的差分进化算法求解非线性连续博弈中的Nash平衡
7. Online Gaming: Real Time Solution of Nonlinear Two-Player Zero-Sum Games Using Synchronous Policy Iteration [O] . Kyriakos G., Frank L. 2011

机译：在线游戏：使用同步策略迭代的非线性双人零和游戏的实时解决方案

Adaptive Dynamic Programming algorithm for finding online the equilibrium solution of the two-player zero-sum differential game

摘要

著录项

相似文献

相关主题

期刊订阅