
Neural-network-based zero-sum game for discrete-time nonlinear systems via iterative adaptive dynamic programming algorithm


Abstract

In this paper, we solve zero-sum game problems for discrete-time affine nonlinear systems with known dynamics via an iterative adaptive dynamic programming algorithm. First, a greedy heuristic dynamic programming iteration algorithm is developed to solve the zero-sum game problems; it can be used to solve the Hamilton-Jacobi-Isaacs equation associated with H∞ optimal regulation control problems. A convergence analysis in terms of the value function and the control policy is provided. To facilitate the implementation of the algorithm, three neural networks are used to approximate the control policy, the disturbance policy, and the value function, respectively. Then, we extend the algorithm to H∞ optimal tracking control problems through a system transformation. Finally, two simulation examples are presented to demonstrate the effectiveness of the proposed scheme.
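The value iteration summarized in the abstract can be illustrated on a much simpler case than the one the paper treats. The sketch below assumes a scalar linear system x_{k+1} = a x_k + b u_k + d w_k with stage cost q x^2 + r u^2 - gamma^2 w^2 and a quadratic value function V_i(x) = p_i x^2 starting from V_0 = 0. In that special case the greedy min-max step has a closed form, so no neural networks are needed. The function names and all numeric values are hypothetical and are not taken from the paper.

```python
def hdp_zero_sum_scalar(a, b, d, q, r, gamma, iterations=200):
    """Value iteration for a scalar linear-quadratic zero-sum game.

    Iterates V_{i+1}(x) = min_u max_w { q x^2 + r u^2 - gamma^2 w^2
                                        + V_i(a x + b u + d w) }
    with V_i(x) = p_i x^2 and V_0 = 0. Returns the list [p_0, p_1, ...].
    """
    p = 0.0
    history = [p]
    for _ in range(iterations):
        # The inner maximization over w is concave only if gamma^2 > d^2 p.
        if gamma ** 2 - d * d * p <= 0:
            raise ValueError("maximization over w is unbounded; increase gamma")
        denom = 1.0 + p * (b * b / r - d * d / gamma ** 2)
        # Closed-form result of the min-max step for a quadratic V_i.
        p = q + a * a * p / denom
        history.append(p)
    return history


def greedy_gains(a, b, d, r, gamma, p):
    """Saddle-point feedback gains (u = k_u x, w = k_w x) for V(x) = p x^2."""
    denom = 1.0 + p * (b * b / r - d * d / gamma ** 2)
    z = a / denom                 # closed-loop next state per unit of x
    k_u = -p * b * z / r          # control player minimizes
    k_w = p * d * z / gamma ** 2  # disturbance player maximizes
    return k_u, k_w


if __name__ == "__main__":
    # Hypothetical parameters chosen only for illustration.
    params = dict(a=0.9, b=1.0, d=0.5, q=1.0, r=1.0, gamma=2.0)
    hist = hdp_zero_sum_scalar(**params)
    k_u, k_w = greedy_gains(0.9, 1.0, 0.5, 1.0, 2.0, hist[-1])
    print("p:", hist[-1], "k_u:", k_u, "k_w:", k_w)
```

For nonlinear dynamics the min-max step generally has no closed form, which is where the three approximating networks mentioned in the abstract come in. This sketch only shows the shape of the iteration.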

Bibliographic details

  • Source
    Neurocomputing | 2013, Issue 13 | pp. 92-100 | 9 pages
  • Author affiliations

    State Key Laboratory of Management and Control for Complex Systems, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, PR China;

    State Key Laboratory of Management and Control for Complex Systems, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, PR China;

    State Key Laboratory of Management and Control for Complex Systems, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, PR China;

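  • Indexed in: Science Citation Index (SCI), USA; Engineering Index (EI), USA
  • Original format: PDF
  • Language of text: English
  • Chinese Library Classification
  • Keywords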

    adaptive dynamic programming; approximate dynamic programming; heuristic dynamic programming; neural networks; zero-sum game; H∞ optimal control

