首页> 外文会议>International conference on neural information processing >Data-Based Optimal Tracking Control of Nonaffine Nonlinear Discrete-Time Systems
【24h】

Data-Based Optimal Tracking Control of Nonaffine Nonlinear Discrete-Time Systems

机译:非共源非线性离散时间系统的基于数据的最优跟踪控制

获取原文

摘要

The optimal tracking control problem of nonaffine nonlinear discrete-time systems is considered in this paper. The problem relies on the solution of the so-called tracking Hamilton-Jacobi-Bellman equation, which is extremely difficult to be solved even for simple systems. To overcome this difficulty, the data-based Q-learning algorithm is proposed by learning the optimal tracking control policy from data of the practical system. For its implementation purpose, the critic-only neural network structure is developed, where only critic neural network is required to estimate the Q-function and the least-square scheme is employed to update the weight of neural network.
机译:本文考虑了非共进非线性离散时间系统的最佳跟踪控制问题。问题依赖于所谓的跟踪Hamilton-Jacobi-Bellman方程的解决方案,即使对于简单的系统,也非常难以解决。为了克服这种困难,通过从实际系统的数据学习最佳跟踪控制策略来提出基于数据的Q学习算法。为了实现目的,仅开发了批评神经网络结构,其中仅需要批评批评神经网络来估计Q函数,并且采用最小二乘方案来更新神经网络的权重。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号