首页> 外文期刊>Neural Networks and Learning Systems, IEEE Transactions on >Adaptive Optimal Control of Unknown Constrained-Input Systems Using Policy Iteration and Neural Networks
【24h】

Adaptive Optimal Control of Unknown Constrained-Input Systems Using Policy Iteration and Neural Networks

机译:基于策略迭代和神经网络的未知约束输入系统的自适应最优控制

获取原文
获取原文并翻译 | 示例
           

摘要

This paper presents an online policy iteration (PI) algorithm to learn the continuous-time optimal control solution for unknown constrained-input systems. The proposed PI algorithm is implemented on an actor–critic structure where two neural networks (NNs) are tuned online and simultaneously to generate the optimal bounded control policy. The requirement of complete knowledge of the system dynamics is obviated by employing a novel NN identifier in conjunction with the actor and critic NNs. It is shown how the identifier weights estimation error affects the convergence of the critic NN. A novel learning rule is developed to guarantee that the identifier weights converge to small neighborhoods of their ideal values exponentially fast. To provide an easy-to-check persistence of excitation condition, the experience replay technique is used. That is, recorded past experiences are used simultaneously with current data for the adaptation of the identifier weights. Stability of the whole system consisting of the actor, critic, system state, and system identifier is guaranteed while all three networks undergo adaptation. Convergence to a near-optimal control law is also shown. The effectiveness of the proposed method is illustrated with a simulation example.
机译:本文提出了一种在线策略迭代(PI)算法,用于学习未知约束输入系统的连续时间最优控制解决方案。提出的PI算法是在行为者-批评者结构上实现的,其中两个神经网络(NNs)在线同时进行调谐,以生成最佳的有界控制策略。通过将新的NN标识符与参与者和评论者NN结合使用,可以消除对系统动力学的全面了解的要求。示出了标识符权重估计误差如何影响评论者NN的收敛。开发了一种新颖的学习规则,以确保标识符权重快速收敛到理想值的小邻域。为了提供易于检查的激励条件持久性,使用了经验重播技术。即,记录的过去经验与当前数据同时用于标识符权重的适配。当三个网络都进行自适应时,可以确保由参与者,评论者,系统状态和系统标识符组成的整个系统的稳定性。还显示了收敛到最佳控制律。仿真实例说明了该方法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号