首页> 外文会议>Chinese Control Conference >Optimal control of affine nonlinear continuous-time systems using online actor-critic algorithm
【24h】

Optimal control of affine nonlinear continuous-time systems using online actor-critic algorithm

机译:在线仿生批评算法的仿射非线性连续时间系统的最优控制

获取原文

摘要

In this paper we propose a new online actor-critic algorithm based on policy iteration for learning the continuous-time optimal control solution with infinite horizon cost for nonlinear systems. In other word, the algorithm solves online an algebraic Riccati equation without knowing the internal dynamics model of the system. This approach is implemented as an actor-critic structure which involves both actor and critic neural networks. Because of using a policy iteration method, the present algorithm alternates between the policy evaluation and policy update steps until an update of the control policy will no longer improve the system performance. Simulation results show the effectiveness of the new algorithm.
机译:在本文中,我们提出了一种新的基于策略迭代的在线actor-critic算法,用于学习非线性系统无限时空成本的连续时间最优控制解决方案。换句话说,该算法在不知道系统内部动力学模型的情况下在线求解了代数Riccati方程。这种方法被实现为既包含演员神经网络又包含批评者神经网络的演员批评结构。由于使用策略迭代方法,因此本算法在策略评估和策略更新步骤之间交替,直到控制策略的更新将不再提高系统性能。仿真结果表明了该算法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号