Journal: IEEE Transactions on Cybernetics

Approximate Dynamic Programming for Nonlinear-Constrained Optimizations

Abstract

In this paper, we study the constrained optimization problem of a class of uncertain nonlinear interconnected systems. First, we prove that the solution of the constrained optimization problem can be obtained through solving an array of optimal control problems of constrained auxiliary subsystems. Then, under the framework of approximate dynamic programming, we present a simultaneous policy iteration (SPI) algorithm to solve the Hamilton-Jacobi-Bellman equations corresponding to the constrained auxiliary subsystems. By building an equivalence relationship, we demonstrate the convergence of the SPI algorithm. Meanwhile, we implement the SPI algorithm via an actor-critic structure, where actor networks are used to approximate optimal control policies and critic networks are applied to estimate optimal value functions. By using the least squares method and the Monte Carlo integration technique together, we are able to determine the weight vectors of actor and critic networks. Finally, we validate the developed control method through the simulation of a nonlinear interconnected plant.
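To make the policy-iteration and least-squares ideas in the abstract concrete, here is a minimal sketch on a toy problem. It is not the paper's SPI algorithm: it assumes a single unconstrained scalar linear system dx/dt = a*x + b*u with cost integrand q*x^2 + r*u^2, a one-term critic V(x) = w*x^2, and a linear actor u = -k*x. The function name `policy_iteration`, all parameter values, and the sampling domain are hypothetical choices made for illustration only.

```python
import numpy as np


def policy_iteration(a=-1.0, b=1.0, q=1.0, r=1.0,
                     n_samples=2000, n_iters=20, seed=0):
    """Toy policy iteration for dx/dt = a*x + b*u, cost q*x^2 + r*u^2.

    Illustrative assumptions (not taken from the paper):
      critic V(x) = w * x^2, actor u(x) = -k * x, no input constraints.
    """
    rng = np.random.default_rng(seed)
    # Monte Carlo samples of the state domain; the least-squares sums below
    # stand in for integrals of the squared residual over [-2, 2].
    xs = rng.uniform(-2.0, 2.0, size=n_samples)

    k = 0.0  # initial policy u = 0 is stabilizing here because a < 0
    w = 0.0
    for _ in range(n_iters):
        u = -k * xs
        # Policy evaluation: fit w so that the residual
        #   dV/dx * (a*x + b*u) + q*x^2 + r*u^2
        # is zero in the least-squares sense, with dV/dx = 2*w*x.
        regressor = 2.0 * xs * (a * xs + b * u)
        target = -(q * xs**2 + r * u**2)
        w = np.linalg.lstsq(regressor[:, None], target, rcond=None)[0][0]
        # Policy improvement: u = -(b / (2*r)) * dV/dx = -(b*w/r) * x.
        k = b * w / r
    return w, k


if __name__ == "__main__":
    w, k = policy_iteration()
    print("critic weight:", w, "actor gain:", k)
```

With the default values, the iterates should approach the positive root of the scalar Riccati equation 2*a*w - (b^2/r)*w^2 + q = 0. Because the quadratic basis represents the value function exactly in this toy case, the sampled least-squares fit is exact. In a nonlinear setting such as the one the abstract describes, the basis is only approximate and the quality of the Monte Carlo sample would matter.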
