Approximate dynamic programming solutions with a single network adaptive critic for a class of nonlinear systems.

Abstract

The approximate dynamic programming (ADP) formulation implemented with an Adaptive Critic (AC) based neural network (NN) structure has evolved into a powerful technique for solving the Hamilton-Jacobi-Bellman (HJB) equations. As interest in ADP and AC solutions escalates, there is a pressing need to consider possible enabling factors for their implementation. A typical AC structure consists of two interacting NNs, which is computationally expensive. In this work, a new architecture called the "Cost Function Based Single Network Adaptive Critic (J-SNAC)" is presented that eliminates one of the networks in a typical AC structure. This approach is applicable to a wide class of nonlinear systems in engineering. In the first paper, two problems are solved with both the AC and the J-SNAC approaches. The results show that J-SNAC saves about 50% of the computational cost while matching the accuracy of the dual-network structure in solving for optimal control. In the second paper, plant dynamics with parametric uncertainties or unmodeled nonlinearities are considered. The author discusses dynamic re-optimization of the J-SNAC controller to capture uncertainty that is not considered in the system model used for controller design. In the third paper, a non-quadratic cost function is used to incorporate control constraints. The necessary equations for optimal control are derived, and an algorithm is presented to solve the constrained-control problem with J-SNAC. The fourth paper presents a new controller design technique for a class of nonlinear impulse-driven systems.

Bibliographic details

  • Author

    Ding, Jie

  • Author affiliation

    Missouri University of Science and Technology

  • Degree-granting institution Missouri University of Science and Technology
  • Subject Engineering Mechanical.
  • Degree Ph.D.
  • Year 2011
  • Pages 171 p.
  • Total pages 171
  • Original format PDF
  • Language English