首页> 外文会议>Intelligent Control. 2003 IEEE International Symposium on >Analyzing and enhancing direct NDP designs using a control-theoretic approach
【24h】

Analyzing and enhancing direct NDP designs using a control-theoretic approach

机译:使用控制理论方法分析和增强直接NDP设计

获取原文

摘要

Direct NDP is in the family of approximate dynamic programming designs aiming at using learning and approximation methods to solve dynamic optimization problems formulated in dynamic programming, and to overcome the curse of dimensionality. Due to the statistical learning nature of the approaches, researchers usually make use of statistical measures to evaluate the design performance of the learning system such as the learning speed and the variation from one learning experience to the other. However, there are no systematic studies to date that address closed loop system performance from an input-output functional perspective. This paper analyzes direct NDP designs using classic control-theoretic sensitivity arguments. By using the benchmark cart-pole problem, it is shown that direct NDP uses an LQR with desired closed-loop properties as a learning guide, it is more likely for direct NDP to generate better designs than a direct NDP learning from scratch. Although the approach and results are illustrated using a simple nonlinear cart-pole system, it is clear that they are readily extended to more complex dynamical systems.
机译:Direct NDP属于近似动态规划设计家族,旨在使用学习和逼近方法来解决动态规划中提出的动态优化问题,并克服维数的诅咒。由于方法的统计学习性质,研究人员通常利用统计方法来评估学习系统的设计性能,例如学习速度和从一种学习经历到另一种学习经历的变化。但是,迄今为止,还没有系统的研究从输入输出功能的角度来解决闭环系统的性能。本文使用经典的控制理论敏感性参数来分析直接NDP设计。通过使用基准车杆问题,可以证明直接NDP使用具有所需闭环特性的LQR作为学习指南,与直接从头学习相比,直接NDP更有可能生成更好的设计。尽管使用简单的非线性Cart-pole系统对方法和结果进行了说明,但很显然,它们很容易扩展到更复杂的动力学系统。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号