首页> 外文会议>IFAC World Congress >Approximate Dynamic Programming via Penalty Functions
【24h】

Approximate Dynamic Programming via Penalty Functions

机译:通过惩罚功能近似动态编程

获取原文

摘要

In this paper, we propose a novel formulation for encoding state constraints into the Linear Programming approach to Approximate Dynamic Programming via the use of penalty functions. To maintain tractability of the resulting optimization problem that needs to be solved, we suggest a penalty function that is constructed as a point-wise maximum taken over a family of low-order polynomials. Once the penalty functions are designed, no additional approximations are introduced by the proposed formulation. The effectiveness and numerical stability of the formulation is demonstrated through examples.
机译:在本文中,我们提出了一种用于将状态约束进行编码成线性规划方法的新颖制剂,以通过使用惩罚函数来近似动态编程。为了维持所产生的优化问题的易易,我们建议一个被构造成的惩罚函数,作为一系列低阶多项式的群体最大值。一旦设计了惩罚功能,就可以通过所提出的配方引入额外的近似。通过实施例证明了制剂的有效性和数值稳定性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号