首页> 外文会议>IFAC World Congress >Approximate Dynamic Programming via Penalty Functions

【24h】

Approximate Dynamic Programming via Penalty Functions

机译：通过惩罚功能近似动态编程

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper, we propose a novel formulation for encoding state constraints into the Linear Programming approach to Approximate Dynamic Programming via the use of penalty functions. To maintain tractability of the resulting optimization problem that needs to be solved, we suggest a penalty function that is constructed as a point-wise maximum taken over a family of low-order polynomials. Once the penalty functions are designed, no additional approximations are introduced by the proposed formulation. The effectiveness and numerical stability of the formulation is demonstrated through examples.

机译：在本文中，我们提出了一种用于将状态约束进行编码成线性规划方法的新颖制剂，以通过使用惩罚函数来近似动态编程。为了维持所产生的优化问题的易易，我们建议一个被构造成的惩罚函数，作为一系列低阶多项式的群体最大值。一旦设计了惩罚功能，就可以通过所提出的配方引入额外的近似。通过实施例证明了制剂的有效性和数值稳定性。

著录项

来源
《IFAC World Congress》|2018年|11367-12032p|共8页
会议地点
作者
Paul N. Beuchat; John Lygeros;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP273-53;
关键词
Stochastic optimal control problems; Approximate dynamic programming; Soft constraints;

机译：随机最佳控制问题;近似动态规划;软限制;

相似文献

外文文献
中文文献
专利

1. Choice of approximator and design of penalty function for an approximate dynamic programming based control approach [J] . Lee JM, Kaisare NS, Lee JH Journal of Process Control . 2006,第2期

机译：基于近似动态规划的控制方法的近似器选择和罚函数设计
2. Approximate Dynamic Programming via Penalty Functions * * This research was partially funded by the European Commission under the project Local4Global. [J] . Paul N. Beuchat, John Lygeros IFAC PapersOnLine . 2017,第1期

机译：通过惩罚函数进行近似动态编程 * * 这项研究部分由欧盟委员会在Local4Global项目下资助。
3. Approximate Dynamic Programming via Penalty Functions * * This research was partially funded by the European Commission under the project Local4Global. [J] . Paul N. Beuchat, John Lygeros IFAC PapersOnLine . 2017,第1期

机译：通过惩罚函数进行近似动态编程 * * 这项研究部分由欧盟委员会在Local4Global项目下资助。
4. Approximate Dynamic Programming via Penalty Functions [C] . Paul N. Beuchat, John Lygeros IFAC World Congress . 2018

机译：通过惩罚功能近似动态编程
5. Automatic basis function construction for reinforcement learning and approximate dynamic programming. [D] . Keller, Philipp W. 2008

机译：用于增强学习和近似动态编程的自动基础函数构造。
6. Solving the dynamic ambulance relocation and dispatching problem using approximate dynamic programming [O] . Verena Schmid -1

机译：用近似动态规划解决动态救护车的调动和调度问题
7. NONLINEAR PROGRAMMING IN APPROXIMATE DYNAMIC PROGRAMMING: BANG-BANG SOLUTIONS, STOCK-MANAGEMENT AND UNSMOOTH PENALTIES [O] . Olivier Teytaud, Sylvain Gelly 2008

机译：近似动态规划中的非线性规划：BaNG-BaNG解决方案，股票管理和不合理的处罚

Approximate Dynamic Programming via Penalty Functions

摘要

著录项

相似文献

相关主题

期刊订阅