Performance Guarantees for Model-Based Approximate Dynamic Programming in Continuous Spaces

Beuchat Paul Nathaniel; Georghiou Angelos; Lygeros John

Abstract

We study both the value function and Q-function formulations of the linear programming approach to approximate dynamic programming. The approach is model-based and optimizes over a restricted function space to approximate the value function or Q-function. Working in the discrete-time, continuous-space setting, we provide guarantees for the fitting error and the online performance of the policy. In particular, the online performance guarantee is obtained by analyzing an iterated version of the greedy policy, and the fitting error guarantee by analyzing an iterated version of the Bellman inequality. These guarantees complement the existing bounds in the literature. The Q-function formulation offers benefits, for example in decentralized controller design; however, it can lead to computationally demanding optimization problems. To alleviate this drawback, we provide a condition that simplifies the formulation, resulting in improved computation times.
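The ideas named in the abstract (Bellman inequality, its iterated version, Q-function, greedy policy) can be illustrated on a toy problem. The sketch below is not the paper's method: it assumes a small finite-state, finite-action problem instead of continuous spaces, and all numbers and function names (`COST`, `P`, `q_value`, `satisfies_iterated_bellman_inequality`, and so on) are hypothetical. It checks the property that the linear programming approach relies on: any function V satisfying V <= T^M V, with T the Bellman operator, is a pointwise lower bound on the optimal value function.

```python
# Toy finite MDP (hypothetical numbers, chosen only for illustration).
GAMMA = 0.9  # discount factor

# COST[s][a]: stage cost of taking action a in state s.
COST = [[1.0, 2.0], [0.5, 1.5], [0.0, 1.0]]

# P[s][a]: list of (probability, next_state) pairs.
P = [
    [[(1.0, 1)], [(0.5, 2), (0.5, 0)]],
    [[(1.0, 0)], [(1.0, 2)]],
    [[(1.0, 2)], [(1.0, 0)]],
]


def q_value(V, s, a):
    """Q-function induced by V: stage cost plus discounted expected cost-to-go."""
    return COST[s][a] + GAMMA * sum(p * V[t] for p, t in P[s][a])


def bellman(V):
    """Bellman operator T applied to V (minimization over actions)."""
    return [min(q_value(V, s, a) for a in range(len(COST[s])))
            for s in range(len(COST))]


def value_iteration(iterations=1000):
    """Approximate the optimal value function by repeated application of T."""
    V = [0.0] * len(COST)
    for _ in range(iterations):
        V = bellman(V)
    return V


def satisfies_iterated_bellman_inequality(V, M=1, tol=1e-9):
    """Check V <= T^M V pointwise; M = 1 is the plain Bellman inequality."""
    TV = list(V)
    for _ in range(M):
        TV = bellman(TV)
    return all(v <= tv + tol for v, tv in zip(V, TV))


def greedy_policy(V):
    """Action minimizing the Q-function induced by V, for each state."""
    return [min(range(len(COST[s])), key=lambda a: q_value(V, s, a))
            for s in range(len(COST))]


if __name__ == "__main__":
    V_star = value_iteration()
    for V_hat in ([0.0, 0.0, 0.0], [2.0, 1.5, 0.0], [3.0, 3.0, 3.0]):
        feasible = satisfies_iterated_bellman_inequality(V_hat, M=1)
        print(V_hat, "feasible:", feasible, "greedy:", greedy_policy(V_hat))
    print("V* approx:", V_star, "greedy:", greedy_policy(V_star))
```

In this toy instance the all-zero guess satisfies the Bellman inequality, yet its greedy policy differs from the optimal one in one state, which is the kind of gap between fitting error and online performance that the guarantees in the abstract are concerned with.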

Bibliographic details

Source
IEEE Transactions on Automatic Control | 2020, Issue 1 | pp. 143-158 | 16 pages
Authors
Beuchat Paul Nathaniel; Georghiou Angelos; Lygeros John
Author affiliations

Swiss Fed Inst Technol Automat Control Lab CH-8092 Zurich Switzerland;

McGill Univ Desautels Fac Management Montreal PQ H3A 0G4 Canada;

Swiss Fed Inst Technol Automat Control Lab CH-8092 Zurich Switzerland;

Original format: PDF
Language: English
Keywords
Aerospace electronics; Dynamic programming; Optimal control; Numerical models; Linear programming; Stochastic processes; Mathematical model; Discrete-time systems; dynamic programming; infinite horizon optimal control; stochastic systems;
