...
首页> 外文期刊>IEEE Transactions on Automatic Control >Performance Guarantees for Model-Based Approximate Dynamic Programming in Continuous Spaces
【24h】

Performance Guarantees for Model-Based Approximate Dynamic Programming in Continuous Spaces

机译:在连续空间中基于模型的近似动态规划的性能保证

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

We study both the value function and Q-function formulation of the linear programming approach to approximate dynamic programming. The approach is model based and optimizes over a restricted function space to approximate the value function or Q-function. Working in the discrete time, continuous space setting, we provide guarantees for the fitting error and online performance of the policy. In particular, the online performance guarantee is obtained by analyzing an iterated version of the greedy policy, and the fitting error guarantee by analyzing an iterated version of the Bellman inequality. These guarantees complement the existing bounds that appear in the literature. The Q-function formulation offers benefits, for example, in the decentralized controller design, however, it can lead to computationally demanding optimization problems. To alleviate this drawback, we provide a condition that simplifies the formulation, resulting in improved computational times.
机译:我们研究了线性规划方法的价值函数和Q函数配方,以近似动态规划。该方法是基于模型,并通过限制函数空间优化以近似值函数或Q函数。在离散时间工作,连续空间设置,我们为拟合误差和策略的在线表现提供保证。特别是,通过分析贪婪政策的迭代版本来获得在线性能保证,并通过分析贝尔曼不等式的迭代版本来获得拟合误差保证。这些保证补充了文献中出现的现有范围。 Q函数配方提供优势,例如,在分散的控制器设计中,它可能导致计算苛刻的优化问题。为了减轻这种缺点,我们提供了一种简化配方的条件,从而产生改善的计算时间。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号