...
首页> 外文期刊>IFAC PapersOnLine >Canonical Dynamic Programming Equations subject to Ambiguity ?
【24h】

Canonical Dynamic Programming Equations subject to Ambiguity ?

机译:经典动态编程方程受到歧义

获取原文

摘要

This paper studies the infinite horizon average cost Markov control model subject to ambiguity on the controlled process conditional distribution. The stochastic control problem is formulated as a minimax optimization in which, (i) the existence of optimal policies is established through a pair of canonical dynamic programming equations derived for Borel state and action spaces, and (ii) the controlled process maximizing conditional distribution is characterized through a water-filling solution derived for finite state and action spaces. To obtain average cost optimal policies numerically a policy iteration algorithm is also developed. Finally, as an application of the proposed canonical dynamic programming equations, an example is provided.
机译:本文研究了无限的地平线平均成本马尔可夫控制模型对受控过程条件分布的歧义。将随机控制问题作为最小值优化,(i)通过为Borel状态和动作空间的一对规范动态编程方程建立最佳策略的存在,并且(ii)可控过程最大化条件分布通过衍生用于有限状态和动作空间的水填充溶液。为了获得平均成本最佳策略,数值上也开发了策略迭代算法。最后,作为所提出的规范动态编程方程的应用,提供了一个例子。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号