首页> 外文期刊>SIAM Journal on Control and Optimization >DISCOUNTED CONTINUOUS-TIME MARKOV DECISION PROCESSES WITH UNBOUNDED RATES: THE CONVEX ANALYTIC APPROACH
【24h】

DISCOUNTED CONTINUOUS-TIME MARKOV DECISION PROCESSES WITH UNBOUNDED RATES: THE CONVEX ANALYTIC APPROACH

机译:利率无限的连续马尔可夫决策过程:凸分析方法

获取原文
获取原文并翻译 | 示例
           

摘要

This paper deals with constrained discounted continuous-time Markov decision processes, also known as controlled jump Markov processes, with Borel state and action spaces. Under some conditions imposed on the primitives, allowing unbounded transition rates and unbounded from both above and below) cost rates, first, we study the space of occupation measures. Then we reformulate the original problem as a linear program over the space of those measures and undertake the duality analysis. Finally, under some compactness-continuity conditions, we show the existence of a stationary optimal policy out of the class of randomized history-dependent policies.
机译:本文涉及带有Borel状态空间和动作空间的约束折扣连续时间马尔可夫决策过程,也称为受控跳跃马尔可夫过程。在强加于原语的某些条件下,允许无限制的过渡率以及不受成本上下限约束的成本率,首先,我们研究占领措施的空间。然后,我们将原始问题重新设计为在这些度量空间上的线性程序,并进行对偶分析。最后,在某些紧致性-连续性条件下,我们证明了存在随机历史相关策略类别之外的平稳最优策略的存在。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号