...
首页> 外文期刊>SIAM Journal on Control and Optimization >RISK-SENSITIVE DISCOUNTED CONTINUOUS-TIME MARKOV DECISION PROCESSES WITH UNBOUNDED RATES
【24h】

RISK-SENSITIVE DISCOUNTED CONTINUOUS-TIME MARKOV DECISION PROCESSES WITH UNBOUNDED RATES

机译:风险敏感折扣连续时间马尔可夫决策流程,具有无限性率

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

This paper attempts to study the risk-sensitive discounted continuous-time Markov decision processes with unbounded transition and cost rates. Different from the case of bounded transition/cost rates, the optimality equation (OE) no longer has a solution satisfying the uniform convergence condition introduced in the existing literature. Thus, we first replace the uniform convergence condition of the solution with a new boundary condition. Then, we find mild conditions imposed on the primitive data of the decision processes, which not only ensure the existence of a solution to the OE but also are the generalization of the bounded transition/cost rates conditions. Furthermore, using the characterization of the boundary condition and a novel technique, from the OE we prove the existence of an optimal policy out of the class of randomized history-dependent policies. Finally, we present two examples with unbounded transition/cost rates to illustrate our results.
机译:本文试图研究风险敏感的折扣连续连续时间马尔可夫决策过程,具有无限的转换和成本率。 与有界转变/成本速率的情况不同,最优性方程(OE)不再具有满足现有文献中引入的统一收敛条件的解决方案。 因此,我们首先用新的边界条件更换溶液的均匀收敛条件。 然后,我们发现对决策过程的原始数据施加的温和条件,这不仅可以确保对OE的解决方案的存在,而且还是界限转变/成本率条件的概括。 此外,使用边界条件的表征和新技术,从OE中我们证明存在于随机历史依赖性策略类别中的最佳策略。 最后,我们提出了两个具有无限转换/成本率的示例,以说明我们的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号