首页> 外文期刊>Mathematical methods of operations research >Continuous-time Markov decision processes with risk-sensitive finite-horizon cost criterion
【24h】

Continuous-time Markov decision processes with risk-sensitive finite-horizon cost criterion

机译:具有风险敏感的有限水平成本准则的连续时间马尔可夫决策过程

获取原文
获取原文并翻译 | 示例
           

摘要

This paper studies continuous-time Markov decision processes with a denumerable state space, a Borel action space, bounded cost rates and possibly unbounded transition rates under the risk-sensitive finite-horizon cost criterion. We give the suitable optimality conditions and establish the Feynman-Kac formula, via which the existence and uniqueness of the solution to the optimality equation and the existence of an optimal deterministic Markov policy are obtained. Moreover, employing a technique of the finite approximation and the optimality equation, we present an iteration method to compute approximately the optimal value and an optimal policy, and also give the corresponding error estimations. Finally, a controlled birth and death system is used to illustrate the main results.
机译:本文研究了在风险敏感的有限水平成本准则下具有可数状态空间,Borel动作空间,有界成本率和可能无界转移率的连续时间马尔可夫决策过程。我们给出合适的最优性条件,并建立Feynman-Kac公式,从而获得最优性方程解的存在性和唯一性以及最优确定性马尔可夫策略的存在性。此外,我们采用有限逼近和最优方程的技术,提出了一种迭代方法来近似计算最优值和最优策略,并给出相应的误差估计。最后,使用受控的生死系统来说明主要结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号