...
首页> 外文期刊>Kybernetika >RISK PROBABILITY OPTIMIZATION PROBLEM FOR FINITE HORIZON CONTINUOUS TIME MARKOV DECISION PROCESSES WITH LOSS RATE
【24h】

RISK PROBABILITY OPTIMIZATION PROBLEM FOR FINITE HORIZON CONTINUOUS TIME MARKOV DECISION PROCESSES WITH LOSS RATE

机译:有限地平线连续时间马尔可夫决策过程的风险概率优化问题

获取原文
获取原文并翻译 | 示例
           

摘要

This paper presents a study the risk probability optimality for finite horizon continuous-time Markov decision process with loss rate and unbounded transition rates. Under drift condition, which is slightly weaker than the regular condition, as detailed in existing literature on the risk probability optimality Semi-Markov decision processes, we prove that the value function is the unique solution of the corresponding optimality equation, and demonstrate the existence of a risk probability optimization policy using an iteration technique. Furthermore, we provide verification of the imposed condition with two examples of controlled birth-and-death system and risk control, and further demonstrate that a value iteration algorithm can be used to calculate the value function and develop an optimal policy.
机译:本文介绍了有限地平线连续时间马尔可夫决策过程的风险概率最优性,具有损失率和无绑定的过渡率。 在漂移条件下,它比常规条件略微弱,如现有文献中的风险概率最佳半马尔可夫决策过程所详述,我们证明了价值函数是相应的最优性方程的独特解决方案,并证明存在的存在 使用迭代技术的风险概率优化策略。 此外,我们提供了具有两个受控出生和死亡系统和风险控制的示例的强加条件的验证,并进一步证明了价值迭代算法可以用于计算价值函数并开发最佳政策。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号