首页> 外文期刊>Computers & operations research >Optimal decisions for continuous time Markov decision processes overn finite planning horizons
【24h】

Optimal decisions for continuous time Markov decision processes overn finite planning horizons

机译:有限规划范围内连续时间马尔可夫决策过程的最优决策

获取原文
获取原文并翻译 | 示例

摘要

The computation of epsilon-optimal policies for continuous time Markov decision processes (CTMDPs) over finite time intervals is a sophisticated problem because the optimal policy may change at arbitrary times. Numerical algorithms based on time discretization or uniformization have been proposed for the computation of optimal policies. The uniformization based algorithm has shown to be more reliable and often also more efficient but is currently only available for processes where the gain or re
机译:对于有限时间间隔内的连续时间马尔可夫决策过程(CTMDP),ε最优策略的计算是一个复杂的问题,因为最优策略可能会在任意时间发生变化。提出了基于时间离散化或均匀化的数值算法来计算最优策略。基于均匀化的算法已显示出更可靠且通常也更有效,但目前仅适用于增益或增益

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号