首页> 外文会议>Tools and Algorithms for the Construction and Analysis of Systems >Efficient Computation of Time-Bounded Reachability Probabilities in Uniform Continuous-Time Markov Decision Processes
【24h】

Efficient Computation of Time-Bounded Reachability Probabilities in Uniform Continuous-Time Markov Decision Processes

机译:均匀连续时间马尔可夫决策过程中有界可达概率的有效计算

获取原文

摘要

A continuous-time Markov decision process (CTMDP) is a generalization of a continuous-time Markov chain in which both probabilistic and nondeterministic choices co-exist. This paper presents an efficient algorithm to compute the maximum (or minimum) probability to reach a set of goal states within a given time bound in a uniform CTMDP, i.e., a CTMDP in which the delay time distribution per state visit is the same for all states. We prove that these probabilities coincide for (time-abstract) history-dependent and Markovian schedulers that resolve nondeterminism either deterministically or in a randomized way.
机译:连续时间马尔可夫决策过程(CTMDP)是连续时间马尔可夫链的概括,其中概率选择和不确定性选择都共存。本文提出了一种有效的算法,用于计算在统一CTMDP中给定时间范围内达到一组目标状态的最大(或最小)概率,即一种CTMDP,其中每个状态访问的延迟时间分布对于所有状态。我们证明,这些概率对于(时间抽象)历史相关的和马尔可夫调度程序是一致的,这些调度程序可以确定性地或以随机方式解决不确定性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号