首页> 外文会议>Foundations of software science and computational structures >Delayed Nondeterminism in Continuous-Time Markov Decision Processes
【24h】

Delayed Nondeterminism in Continuous-Time Markov Decision Processes

机译:连续时间马尔可夫决策过程中的延迟不确定性

获取原文
获取原文并翻译 | 示例

摘要

Schedulers in randomly timed games can be classified as to whether they use timing information or not. We consider continuous-time Markov decision processes (CTMDPs) and define a hierarchy of positional (P) and history-dependent (H) schedulers which induce strictly tighter bounds on quantitative properties on CTMDPs. This classification into time abstract (TA), total time (TT) and fully time-dependent (T) schedulers is mainly based on the kind of timing details that the schedulers may exploit. We investigate when the resolution of nondeterminism may be deferred. In particular, we show that TTP and TAP schedulers allow for delaying nondeterminism for all measures, whereas this does neither hold for TP nor for any TAH scheduler. The core of our study is a transformation on CTMDPs which unifies the speed of outgoing transitions per state.
机译:可以将随机计时游戏中的调度程序分类为是否使用计时信息。我们考虑连续时间的马尔可夫决策过程(CTMDP),并定义了位置(P)和历史相关(H)调度程序的层次结构,这些调度程序对CTMDP的定量属性严格限制了界限。分为时间摘要(TA),总时间(TT)和完全依赖时间(T)的调度程序,这主要是基于调度程序可能利用的时序详细信息的种类。我们调查何时可以推迟不确定性的解决。特别是,我们表明TTP和TAP调度程序允许延迟所有度量的不确定性,而这对于TP或任何TAH调度程序均不成立。我们研究的核心是对CTMDP的转换,该转换统一了每个状态的传出转换速度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号