Delayed Nondeterminism in Continuous-Time Markov Decision Processes

Abstract

Schedulers in randomly timed games can be classified by whether or not they use timing information. We consider continuous-time Markov decision processes (CTMDPs) and define a hierarchy of positional (P) and history-dependent (H) schedulers that induce strictly tighter bounds on quantitative properties of CTMDPs. This classification into time-abstract (TA), total-time (TT) and fully time-dependent (T) schedulers is mainly based on the kind of timing details that the schedulers may exploit. We investigate when the resolution of nondeterminism may be deferred. In particular, we show that TTP and TAP schedulers allow for delaying nondeterminism for all measures, whereas this holds neither for TP nor for any TAH scheduler. The core of our study is a transformation on CTMDPs which unifies the speed of outgoing transitions per state.
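
One possible reading of "unifies the speed of outgoing transitions per state" is that, after the transformation, all actions enabled in a state share the same total exit rate. The sketch below only checks that property on a small model; it does not implement the transformation from the paper. The dictionary encoding, the function names and the example rates are assumptions made for illustration.

```python
import math
from collections import defaultdict

# Hypothetical encoding (not from the paper):
# rates[(state, action)] maps each successor state to a transition rate.

def exit_rates(rates):
    """Total outgoing rate for every (state, action) pair."""
    return {key: sum(successors.values()) for key, successors in rates.items()}

def is_locally_uniform(rates, tol=1e-9):
    """True if, in every state, all enabled actions have the same exit rate."""
    per_state = defaultdict(list)
    for (state, _action), rate in exit_rates(rates).items():
        per_state[state].append(rate)
    return all(
        math.isclose(min(values), max(values), rel_tol=tol, abs_tol=tol)
        for values in per_state.values()
    )

# In s0, action "a" leaves with rate 3.0 and action "b" with rate 5.0.
example = {
    ("s0", "a"): {"s1": 1.0, "s2": 2.0},
    ("s0", "b"): {"s2": 5.0},
    ("s1", "a"): {"s0": 4.0},
    ("s2", "a"): {"s2": 1.0},
}

# Same model, but both actions in s0 now leave with rate 3.0.
uniform_example = dict(example)
uniform_example[("s0", "b")] = {"s2": 3.0}

print(is_locally_uniform(example))
print(is_locally_uniform(uniform_example))
```

Under this reading, the first model is not uniform in s0 because a scheduler's choice of action also changes how fast the state is left; the second model removes that dependence.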

Bibliographic details

Source: Foundations of Software Science and Computational Structures | 2009 | pp. 364-379 | 16 pages
Conference location: York (GB)
Authors: Martin R. Neuhäußer; Mariëlle Stoelinga; Joost-Pieter Katoen
Author affiliations:

MOVES Group, RWTH Aachen University, Germany; FMT Group, University of Twente, The Netherlands

FMT Group, University of Twente, The Netherlands

MOVES Group, RWTH Aachen University, Germany; FMT Group, University of Twente, The Netherlands

Original format: PDF
Language: English
Chinese Library Classification: Computer software
Similar literature


1. An approximation approach for the deviation matrix of continuous-time Markov processes with application to Markov decision theory [J]. Leder N., Heidergott B., Hordijk A. Operations Research: The Journal of the Operations Research Society of America. 2010, No. 4aPta1
2. Policy learning in continuous-time Markov decision processes using Gaussian Processes [J]. Bartocci Ezio, Bortolussi Luca, Brazdil Tomas. Performance Evaluation. 2017, No. nova
3. Variance Optimization for Continuous-Time Markov Decision Processes [J]. Yaqing Fu. Open Journal of Statistics. 2019, No. 2
4. Delayed Nondeterminism in Continuous-Time Markov Decision Processes [C]. Martin R. Neuhausser, Marielle Stoelinga, Joost-Pieter Katoen. Joint European Conferences on Theory and Practice of Software. 2009
5. Applications of spanning trees to continuous-time Markov processes, with emphasis on loss systems [D]. McNamara, Richard C. 2004
6. Using model-based proposals for fast parameter inference on discrete state space continuous-time Markov processes [O]. C. M. Pooley, S. C. Bishop, G. Marion. 2015
7. Delayed Nondeterminism in Continuous-Time Markov Decision Processes [O]. Martin R. Neuhäußer, Mariëlle Stoelinga, Joost-Pieter Katoen. 2010
