Symbolic algorithms for qualitative analysis of Markov decision processes with Buechi objectives

Krishnendu Chatterjee; Monika Henzinger; Manas Joglekar; Nisarg Shah

首页> 外文期刊>Formal Methods in System Design >Symbolic algorithms for qualitative analysis of Markov decision processes with Buechi objectives

【24h】

Symbolic algorithms for qualitative analysis of Markov decision processes with Buechi objectives

机译：具有Buechi目标的Markov决策过程的定性分析的符号算法

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

We consider Markov decision processes (MDPs) with Buechi (liveness) objectives. We consider the problem of computing the set of almost-sure winning states from where the objective can be ensured with probability 1. Our contributions are as follows: First, we present the first subquadratic symbolic algorithm to compute the almost-sure winning set for MDPs with Biichi objectives; our algorithm takes O(n • m~（1/2）) symbolic steps as compared to the previous known algorithm that takes O(n~2) symbolic steps, where n is the number of states and m is the number of edges of the MDP. In practice MDPs have constant out-degree, and then our symbolic algorithm takes O(n • n~（1/2）) symbolic steps, as compared to the previous known O(n~2) symbolic steps algorithm. Second, we present a new algorithm, namely win-lose algorithm, with the following two properties: (a) the algorithm iteratively computes subsets of the almost-sure winning set and its complement, as compared to all previous algorithms that discover the almost-sure winning set upon termination; and (b) requires O(n • k~（1/2）) symbolic steps, where K is the maximal number of edges of strongly connected components (scc's) of the MDP. The win-lose algorithm requires symbolic computation of scc's. Third, we improve the algorithm for symbolic sec computation; the previous known algorithm takes linear symbolic steps, and our new algorithm improves the constants associated with the linear number of steps. In the worst case the previous known algorithm takes 5•n symbolic steps, whereas our new algorithm takes 4•n symbolic steps.

机译：我们考虑具有Buechi（活动性）目标的Markov决策过程（MDP）。我们考虑计算可以从中以概率1保证目标的几乎确定的获胜状态集的问题。我们的贡献如下：首先，我们提出第一个二次符号算法来计算MDP的几乎确定的获胜集。以Biichi目标为目标；与之前已知的采用O（n〜2）符号步长的已知算法相比，我们的算法采用O（n•m〜（1/2））个符号步长，其中n是状态数，m是边缘的数量MDP。在实践中，MDP具有恒定的出度，因此与先前已知的O（n〜2）符号步长算法相比，我们的符号算法采用O（n•n〜（1/2））符号步长。其次，我们提出了一种新的算法，即输赢算法，它具有以下两个属性：（a）与发现几乎以下情况的所有先前算法相比，该算法迭代地计算几乎确定的获胜集合及其补集的子集：终止时确定赢钱；（b）需要O（n•k〜（1/2））个符号步，其中K是MDP的强连接组件（scc）的最大边数。双输算法需要对scc进行符号计算。第三，我们改进了符号秒计算的算法。先前的已知算法采用线性符号步长，而我们的新算法改进了与线性步长相关的常数。在最坏的情况下，以前的已知算法需要5•n个符号步，而我们的新算法需要4•n个符号步。

著录项

来源
《Formal Methods in System Design》 |2013年第3期|301-327|共27页
作者
Krishnendu Chatterjee; Monika Henzinger; Manas Joglekar; Nisarg Shah;
展开▼
作者单位

IST Austria, Klosterneuburg, Austria;

University of Vienna, Vienna, Austria;

Stanford University, Palo Alto, USA;

Carnegie Mellon University, Pittsburgh, USA;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
Markov decision processes; Probabilistic verification; Biichi objectives; Symbolic algorithms;

机译：马尔可夫决策过程;概率验证;Biichi目标;符号算法;

相似文献

外文文献
中文文献
专利

1. Average case analysis of the classical algorithm for Markov decision processes with Bachi objectives [J] . Chatterjee Krishnendu, Joglekar Manas, Shah Nisarg Theoretical computer science . 2015,第Null期

机译：具有Bachi目标的Markov决策过程的经典算法的平均案例分析
2. Average Case Analysis of the Classical Algorithm for Markov Decision Processes with B?chi Objectives [J] . Krishnendu Chatterjee, Manas Joglekar, Nisarg Shah LIPIcs : Leibniz International Proceedings in Informatics . 2012,第2期

机译：具有B？chi目标的经典马尔可夫决策过程算法的平均案例分析
3. A K-step look-ahead analysis of value iteration algorithms for Markov decision processes [J] . Meir Herzberg, Uri Yechiali European Journal of Operational Research . 1996,第3期

机译：Markov决策过程的值迭代算法的K步前瞻分析
4. Symbolic Algorithms for Qualitative Analysis of Markov Decision Processes with Biichi Objectives [C] . Krishnendu Chatterjee, Monika Henzinger, Manas Joglekar, Computer aided verification . 2011

机译：具有Biichi目标的Markov决策过程定性分析的符号算法
5. Increasing scalability in algorithms for centralized and decentralized partially observable Markov decision processes: Efficient decision-making and coordination in uncertain environments. [D] . Amato, Christopher. 2010

机译：用于集中式和分散式部分可观察的马尔可夫决策过程的算法中的可伸缩性不断增强：在不确定的环境中进行有效的决策和协调。
6. Multi-Objective Markov Decision Processes for Data-Driven Decision Support [O] . Daniel J. Lizotte, Eric B. Laber -1

机译：数据驱动决策支持的多目标马尔可夫决策过程
7. Symbolic Algorithms for Qualitative Analysis of Markov Decision Processes with B"uchi Objectives [O] . Chatterjee, Krishnendu, Henzinger, Monika, Joglekar, Manas, 2014

机译：马尔可夫决策定性分析的符号算法 B \“uchi目标的过程
8. Symbolic Heuristic Search for Factored Markov Decision Processes [R] . Feng, Z. , Hansen, E. A. 2003

机译：因子马尔可夫决策过程的符号启发式搜索

Symbolic algorithms for qualitative analysis of Markov decision processes with Buechi objectives

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅