Average case analysis of the classical algorithm for Markov decision processes with Bachi objectives

Chatterjee Krishnendu; Joglekar Manas; Shah Nisarg

首页> 外文期刊>Theoretical computer science >Average case analysis of the classical algorithm for Markov decision processes with Bachi objectives

【24h】

Average case analysis of the classical algorithm for Markov decision processes with Bachi objectives

机译：具有Bachi目标的Markov决策过程的经典算法的平均案例分析

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

We consider Markov decision processes (MDPs) with specifications given as Buchi (liveness) objectives, and examine the problem of computing the set of almost-sure winning vertices such that the objective can be ensured with probability I from these vertices. We study for the first time the average-case complexity of the classical algorithm for computing the set of almost-sure winning vertices for MDPs with Buchi objectives. Our contributions are as follows: First, we show that for MDPs with constant out-degree the expected number of iterations is at most logarithmic and the average-case running time is linear (as compared to the worst-case linear number of iterations and quadratic time complexity). Second, for the average-case analysis over all MDPs we show that the expected number of iterations is constant and the average-case running time is linear (again as compared to the worst-case linear number of iterations and quadratic time complexity). Finally we also show that when all MDPs are equally likely, the probability that the classical algorithm requires more than a constant number of iterations is exponentially small. (C) 2015 Elsevier B.V. All rights reserved.

机译：我们考虑以Buchi（活动性）目标给出规格的Markov决策过程（MDP），并研究计算几乎确定的获胜顶点集的问题，以便可以从这些顶点以概率I保证目标。我们首次研究了经典算法的平均情况下的复杂度，该算法用于计算具有Buchi目标的MDP的几乎确定的获胜顶点集。我们的贡献如下：首先，我们证明对于具有恒定度数的MDP，预期的迭代次数最多是对数的，并且平均情况下的运行时间是线性的（与最坏情况下的线性迭代次数和二次数相比）时间复杂度）。其次，对于所有MDP的平均情况分析，我们表明预期的迭代次数是恒定的，平均情况下的运行时间是线性的（与最坏情况的线性迭代次数和二次时间复杂度相比）。最后，我们还表明，当所有MDP的可能性均等时，经典算法需要比恒定迭代次数更多的概率呈指数减小。（C）2015 Elsevier B.V.保留所有权利。

著录项

来源
《Theoretical computer science》 |2015年第null期|共19页
作者
Chatterjee Krishnendu; Joglekar Manas; Shah Nisarg;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类计算技术、计算机技术;
关键词
Average-case analysis; Buchi objectives; Markov decision processes (MDPs); Random graphs;

机译：平均案例分析;Buchi目标;Markov决策过程（MDP）;随机图;

相似文献

外文文献
中文文献
专利

1. Average case analysis of the classical algorithm for Markov decision processes with Bachi objectives [J] . Chatterjee Krishnendu, Joglekar Manas, Shah Nisarg Theoretical computer science . 2015,第Null期

机译：具有Bachi目标的Markov决策过程的经典算法的平均案例分析
2. Symbolic algorithms for qualitative analysis of Markov decision processes with Buechi objectives [J] . Krishnendu Chatterjee, Monika Henzinger, Manas Joglekar, Formal Methods in System Design . 2013,第3期

机译：具有Buechi目标的Markov决策过程的定性分析的符号算法
3. Reinforcement learning based algorithms for average cost Markov Decision Processes [J] . Abdulla MS, Bhatnagar S Discrete event dynamic systems: Theory and applications . 2007,第1期

机译：基于增强学习的平均成本马尔可夫决策过程算法
4. Symbolic Algorithms for Qualitative Analysis of Markov Decision Processes with Biichi Objectives [C] . Krishnendu Chatterjee, Monika Henzinger, Manas Joglekar, Computer aided verification . 2011

机译：具有Biichi目标的Markov决策过程定性分析的符号算法
5. Increasing scalability in algorithms for centralized and decentralized partially observable Markov decision processes: Efficient decision-making and coordination in uncertain environments. [D] . Amato, Christopher. 2010

机译：用于集中式和分散式部分可观察的马尔可夫决策过程的算法中的可伸缩性不断增强：在不确定的环境中进行有效的决策和协调。
6. Multi-Objective Markov Decision Processes for Data-Driven Decision Support [O] . Daniel J. Lizotte, Eric B. Laber -1

机译：数据驱动决策支持的多目标马尔可夫决策过程
7. Average Case Analysis of the Classical Algorithm for Markov Decision Processes with Büchi Objectives [O] . Krishnendu Chatterjee, Manas Joglekar, Nisarg Shah 2016

机译：Büchi目标马尔可夫决策过程经典算法的平均个案分析

Average case analysis of the classical algorithm for Markov decision processes with Bachi objectives

摘要

著录项

相似文献

相关主题

期刊订阅