首页> 外文期刊>Theoretical computer science >Average case analysis of the classical algorithm for Markov decision processes with Bachi objectives
【24h】

Average case analysis of the classical algorithm for Markov decision processes with Bachi objectives

机译:具有Bachi目标的Markov决策过程的经典算法的平均案例分析

获取原文
获取原文并翻译 | 示例
       

摘要

We consider Markov decision processes (MDPs) with specifications given as Buchi (liveness) objectives, and examine the problem of computing the set of almost-sure winning vertices such that the objective can be ensured with probability I from these vertices. We study for the first time the average-case complexity of the classical algorithm for computing the set of almost-sure winning vertices for MDPs with Buchi objectives. Our contributions are as follows: First, we show that for MDPs with constant out-degree the expected number of iterations is at most logarithmic and the average-case running time is linear (as compared to the worst-case linear number of iterations and quadratic time complexity). Second, for the average-case analysis over all MDPs we show that the expected number of iterations is constant and the average-case running time is linear (again as compared to the worst-case linear number of iterations and quadratic time complexity). Finally we also show that when all MDPs are equally likely, the probability that the classical algorithm requires more than a constant number of iterations is exponentially small. (C) 2015 Elsevier B.V. All rights reserved.
机译:我们考虑以Buchi(活动性)目标给出规格的Markov决策过程(MDP),并研究计算几乎确定的获胜顶点集的问题,以便可以从这些顶点以概率I保证目标。我们首次研究了经典算法的平均情况下的复杂度,该算法用于计算具有Buchi目标的MDP的几乎确定的获胜顶点集。我们的贡献如下:首先,我们证明对于具有恒定度数的MDP,预期的迭代次数最多是对数的,并且平均情况下的运行时间是线性的(与最坏情况下的线性迭代次数和二次数相比)时间复杂度)。其次,对于所有MDP的平均情况分析,我们表明预期的迭代次数是恒定的,平均情况下的运行时间是线性的(与最坏情况的线性迭代次数和二次时间复杂度相比)。最后,我们还表明,当所有MDP的可能性均等时,经典算法需要比恒定迭代次数更多的概率呈指数减小。 (C)2015 Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号