首页> 外文会议>Algorithms and complexity >On Strategy Improvement Algorithms for Simple Stochastic Games
【24h】

On Strategy Improvement Algorithms for Simple Stochastic Games

机译:简单随机博弈的策略改进算法研究

获取原文
获取原文并翻译 | 示例

摘要

The study of simple stochastic games (SSGs) was initiated by Condon for analyzing the computational power of randomized space-bounded alternating Turing machines. The game is played by two players, MAX and MIN, on a directed multigraph, and when the play terminates at a sink s, MAX wins from MIN a payoff p(s) ∈ [0,1]. Condon showed that the SSG value problem, which given a SSG asks whether the expected payoff won by MAX exceeds 1/2 when both players use their optimal strategies, is in NP n coNP. However, the exact complexity of this problem remains open as it is not known whether the problem is in P or is hard for some natural complexity class. In this paper, we study the computational complexity of a strategy improvement algorithm by Hoffman and Karp for this problem. The Hoffman-Karp algorithm converges to optimal strategies of a given SSG, but no nontrivial bounds were previously known on its running time. We show a bound of O(2~n) on the convergence time of this algorithm, and a bound of O(2~(0.78n)) on a randomized variant. These are the first non-trivial upper bounds on the convergence time of these strategy improvement algorithms.
机译:Condon发起了简单随机游戏(SSG)的研究,以分析随机有界的交替图灵机的计算能力。游戏由两个玩家MAX和MIN在有向多重图形上进行,并且当游戏在接收器s处终止时,MAX从MIN获胜,获得的收益为p(s)∈[0,1]。 Condon指出,SSG的价值问题在于NP n coNP中,该问题由SSG询问,当双方都使用最佳策略时,MAX赢得的预期收益是否超过1/2。但是,此问题的确切复杂性仍然是未知的,因为尚不清楚该问题是P还是难于解决某些自然复杂度问题。在本文中,我们研究了Hoffman和Karp针对此问题的策略改进算法的计算复杂性。 Hoffman-Karp算法收敛于给定SSG的最佳策略,但是以前在其运行时间上没有非平凡的界限。我们在该算法的收敛时间上显示了O(2〜n / n)的界线,在随机变量上显示了O(2〜(0.78n))的界线。这些是这些策略改进算法收敛时间的第一个非平凡的上限。

著录项

  • 来源
    《Algorithms and complexity》|2010年|p.240-251|共12页
  • 会议地点 Rome(IT);Rome(IT)
  • 作者单位

    Department of Computer Science and Engineering, University of South Florida, Tampa, FL 33620, USA;

    Department of Computer Science and Engineering, University of South Florida, Tampa, FL 33620, USA;

    Virginia Bioinformatics Institute and Dept. of Computer Science, Virginia Tech. Blacksburg, VT 24061, USA;

  • 会议组织
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 计算技术、计算机技术;
  • 关键词

  • 入库时间 2022-08-26 13:58:08

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号