首页> 外文会议>International colloquium on automata, languages and programming >A Pseudo-Polynomial Algorithm for Mean Payoff Stochastic Games with Perfect Information and a Few Random Positions
【24h】

A Pseudo-Polynomial Algorithm for Mean Payoff Stochastic Games with Perfect Information and a Few Random Positions

机译:具有完美信息和少量随机位置的均值随机随机博弈的伪多项式算法

获取原文

摘要

We consider two-person zero-sum stochastic mean payoff games with perfect information, or BWR-games, given by a digraph G = (V,E), with local rewards r : E → R, and three types of vertices: black V_B, white Vw, and random V_R forming a partition of V. It is a long-standing open question whether a polynomial time algorithm for BWR-games exists, or not. In fact, a pseudo-polynomial algorithm for these games would already imply their polynomial solvability. In this paper, we show that BWR-games with a constant number of random nodes can be solved in pseudo-polynomial time. That is, for any such game with a few random nodes |V_R| = O(1), a saddle point in pure stationary strategies can be found in time polynomial in |V_W| + |V_B|, the maximum absolute local reward R, and the common denominator of the transition probabilities.
机译:我们考虑具有理想信息的两人零和随机平均收益博弈或BWR博弈,由有向图G =(V,E)给出,具有局部奖励r:E→R以及三种类型的顶点:黑色V_B ,白色Vw和随机V_R形成V的分区。是否存在用于BWR游戏的多项式时间算法是一个长期存在的问题。实际上,针对这些游戏的伪多项式算法已经暗示了它们的多项式可解性。在本文中,我们证明了具有恒定数量随机节点的BWR博弈可以在伪多项式时间内求解。也就是说,对于任何这样的带有几个随机节点的游戏| V_R | = O(1),纯平稳策略中的鞍点可以在| V_W |的时间多项式中找到+ | V_B |,最大绝对局部奖励R和转移概率的公分母。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号