首页> 外文会议>Reliability and Quality in Design >A Polynomial-time Nash Equilibrium Algorithm for Repeated Stochastic Games
【24h】

A Polynomial-time Nash Equilibrium Algorithm for Repeated Stochastic Games

机译:重复随机博弈的多项式时间纳什均衡算法

获取原文
获取原文并翻译 | 示例

摘要

We present a polynomial-time algorithm that always finds an (approximate) Nash equilibrium for repeated two-player stochastic games. The algorithm exploits the folk theorem to derive a strategy profile that forms an equilibrium by buttressing mutually beneficial behavior with threats, where possible. One component of our algorithm efficiently searches for an approximation of the egalitarian point, the fairest pareto-efficient solution. The paper concludes by applying the algorithm to a set of grid games to illustrate typical solutions the algorithm finds. These solutions compare very favorably to those found by competing algorithms, resulting in strategies with higher social welfare, as well as guaranteed computational efficiency.
机译:我们提出了一种多项式时间算法,该算法总是为重复的两人随机游戏找到一个(近似)纳什均衡。该算法利用民间定理来推导策略配置文件,该策略配置文件在可能的情况下通过用威胁支持互利行为来形成均衡。我们算法的一个组成部分有效地搜索了均等点(最公平的有效解)的近似值。本文通过将算法应用于一组网格游戏来结束,以说明该算法找到的典型解决方案。与竞争算法相比,这些解决方案具有非常好的优势,从而带来了更高的社会福利以及保证的计算效率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号