首页> 外文会议>International Conference on Web and Internet Economics >Computing Approximate Nash Equilibria in Polymatrix Games
【24h】

Computing Approximate Nash Equilibria in Polymatrix Games

机译:计算大型纳什比赛中的近似纳什均衡

获取原文

摘要

In an ∈-Nash equilibrium, a player can gain at most ∈ by unilaterally changing his behaviour. For two-player (bimatrix) games with payoffs in [0,1], the best-known ∈ achievable in polynomial time is 0.3393 [23]. In general, for n-player games an ∈-Nash equilibrium can be computed in polynomial time for an ∈ that is an increasing function of n but does not depend on the number of strategies of the players. For three-player and four-player games the corresponding values of ∈ are 0.6022 and 0.7153, respectively. Polymatrix games are a restriction of general n-player games where a player's payoff is the sum of payoffs from a number of bimatrix games. There exists a very small but constant ∈ such that computing an ε-Nash equilibrium of a polymatrix game is PPAD-hard. Our main result is that an (0.5 + δ)-Nash equilibrium of an n-player polymatrix game can be computed in time polynomial in the input size and 1/δ. Inspired by the algorithm of Tsaknakis and Spirakis [23], our algorithm uses gradient descent on the maximum regret of the players.
机译:在∈纳什均衡中,玩家可以通过单方面改变他的行为来最多获得。对于在[0,1]中的两位玩家(Bimatrix)游戏中,多项式时间可实现的最着名的∈是0.3393 [23]。通常,对于N播放游戏,可以在多项式时间中计算∈-nash均衡,这是n的越来越多的函数,但不依赖于玩家的策略的数量。对于三位玩家和四人游戏,相应的值分别为0.6022和0.7153。 Polypatrix游戏是一般的N播放游戏的限制,球员的收益是来自许多Bimatrix游戏的收益总和。存在一个非常小但恒定的∈,使得计算Poldatrix游戏的ε-NASH平衡是PPAD-HARD。我们的主要结果是,在输入大小和1 /δ中,可以在时间多项式中计算N-Play Poldatrix游戏的(0.5 +Δ)的-Nash均衡。灵感来自Tsaknakis和Spirakis的算法[23],我们的算法在玩家的最大遗憾上使用梯度下降。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号