首页> 外文会议>American Control Conference >A simple learning rule in games and its convergence to pure-strategy Nash equilibria
【24h】

A simple learning rule in games and its convergence to pure-strategy Nash equilibria

机译:游戏中的简单学习规则及其向纯策略纳什均衡的收敛

获取原文

摘要

We propose a simple learning rule in games. The proposed rule only requires that (i) if there exists at least one strictly better reply (SBR), an agent switches its action to each SBR with positive probability or stay with the same action (with positive probability), and (ii) when there is no SBR, the agent either stays with the previous action or switches to another action that yields the same payoff. We first show that some of existing algorithms (or simple modifications) are special cases of our proposed algorithm. Secondly, we demonstrate that this intuitive rule guarantees almost sure convergence to a pure-strategy Nash equilibrium in a large class of games that we call generalized weakly acyclic games. Finally, we show that the probability that the action profile does not converge to a pure-strategy Nash equilibrium decreases geometrically fast in the aforementioned class of games.
机译:我们提出了一个简单的游戏学习规则。提议的规则仅要求(i)如果存在至少一个严格更好的答复(SBR),则代理将其动作以肯定的概率切换到每个SBR或停留在相同的动作(以肯定的概率),以及(ii)当没有SBR,座席要么停留在上一个动作,要么切换到另一个产生相同收益的动作。我们首先证明现有的某些算法(或简单的修改)是我们提出的算法的特例。其次,我们证明了这种直观的规则可以保证在我们称为广义弱非循环博弈的一大类博弈中几乎可以肯定地收敛到纯策略纳什均衡。最后,我们证明了在上述类游戏中,动作轮廓未收敛到纯策略纳什均衡的概率在几何上快速减小。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号