【24h】

Sharing Information in Adversarial Bandit

机译:在敌对强盗中共享信息

获取原文
获取外文期刊封面目录资料

摘要

2-Player games in general provide a popular platform for research in Artificial Intelligence (AI). One of the main challenges coming from this platform is approximating a Nash Equilibrium (NE) over zero-sum matrix games. While the problem of computing such a Nash Equilibrium is solvable in polynomial time using Linear Programming (LP), it rapidly becomes infeasible to solve as the size of the matrix grows; a situation commonly encountered in games. This paper focuses on improving the approximation of a NE for matrix games such that it outperforms the state-of-the-art algorithms given a finite (and rather small) number T of oracle requests to rewards. To reach this objective, we propose to share information between the different relevant pure strategies. We show both theoretically by improving the bound and empirically by experiments on artificial matrices and on a real-world game that information sharing leads to an improvement of the approximation of the NE.
机译:通常,两人游戏为人工智能(AI)研究提供了一个受欢迎的平台。该平台面临的主要挑战之一是在零和矩阵游戏中逼近Nash均衡(NE)。尽管使用线性规划(LP)可以在多项式时间内解决这种Nash平衡问题,但是随着矩阵大小的增长,解决该问题很快变得不可行。游戏中经常遇到的情况。本文着重于改进矩阵游戏中NE的近似值,使其在给定有限(但很小)的Oracle奖励请求T的情况下,胜过最新的算法。为了实现这一目标,我们建议在不同的相关纯策略之间共享信息。我们在理论上通过改进边界来进行展示,在经验上通过在人工矩阵上进行的实验以及在现实世界中的游戏中进行展示,即信息共享都可以提高NE的逼近度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号