We consider the problem of learning sparse polymatrix games from observations of strategic interactions. We show that a polynomial-time method based on $\ell_{1,2}$-group-regularized logistic regression recovers a game whose Nash equilibria are the $\epsilon$-Nash equilibria of the game from which the data were generated (the true game), using $O(m^4 d^4 \log(pd))$ samples of strategy profiles, where $m$ is the maximum number of pure strategies of a player, $p$ is the number of players, and $d$ is the maximum degree of the game graph. Under slightly more stringent separability conditions on the payoff matrices of the true game, we show that our method learns a game with exactly the same Nash equilibria as the true game. We also show that $\Omega(d \log(pm))$ samples are necessary for any method to consistently recover a game with the same Nash equilibria as the true game from observations of strategic interactions.
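The core estimator named above, $\ell_{1,2}$-group-regularized logistic regression, can be sketched generically as logistic regression with a group-lasso penalty solved by proximal gradient descent. The sketch below is a minimal illustration of the regularizer on synthetic data, not the paper's exact estimator (which is fit per player on strategy profiles); all variable names, the step size, and the group layout are illustrative assumptions.

```python
import numpy as np

def group_soft_threshold(w, groups, t):
    """Proximal operator of t * sum_g ||w_g||_2: shrink each group's norm by t,
    zeroing the whole group when its norm falls below t (group sparsity)."""
    out = w.copy()
    for g in groups:
        norm = np.linalg.norm(w[g])
        out[g] = 0.0 if norm <= t else (1.0 - t / norm) * w[g]
    return out

def group_lasso_logreg(X, y, groups, lam=0.15, lr=0.1, iters=500):
    """Logistic regression with an l_{1,2} group penalty, fit by
    proximal gradient descent (illustrative fixed step size)."""
    n, d = X.shape
    w = np.zeros(d)
    for _ in range(iters):
        p = 1.0 / (1.0 + np.exp(-X @ w))   # sigmoid predictions
        grad = X.T @ (p - y) / n           # gradient of mean logistic loss
        w = group_soft_threshold(w - lr * grad, groups, lr * lam)
    return w
```

On data where only one group of coefficients is truly active, the penalty drives the inactive group's weights to (near) zero as a block, which is the mechanism the sample-complexity bound exploits for recovering the sparse game graph.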