Nash Convergence of Gradient Dynamics in General-Sum Games

机译：一般和博弈中梯度动力学的纳什收敛

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Multi-agent games are becoming an increasingly prevalent formalism for the study of electronic commerce and auctions. The speed at which transactions can take place and the growing complexity of electronic market-places makes the study of computationally simple agents an appealing direction. In this work, we analyze the behavior of agents that incrementally adapt their strategy through gradient ascent on expected payoff, in the simple setting of two-player, two-action, iter-ated general-sum games, and present a sur-prising result. We show that either the agents will converge to a Nash equilibrium, or if the strategies themselves do not converge, then their average payoffs will nevertheless con-verge to the payoffs of a Nash equilibrium.

机译：对于电子商务和拍卖的研究，多主体游戏正成为越来越普遍的形式主义。交易发生的速度以及电子市场的日益复杂性使得对计算简单的代理的研究成为有吸引力的方向。在这项工作中，我们分析了在两人，两动作，迭代的一般和游戏的简单设置下，通过梯度上升按预期收益逐步调整其策略的特工的行为，并给出了令人惊讶的结果。我们表明，或者说行为人将收敛于纳什均衡，或者如果策略本身不收敛，那么他们的平均收益仍将收敛于纳什均衡的收益。

著录项

来源
《Sixteenth Conference (2000) on Uncertainty in Artificial Intelligence June 30-July 3, 2000 Stanford University, Stanford, California》|2000年|p.541-548|共8页
会议地点 Stanford CA(US);Stanford CA(US)
作者
Satinder Singh; Michael Kearns; Yishay Mansour;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类自动化技术、计算机技术;
关键词

相似文献

外文文献
中文文献
专利

1. Learning Nash Equilibrium for General-Sum Markov Games from Batch Data [J] . Julien Perolat, Florian Strub, Bilal Piot, JMLR: Workshop and Conference Proceedings . 2017,第2009期

机译：从批处理数据学习通用和马尔可夫博弈的纳什均衡
2. General-sum stochastic games: Verifiability conditions for Nash equilibria [J] . H. L. Prasad, S. Bhatnagar Automatica . 2012,第11期

机译：广义和随机游戏：纳什均衡的可验证性条件
3. Nash Q-Learning for General-Sum Stochastic Games [J] . Hu Junling, Wellman Michael P. Journal of machine learning research . 2003,第Nov期

机译：Nash Q-学习常规和随机游戏
4. Nash Convergence of Gradient Dynamics in General-Sum Games [C] . Satinder Singh, Michael Kearns, Yishay Mansour Conference on uncertainty in artificial intelligence . 2000

机译：普通和游戏中梯度动力学的纳入融合
5. Nash strategies for dynamic noncooperative linear quadratic sequential games [D] . Shen, Dan 2006

机译：动态非合作式线性二次顺序博弈的纳什策略
6. Dynamics morphogenesis and convergence of evolutionary quantum Prisoners Dilemma games on networks [O] . Angsheng Li, Xi Yong -1

机译：网络上进化量子囚徒困境游戏的动力学形态发生和收敛
7. Actor-Critic Algorithms for Learning Nash Equilibria in N-player General-Sum Games [O] . Prasad, H. L, Prashanth, L. A., Bhatnagar, Shalabh 2015

机译：N-player中学习纳什均衡的演员批评算法一般和游戏
8. Distributed Convergence to Nash Equilibria in Two-Network Zero-Sum Games. [R] . Gharesifard, B., Cortes, J. 2013

机译：双网零和博弈中纳什均衡的分布收敛性。

Nash Convergence of Gradient Dynamics in General-Sum Games

摘要

著录项

相似文献

相关主题

期刊订阅