Best-response dynamics in zero-sum stochastic games

Leslie David S.; Perkins Steven; Xu Zibo

Abstract

We define and analyse three learning dynamics for two-player zero-sum discounted-payoff stochastic games. A continuous-time best-response dynamic in mixed strategies is proved to converge to the set of Nash equilibrium stationary strategies. Extending this, we introduce a fictitious-play-like process in a continuous-time embedding of a stochastic zero-sum game, which is again shown to converge to the set of Nash equilibrium strategies. Finally, we present a modified δ-converging best-response dynamic, in which the discount rate converges to 1, and the learned value converges to the asymptotic value of the zero-sum stochastic game. The critical feature of all the dynamic processes is a separation of adaption rates: beliefs about the value of states adapt more slowly than the strategies adapt, and in the case of the δ-converging dynamic the discount rate adapts more slowly than everything else. (c) 2020 The Authors. Published by Elsevier Inc. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
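The separation of adaptation rates described in the abstract can be illustrated with a rough numerical sketch. The code below is not the authors' construction: it is an Euler discretisation written under stated assumptions. It assumes that each state has an auxiliary matrix game with entries (1 - delta) * reward + delta * expected continuation value, that strategies move toward a pure best response at rate 1, and that value beliefs move toward the current auxiliary payoff at the slower rate 1/t. The function names (`aux_game`, `toward_pure`, `simulate`), the step size `h`, and the two-state example game are all hypothetical and chosen only for illustration.

```python
def aux_game(s, u, reward, trans, delta):
    """Auxiliary matrix game at state s (assumed form): normalised stage
    payoff plus discounted expected continuation value under beliefs u."""
    return [[(1 - delta) * reward[s][a][b]
             + delta * sum(p * u[t] for t, p in enumerate(trans[s][a][b]))
             for b in range(len(reward[s][a]))]
            for a in range(len(reward[s]))]


def toward_pure(mix, target, h):
    """One Euler step of d(mix)/dt = e_target - mix."""
    return [(1 - h) * p + (h if i == target else 0.0)
            for i, p in enumerate(mix)]


def simulate(reward, trans, delta, horizon=200.0, h=0.01):
    """Two-timescale best-response sketch: strategies adapt at rate 1,
    value beliefs at rate 1/t. Row player maximises, column player minimises."""
    n_states = len(reward)
    x = [[1.0 / len(reward[s])] * len(reward[s]) for s in range(n_states)]
    y = [[1.0 / len(reward[s][0])] * len(reward[s][0]) for s in range(n_states)]
    u = [0.0] * n_states
    for k in range(int(round(horizon / h))):
        t = 1.0 + k * h  # continuous time, started at 1 so that 1/t is finite
        new_x, new_y, new_u = [], [], []
        for s in range(n_states):
            m = aux_game(s, u, reward, trans, delta)
            n_a, n_b = len(x[s]), len(y[s])
            row = [sum(m[a][b] * y[s][b] for b in range(n_b)) for a in range(n_a)]
            col = [sum(m[a][b] * x[s][a] for a in range(n_a)) for b in range(n_b)]
            payoff = sum(x[s][a] * row[a] for a in range(n_a))
            new_x.append(toward_pure(x[s], row.index(max(row)), h))  # fast
            new_y.append(toward_pure(y[s], col.index(min(col)), h))  # fast
            new_u.append(u[s] + (h / t) * (payoff - u[s]))           # slow
        x, y, u = new_x, new_y, new_u
    return x, y, u


# Hypothetical two-state game: reward[s][a][b] is the row player's payoff,
# trans[s][a][b] is a probability distribution over next states.
EXAMPLE_REWARD = [[[1.0, -1.0], [-1.0, 1.0]],
                  [[0.5, -0.5], [0.0, 1.0]]]
EXAMPLE_TRANS = [[[[0.5, 0.5], [1.0, 0.0]], [[0.0, 1.0], [0.5, 0.5]]],
                 [[[1.0, 0.0], [0.5, 0.5]], [[0.5, 0.5], [0.0, 1.0]]]]

if __name__ == "__main__":
    strategies_row, strategies_col, values = simulate(
        EXAMPLE_REWARD, EXAMPLE_TRANS, delta=0.9, horizon=50.0)
    print("row strategies:", strategies_row)
    print("column strategies:", strategies_col)
    print("value beliefs:", values)
```

With a fixed Euler step the strategies only hover near a best-response fixed point rather than converging exactly, so this sketch should be read as a picture of the two timescales and not as evidence for the convergence results stated in the abstract.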


Bibliographic record

Source
Journal of Economic Theory, 2020, No. 9, pp. 105095.1-105095.31 (31 pages)
Authors
Leslie David S.; Perkins Steven; Xu Zibo
Affiliations

Univ Lancaster Dept Math & Stat Lancaster England;

PwC Bristol Avon England;

SUTD Engn Syst & Design Singapore Singapore;

Indexing information
Original format: PDF
Language: English
Keywords
Stochastic games; Best-response dynamics; Zero-sum games; Convergence

