Reinforcement Learning to Play an Optimal Nash Equilibrium in Team Markov Games

Abstract

Multiagent learning is a key problem in AI. In the presence of multiple Nash equilibria, even agents with non-conflicting interests may not be able to learn an optimal coordination policy. The problem is exacerbated if the agents do not know the game and independently receive noisy payoffs. So, multiagent reinforcement learning involves two interrelated problems: identifying the game and learning to play. In this paper, we present optimal adaptive learning, the first algorithm that converges to an optimal Nash equilibrium with probability 1 in any team Markov game. We provide a convergence proof, and show that the algorithm's parameters are easy to set to meet the convergence conditions.
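The two interrelated problems the abstract names can be illustrated on a single-state team game. The following is a minimal sketch, not the paper's optimal adaptive learning algorithm: it uses a hypothetical 2x2 common-payoff matrix with two optimal Nash equilibria, estimates the unknown payoffs from noisy samples (identifying the game), and then has both agents apply the same deterministic tie-breaking rule so they coordinate on one optimal equilibrium (learning to play).

```python
import random

# Hypothetical team matrix game: both agents receive the SAME payoff.
# Joint actions (0,0) and (1,1) are both optimal Nash equilibria (payoff 10),
# so independent learners risk miscoordinating on (0,1) or (1,0) (payoff 0).
TRUE_PAYOFF = {(0, 0): 10.0, (1, 1): 10.0, (0, 1): 0.0, (1, 0): 0.0}

def noisy_payoff(joint):
    # Agents never see TRUE_PAYOFF; they only observe noise-corrupted samples.
    return TRUE_PAYOFF[joint] + random.gauss(0, 1.0)

random.seed(0)

# Problem 1: identify the game -- running average of payoffs per joint action.
estimates = {a: 0.0 for a in TRUE_PAYOFF}
counts = {a: 0 for a in TRUE_PAYOFF}
for _ in range(2000):
    joint = (random.randint(0, 1), random.randint(0, 1))  # exploration
    counts[joint] += 1
    estimates[joint] += (noisy_payoff(joint) - estimates[joint]) / counts[joint]

# Problem 2: learn to play -- among the estimated-optimal joint actions,
# both agents independently apply the SAME deterministic rule
# (lexicographic order), so they select the same equilibrium.
best = max(estimates.values())
optimal = sorted(a for a, v in estimates.items() if v > best - 0.5)
agreed = optimal[0]
print(agreed)
```

The key design point mirrored from the abstract is that equilibrium selection, not equilibrium existence, is the obstacle: once the game is identified to sufficient accuracy, a shared ordering over joint actions resolves the tie among optimal equilibria without any communication.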
