AAAI Workshop on Multiagent Learning

Multiagent Q-Learning: Preliminary Study on Dominance between the Nash and Stackelberg Equilibriums

Abstract

Some game-theoretic approaches to multiagent reinforcement learning in self-play, i.e., when all agents use the same algorithm to choose their actions, employ equilibriums, such as the Nash equilibrium, to compute the agents' policies. These approaches have been applied only to simple examples. In this paper, we present an extended version of Nash Q-Learning that uses the Stackelberg equilibrium to address a wider range of games than Nash Q-Learning alone. We show that mixing the Nash and Stackelberg equilibriums can lead to better rewards not only in static games but also in stochastic games. Moreover, we apply the algorithm to a real-world example, the automated vehicle coordination problem.
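As a rough illustration of the idea summarized above (not the paper's exact algorithm), the sketch below computes a pure-strategy Stackelberg point of a two-player matrix game and uses the leader's value at that point as the bootstrap target in a tabular Q-learning backup. The table layout, parameter names, and payoff numbers are assumptions made here for illustration only.

```python
import numpy as np

def stackelberg_pure(leader_payoff, follower_payoff):
    """Pure-strategy Stackelberg point of a two-player matrix game.

    leader_payoff[i, j] / follower_payoff[i, j] are the payoffs when the
    leader plays row i and the follower plays column j.  The leader commits
    to the row that maximises its own payoff once the follower best-responds.
    """
    best = (-np.inf, None, None)
    for i in range(leader_payoff.shape[0]):
        j = int(np.argmax(follower_payoff[i]))   # follower's best response to row i
        if leader_payoff[i, j] > best[0]:
            best = (leader_payoff[i, j], i, j)
    return best[1], best[2]


def stackelberg_q_update(Q_leader, Q_follower, s, a_l, a_f, r_l, s_next,
                         alpha=0.1, gamma=0.9):
    """One tabular backup for the leader: bootstrap on the leader's value at
    the Stackelberg point of the next-state stage game (defined by the
    current Q tables), rather than a max over the leader's own actions."""
    i, j = stackelberg_pure(Q_leader[s_next], Q_follower[s_next])
    target = r_l + gamma * Q_leader[s_next][i, j]
    Q_leader[s][a_l, a_f] += alpha * (target - Q_leader[s][a_l, a_f])


# Illustrative static game (made-up payoffs): the committed leader steers
# play towards the (row 0, column 0) outcome.
L = np.array([[2.0, 0.0],
              [0.0, 1.0]])
F = np.array([[1.0, 0.0],
              [0.0, 2.0]])
print(stackelberg_pure(L, F))   # -> (0, 0)
```

In this sketch `Q_leader` and `Q_follower` are dictionaries mapping each state to a joint-action payoff matrix; swapping the bootstrap value between a Nash and a Stackelberg solution of that stage game is what "mixing" the two equilibriums refers to in the abstract.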
