The resilience of cooperation in a Dilemma game played by reinforcement learning agents

机译：强化学习代理人在困境游戏中的合作弹性

获取原文

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

This work discusses what an (independent) reinforcement learning agent can do in a multiagent environment. In particular, we consider a stateless Q-learning agent in a Prisoner's Dilemma (PD) game. Although it had been shown in the literature that stateless, independent Q-learning agents had been difficult to cooperate with each other in an iterated PD (IPD) game, we gave a condition of PD payoffs and Q-learning parameters that helps the agents cooperate with each other. Based on the condition, we also discussed the ratio of mutual cooperation happening in IPD games. It supposed that mutual cooperation was fragile, i.e., one misfortune defection would have the agents slide down the spiral of mutual defection. However, it is not always correct. Mutual cooperation will reinforce itself and thus it will be robust and resilient. Hence, this work analytically derives how long a series of mutual cooperation continues once it happened while considering the resilience. It gives us further comprehension of the process of reinforcement learning in IPD games.

机译：这项工作讨论了（独立的）强化学习代理在多主体环境中可以做什么。特别是，我们在囚徒困境（PD）游戏中考虑了无状态Q学习代理。尽管在文献中已经表明，无状态，独立的Q学习代理在迭代PD（IPD）博弈中难以彼此协作，但我们给出了PD收益和Q学习参数的条件，该条件可以帮助代理进行协作彼此。在此基础上，我们还讨论了IPD游戏中相互合作发生的比例。它认为相互合作是脆弱的，即，一次不幸的叛逃会使特工们沿着相互叛逃的螺旋式下滑。但是，它并不总是正确的。相互合作将加强自身，因此将是强大和有弹性的。因此，这项工作从分析的角度得出了一系列相互合作一旦发生的持续时间，同时考虑了弹性。它使我们对IPD游戏中强化学习的过程有了进一步的了解。

著录项

来源
《IEEE International Conference on Agents》|2017年|33-39|共7页
会议地点
作者
Koichi Moriyama; Kaori Nakase; Atsuko Mutoh; Nobuhiro Inuzuka;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Games; Learning (artificial intelligence); Resilience; Spirals; Robustness; Game theory; Conferences;

机译：游戏;学习（人工智能）;弹性;螺旋形;健壮性;博弈论;会议;

相似文献

外文文献
中文文献
专利

1. Performance Study of Minimax and Reinforcement Learning Agents Playing the Turn-based Game Iwoki [J] . Videgain Santiago, Garcia Sanchez Pablo Applied Artificial Intelligence . 2021,第9a11期

机译：MIMIMAX和加强学习代理的绩效研究播放基于转向的游戏IWOKI
2. Coevolutionary, coexisting learning and teaching agents model for prisoner's dilemma games enhancing cooperation with assortative heterogeneous networks [J] . Tanimoto J. Physica, A. Statistical mechanics and its applications . 2013,第13期

机译：囚徒困境游戏的共进化，共存的学与教模型，增强了与各种异构网络的合作
3. Evolution of cooperation in the snowdrift game among mobile players with random-pairing and reinforcement learning [J] . Jia N., Ma S. Physica, A. Statistical mechanics and its applications . 2013,第22期

机译：具有随机配对和强化学习的移动玩家之间在雪堆游戏中合作的演变
4. The resilience of cooperation in a Dilemma game played by reinforcement learning agents [C] . Koichi Moriyama, Kaori Nakase, Atsuko Mutoh, IEEE International Conference on Agents . 2017

机译：在加固学习代理人举行的困境游戏中的合作抵御能力
5. Collective learning and cooperation between intelligent software agents: A study of artificial personality and behavior in autonomous agents playing the infinitely repeated prisoner's dilemma game. [D] . Shebalin, Paul Valentine. 1997

机译：智能软件代理之间的集体学习与合作：研究在玩无限次囚徒困境游戏中的自治代理中人为的人格和行为。
6. Cooperation in rats playing the iterated Prisoner’s Dilemma game [O] . Ruth I. Wood, Jessica Y. Kim, Grace R. Li -1

机译：在反复囚徒困境游戏中的老鼠合作
7. Investigation into the effect of social learning in reinforcement learning board game playing agents [O] . Marivate Vukosi Ntsakisi 2009

机译：社会学习在强化学习型棋盘游戏代理商中的作用调查
8. Cooperation and Coordination Between Fuzzy Reinforcement Learning Agents in Continuous State Partially Observable Markov Decision Processes [R] . Berenji, Hamid R., Vengerov, David 1999

机译：连续状态部分可观测马尔可夫决策过程中模糊强化学习agent的协作与协调

The resilience of cooperation in a Dilemma game played by reinforcement learning agents

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅