Oscillatory evolution of collective behavior in evolutionary games played with reinforcement learning

首页> 外文期刊>Nonlinear dynamics >Oscillatory evolution of collective behavior in evolutionary games played with reinforcement learning

【24h】

Oscillatory evolution of collective behavior in evolutionary games played with reinforcement learning

机译：加固学习中进化游戏中集体行为的振动演变

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Large-scale cooperation underpins the evolution of ecosystems and the human society, and the collective behaviors by self-organization of multi-agent systems are the key for understanding. As artificial intelligence (AI) prevails in almost all branches of science, it would be of great interest to see what new insights of collective behaviors could be obtained from a multi-agent AI system. Here, we introduce a typical reinforcement learning (RL) algorithm-Q-learning into evolutionary game dynamics, where agents pursue optimal action on the basis of the introspectiveness rather than the outward manner such as the birth-death or imitation processes in the traditional evolutionary game (EG). We investigate the cooperation prevalence numerically for a general 2x2 game setting. We find that the cooperation prevalence in the multi-agent AI is unexpectedly of equal level as in the traditional EG in most cases. However, in the snowdrift games with RL, we reveal that explosive cooperation appears in the form of periodic oscillation, and we study the impact of the payoff structure on its emergence. Finally, we show that the periodic oscillation can also be observed in some other EGs with the RL algorithm, such as the rock-paper-scissors game. Our results offer a reference point to understand the emergence of cooperation and oscillatory behaviors in nature and society from AI's perspective.

机译：大规模合作支持生态系统和人类社会的演变，以及通过组织多助手系统的集体行为是理解的关键。随着人工智能（AI）几乎所有的科学分支机构，都会有望看出可以从多代理AI系统获得集体行为的新见解。在这里，我们将典型的强化学习（RL）算法 - Q学习进入进化游戏动态，其中代理在内省的基础上追求最佳动作，而不是传统进化中的出生死亡或仿制过程游戏（例如）。我们对一般的2x2游戏设置进行了数控进行了普遍存在的普遍性。我们发现，在大多数情况下，多代理AI中的合作普遍性在于传统的平等水平。然而，在与RL的雪雨游戏中，我们揭示了爆炸性合作以周期性振荡的形式出现，我们研究了收益结构对其出现的影响。最后，我们表明，也可以用R1算法在一些其他EGS中观察周期性振荡，例如岩纸剪刀游戏。我们的结果提供了一个参考点，了解自然和社会的合作和振荡行为的出现。

著录项

来源
《Nonlinear dynamics》 |2020年第4期|共12页
作者

展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类动力学;
关键词
Self-organization; Artificial intelligence; Evolutionary games; Reinforcement learning; Collective behaviors; Oscillation; Explosive events;

机译：自我组织;人工智能;进化游戏;加强学习;集体行为;振荡;爆炸事件;

相似文献

外文文献
中文文献
专利

1. Oscillatory evolution of collective behavior in evolutionary games played with reinforcement learning [J] . Nonlinear dynamics . 2020,第4期

机译：加固学习中进化游戏中集体行为的振动演变
2. Evolution of cooperation in the snowdrift game among mobile players with random-pairing and reinforcement learning [J] . Jia N., Ma S. Physica, A. Statistical mechanics and its applications . 2013,第22期

机译：具有随机配对和强化学习的移动玩家之间在雪堆游戏中合作的演变
3. Game Theory-Based Control System Algorithms with Real-Time Reinforcement Learning: How to Solve Multiplayer Games Online [J] . Kyriakos G. Vamvoudakis, Hamidreza Modares, Bahare Kiumarsi, Control Systems, IEEE . 2017,第1期

机译：实时强化学习的基于博弈论的控制系统算法：如何在线解决多人游戏
4. Behavior learning and evolution of collective autonomous mobile robots based on reinforcement learning and distributed genetic algorithms [C] . Hyo-Byung Jun, Kwee-Bo Sim Robot and Human Communication, 1997. RO-MAN '97. Proceedings., 6th IEEE International Workshop on . 1997

机译：基于强化学习和分布式遗传算法的集体自主移动机器人行为学习与演化
5. Collective learning and cooperation between intelligent software agents: A study of artificial personality and behavior in autonomous agents playing the infinitely repeated prisoner's dilemma game. [D] . Shebalin, Paul Valentine. 1997

机译：智能软件代理之间的集体学习与合作：研究在玩无限次囚徒困境游戏中的自治代理中人为的人格和行为。
6. Corrigendum: Language Learning Enhanced by Massive Multiple Online Role-Playing Games (MMORPGs) and the Underlying Behavioral and Neural Mechanisms [O] . Yongjun Zhang, Hongwen Song, Xiaoming Liu, 2019

机译：更正：大规模的多个在线角色扮演游戏（MMORPG）增强的语言学习以及潜在的行为和神经机制
7. Generating Behavior-Diverse Game AIs with Evolutionary Multi-Objective Deep Reinforcement Learning [O] . Ruimin Shen, Yan Zheng, Jianye Hao, 2020

机译：具有进化多目标深度加强学习的生成行为多样化的游戏AIS
8. Predicting Pilot Behavior in Medium Scale Scenarios Using Game Theory and Reinforcement Learning. [R] . Yildiz, Y., Agogino, A., Brat, G. 2013

机译：利用博弈论和强化学习预测中等规模情景中的飞行员行为。

Oscillatory evolution of collective behavior in evolutionary games played with reinforcement learning

摘要

著录项

相似文献

相关主题

期刊订阅