首页> 外文期刊>Nonlinear dynamics >Oscillatory evolution of collective behavior in evolutionary games played with reinforcement learning
【24h】

Oscillatory evolution of collective behavior in evolutionary games played with reinforcement learning

机译:加固学习中进化游戏中集体行为的振动演变

获取原文
获取原文并翻译 | 示例
           

摘要

Large-scale cooperation underpins the evolution of ecosystems and the human society, and the collective behaviors by self-organization of multi-agent systems are the key for understanding. As artificial intelligence (AI) prevails in almost all branches of science, it would be of great interest to see what new insights of collective behaviors could be obtained from a multi-agent AI system. Here, we introduce a typical reinforcement learning (RL) algorithm-Q-learning into evolutionary game dynamics, where agents pursue optimal action on the basis of the introspectiveness rather than the outward manner such as the birth-death or imitation processes in the traditional evolutionary game (EG). We investigate the cooperation prevalence numerically for a general 2x2 game setting. We find that the cooperation prevalence in the multi-agent AI is unexpectedly of equal level as in the traditional EG in most cases. However, in the snowdrift games with RL, we reveal that explosive cooperation appears in the form of periodic oscillation, and we study the impact of the payoff structure on its emergence. Finally, we show that the periodic oscillation can also be observed in some other EGs with the RL algorithm, such as the rock-paper-scissors game. Our results offer a reference point to understand the emergence of cooperation and oscillatory behaviors in nature and society from AI's perspective.
机译:大规模合作支持生态系统和人类社会的演变,以及通过组织多助手系统的集体行为是理解的关键。随着人工智能(AI)几乎所有的科学分支机构,都会有望看出可以从多代理AI系统获得集体行为的新见解。在这里,我们将典型的强化学习(RL)算法 - Q学习进入进化游戏动态,其中代理在内省的基础上追求最佳动作,而不是传统进化中的出生死亡或仿制过程游戏(例如)。我们对一般的2x2游戏设置进行了数控进行了普遍存在的普遍性。我们发现,在大多数情况下,多代理AI中的合作普遍性在于传统的平等水平。然而,在与RL的雪雨游戏中,我们揭示了爆炸性合作以周期性振荡的形式出现,我们研究了收益结构对其出现的影响。最后,我们表明,也可以用R1算法在一些其他EGS中观察周期性振荡,例如岩纸剪刀游戏。我们的结果提供了一个参考点,了解自然和社会的合作和振荡行为的出现。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号