【24h】

Online Multiagent Learning against Memory Bounded Adversaries

机译:针对记忆障碍对手的在线多主体学习

获取原文

摘要

The traditional agenda in Multiagent Learning (MAL) has been to develop learners that guarantee convergence to an equilibrium in self-play or that converge to playing the best response against an opponent using one of a fixed set of known targeted strategies. This paper introduces an algorithm called Learn or Exploit for Adversary Induced Markov Decision Process (LoE-AIM) that targets optimality against any learning opponent that can be treated as a memory bounded adversary. LoE-AIM makes no prior assumptions about the opponent and is tailored to optimally exploit any adversary which induces a Markov decision process in the state space of joint histories. LoE-AIM either explores and gathers new information about the opponent or converges to the best response to the partially learned opponent strategy in repeated play. We further extend LoE-AIM to account for online repeated interactions against the same adversary with plays against other adversaries interleaved in between. LoE-AIM-repeated stores learned knowledge about an adversary, identifies the adversary in case of repeated interaction, and reuses the stored knowledge about the behavior of the adversary to enhance learning in the current epoch of play. LoE-AIM and LoE-AIM-repeated are fully implemented, with results demonstrating their superiority over other existing MAL algorithms.
机译:Multiagent学习(MAL)的传统议程是开发学习者,这些学习者可以使用一组固定的已知有针对性的策略中的一种,确保自己在自我比赛中趋于平衡,或者收敛为对对手做出最佳反应。本文介绍了一种称为“学习或利用攻击者诱导马尔可夫决策过程(LoE-AIM)”的算法,该算法将最优性针对任何可被视为记忆受限对手的学习对手。 LoE-AIM没有对对手做出任何先验假设,并且经过专门设计以最佳利用任何在联合历史状态空间中引发马尔可夫决策过程的对手。 LoE-AIM要么探索并收集有关对手的新信息,要么在重复比赛中收敛到对部分学习的对手策略的最佳响应。我们进一步扩展了LoE-AIM,以说明针对同一对手的在线重复互动,以及针对其间插入的其他对手的游戏。 LoE-AIM重复存储了有关对手的学习知识,在反复交互的情况下识别了对手,并重复使用了存储的有关对手行为的知识,以增强当前游戏时代的学习效果。完全执行了LoE-AIM和LoE-AIM重复,其结果证明了它们相对于其他现有MAL算法的优越性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号