【24h】

Learning against opponents with bounded memory

机译:与记忆力有限的对手学习

获取原文
获取原文并翻译 | 示例

摘要

Recently, a number of authors have proposed criteria for evaluating learning algorithms in multi-agent systems. While well-justified, each of these has generally given little attention to one of the main challenges of a multi-agent setting: the capability of the other agents to adapt and learn as well. We propose extending existing criteria to apply to a class of adaptive opponents with bounded memory. We then show an algorithm that prov-ably achieves an e-best response against this richer class of opponents while simultaneously guaranteeing a minimum payoff against any opponent and performing well in self-play. This new algorithm also demonstrates strong performance in empirical tests against a variety of opponents in a wide range of environments.
机译:最近,许多作者提出了评估多智能体系统中学习算法的标准。尽管有充分的理由,但通常每个人都很少注意多主体设置的主要挑战之一:其他主体也具有适应和学习的能力。我们建议扩展现有标准,以适用于具有有限记忆的一类适应性对手。然后,我们展示了一种算法,该算法可有效地针对这种较丰富的对手类别实现电子最佳响应,同时保证对任何对手的最低回报并在自打中表现良好。这种新算法还展示了在广泛环境中针对各种对手的经验测试中的强大性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号