Learning against opponents with bounded memory

Abstract

Recently, a number of authors have proposed criteria for evaluating learning algorithms in multi-agent systems. While well-justified, each of these has generally given little attention to one of the main challenges of a multi-agent setting: the capability of the other agents to adapt and learn as well. We propose extending existing criteria to apply to a class of adaptive opponents with bounded memory. We then show an algorithm that provably achieves an ε-best response against this richer class of opponents while simultaneously guaranteeing a minimum payoff against any opponent and performing well in self-play. This new algorithm also demonstrates strong performance in empirical tests against a variety of opponents in a wide range of environments.

Bibliographic record

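Source
International Joint Conference on Artificial Intelligence (IJCAI-05); July 30 to August 5, 2005; Edinburgh (GB) | 2005 | pp. 817-822 | 6 pages
Conference location: Edinburgh (GB)
Authors
Rob Powers; Yoav Shoham
Author affiliation

Computer Science Department, Stanford University, Stanford, CA 94305

Conference organizer
Original format: PDF
Language of text: English
Chinese Library Classification: Artificial intelligence theory
Keywords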

Similar literature

1. Fast Learning Requires Good Memory: A Time-Space Lower Bound for Parity Learning [J]. Raz Ran. Journal of the Association for Computing Machinery. 2019, No. 1
2. Fast Learning Requires Good Memory: A Time-Space Lower Bound for Parity Learning [J]. Ran Raz. Electronic Colloquium on Computational Complexity. 2016, No. 8
3. Learning about the opponent in automated bilateral negotiation: a comprehensive survey of opponent modeling techniques [J]. Baarslag Tim, Hendrikx Mark J. C., Hindriks Koen V. Autonomous Agents and Multi-Agent Systems. 2016, No. 5
4. Learning against opponents with bounded memory [C]. Rob Powers, Yoav Shoham. International Joint Conference on Artificial Intelligence. 2007
5. Reinforcement learning in stochastic games against bounded memory opponents [D]. Vrljicak, Tomislav. 2006
6. Opponent processes in visual memories: A model of attraction and repulsion in navigating insects’ mushroom bodies [O]. Florent Le Möel, Antoine Wystrach. 2020
7. Learning about the opponent in automated bilateral negotiation: a comprehensive survey of opponent modeling techniques [O]. Baarslag, Tim, Hendrikx, Mark J.C., Hindriks, Koen V. 2016
