RoboCup International Symposium

Learning Complementary Multiagent Behaviors: A Case Study


Abstract

As machine learning is applied to increasingly complex tasks, it is likely that the diverse challenges encountered can only be addressed by combining the strengths of different learning algorithms. We examine this aspect of learning through a case study grounded in the robot soccer context. The task we consider is Keepaway, a popular benchmark for multiagent reinforcement learning from the simulation soccer domain. Whereas previous successful results in Keepaway have limited learning to an isolated, infrequent decision that amounts to a turn-taking behavior (passing), we expand the agents' learning capability to include a much more ubiquitous action (moving without the ball, or getting open), such that at any given time, multiple agents are executing learned behaviors simultaneously. We introduce a policy search method for learning "GETOPEN" to complement the temporal difference learning approach employed for learning "PASS". Empirical results indicate that the learned GETOPEN policy matches the best hand-coded policy for this task, and outperforms the best policy found when PASS is learned. We demonstrate that PASS and GETOPEN can be learned simultaneously to realize tightly-coupled soccer team behavior.
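The abstract pairs two complementary learning methods: temporal-difference learning for the discrete PASS decision and direct policy search for the continuous GETOPEN behavior. The sketch below illustrates that pairing under stated assumptions: a Sarsa-style linear TD update (the approach commonly used for PASS in Keepaway, where 3v2 play is often encoded with 13 state features) alongside a simple (1+1) hill-climbing loop as a stand-in for policy search. All names, parameter values, and the toy evaluate function are assumptions for illustration, not the paper's actual implementation.

```python
import random

# Illustrative sketch only. The abstract contrasts TD learning (PASS)
# with policy search (GETOPEN); everything below is an assumed, minimal
# rendering of that contrast, not the paper's code.

ALPHA, GAMMA = 0.1, 0.95     # assumed step size and discount factor
NUM_FEATURES = 13            # 3v2 Keepaway commonly uses 13 state features
NUM_PASS_ACTIONS = 3         # hold ball, pass to teammate 1, pass to teammate 2

# --- PASS: Sarsa-style temporal-difference learning ---
# One linear weight vector per discrete pass action.
pass_weights = [[0.0] * NUM_FEATURES for _ in range(NUM_PASS_ACTIONS)]

def q_value(features, action):
    """Linear action-value estimate Q(s, a) = w_a . phi(s)."""
    return sum(w * f for w, f in zip(pass_weights[action], features))

def sarsa_update(features, action, reward, next_features, next_action):
    """Move Q(s, a) toward the TD target r + gamma * Q(s', a')."""
    target = reward + GAMMA * q_value(next_features, next_action)
    delta = target - q_value(features, action)
    for i, f in enumerate(features):
        pass_weights[action][i] += ALPHA * delta * f

# --- GETOPEN: direct policy search over policy parameters ---
def evaluate(params):
    """Toy stand-in for the Keepaway performance measure (average episode
    duration under a parameterized GETOPEN policy); a real evaluation
    would run simulated soccer episodes."""
    ideal = [1.0, -0.5, 0.25]
    return -sum((p - t) ** 2 for p, t in zip(params, ideal))

def hill_climb(params, iterations=200, sigma=0.1):
    """(1+1)-style search: keep a Gaussian perturbation if it scores better."""
    best = evaluate(params)
    for _ in range(iterations):
        candidate = [p + random.gauss(0.0, sigma) for p in params]
        score = evaluate(candidate)
        if score > best:
            params, best = candidate, score
    return params

if __name__ == "__main__":
    print(hill_climb([0.0, 0.0, 0.0]))
```

Running both learners concurrently, as the paper does, would interleave sarsa_update calls for the keeper with the ball and policy-search evaluations for the keepers getting open, which is what makes the team behavior tightly coupled.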
