...
首页> 外文期刊>Autonomous agents and multi-agent systems >Allocating training instances to learning agents for team formation
【24h】

Allocating training instances to learning agents for team formation

机译:将培训实例分配给学习代理以组建团队

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Agents can learn to improve their coordination with their teammates and increase team performance. There are finite training instances, where each training instance is an opportunity for the learning agents to improve their coordination. In this article, we focus on allocating training instances to learning agent pairs, i.e., pairs that improve coordination with each other, with the goal of team formation. Agents learn at different rates, and hence, the allocation of training instances affects the performance of the team formed. We build upon previous work on the Synergy Graph model, that is learned completely from data and represents agents' capabilities and compatibility in a multi-agent team. We formally define the learning agents team formation problem, and compare it with the multi-armed bandit problem. We consider learning agent pairs that improve linearly and geometrically, i.e., the marginal improvement decreases by a constant factor. We contribute algorithms that allocate the training instances, and compare against algorithms from the multi-armed bandit problem. In our simulations, we demonstrate that our algorithms perform similarly to the bandit algorithms in the linear case, and outperform them in the geometric case. Further, we apply our model and algorithms to a multi-agent foraging problem, thus demonstrating the efficacy of our algorithms in general multi-agent problems.
机译:座席可以学习改善与队友的协作并提高团队绩效。有有限的训练实例,其中每个训练实例都是学习代理改善其协调能力的机会。在本文中,我们专注于将训练实例分配给学习代理对,即以团队形成为目标的可增进彼此协调的学习对。代理以不同的速度学习,因此,培训实例的分配会影响组建团队的绩效。我们以之前在Synergy Graph模型上的工作为基础,该模型是完全从数据中学习的,代表了多Agent团队中Agent的能力和兼容性。我们正式定义了学习代理团队的组成问题,并将其与多武装匪徒问题进行了比较。我们考虑学习代理对在线性和几何上都有改进,即边际改进减少了一个常数。我们提供分配训练实例的算法,并与多臂匪徒问题中的算法进行比较。在我们的仿真中,我们证明了我们的算法在线性情况下的性能与强盗算法相似,并且在几何情况下的性能优于后者。此外,我们将模型和算法应用于多主体搜寻问题,从而证明了我们的算法在一般多主体问题中的功效。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号