Allocating training instances to learning agents for team formation

Liemhetcharat Somchaya; Veloso Manuela

首页> 外文期刊>Autonomous agents and multi-agent systems >Allocating training instances to learning agents for team formation

【24h】

Allocating training instances to learning agents for team formation

机译：将培训实例分配给学习代理以组建团队

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Agents can learn to improve their coordination with their teammates and increase team performance. There are finite training instances, where each training instance is an opportunity for the learning agents to improve their coordination. In this article, we focus on allocating training instances to learning agent pairs, i.e., pairs that improve coordination with each other, with the goal of team formation. Agents learn at different rates, and hence, the allocation of training instances affects the performance of the team formed. We build upon previous work on the Synergy Graph model, that is learned completely from data and represents agents' capabilities and compatibility in a multi-agent team. We formally define the learning agents team formation problem, and compare it with the multi-armed bandit problem. We consider learning agent pairs that improve linearly and geometrically, i.e., the marginal improvement decreases by a constant factor. We contribute algorithms that allocate the training instances, and compare against algorithms from the multi-armed bandit problem. In our simulations, we demonstrate that our algorithms perform similarly to the bandit algorithms in the linear case, and outperform them in the geometric case. Further, we apply our model and algorithms to a multi-agent foraging problem, thus demonstrating the efficacy of our algorithms in general multi-agent problems.

机译：座席可以学习改善与队友的协作并提高团队绩效。有有限的训练实例，其中每个训练实例都是学习代理改善其协调能力的机会。在本文中，我们专注于将训练实例分配给学习代理对，即以团队形成为目标的可增进彼此协调的学习对。代理以不同的速度学习，因此，培训实例的分配会影响组建团队的绩效。我们以之前在Synergy Graph模型上的工作为基础，该模型是完全从数据中学习的，代表了多Agent团队中Agent的能力和兼容性。我们正式定义了学习代理团队的组成问题，并将其与多武装匪徒问题进行了比较。我们考虑学习代理对在线性和几何上都有改进，即边际改进减少了一个常数。我们提供分配训练实例的算法，并与多臂匪徒问题中的算法进行比较。在我们的仿真中，我们证明了我们的算法在线性情况下的性能与强盗算法相似，并且在几何情况下的性能优于后者。此外，我们将模型和算法应用于多主体搜寻问题，从而证明了我们的算法在一般多主体问题中的功效。

著录项

来源
《Autonomous agents and multi-agent systems》 |2017年第4期|905-940|共36页
作者
Liemhetcharat Somchaya; Veloso Manuela;
展开▼
作者单位

ASTAR, Inst Infocomm Res, Singapore, Singapore|Uber Adv Technol Ctr, Pittsburgh, PA 15201 USA;

Carnegie Mellon Univ, Dept Comp Sci, Pittsburgh, PA 15213 USA;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Learning agent; Ad hoc agent; Multi-agent; Team formation;

机译：学习代理;特设代理;多代理;团队形成;

相似文献

外文文献
中文文献
专利

1. Team Training, Team Learning, Leadership and Psychology Safety: A Study of Team Training and Team Learning Behavior during a Swedish Military Staff Exercise [J] . Erik Hedlund, Johan ?sterberg Sociology Mind . 2013,第1期

机译：团队训练，团队学习，领导力和心理安全：瑞典军事参谋演习期间的团队训练和团队学习行为研究
2. Allocating structured tasks in heterogeneous agent teams [J] . Basegio Tulio L., Bordini Rafael H. Computational Intelligence . 2019,第1期

机译：在异构代理团队中分配结构化任务
3. Situated learning with co-operative agent simulations in team training [J] . M.W. Dobsona, M. Pengelly, J.-A. Sime Computers in Human Behavior . 2001,第5a6期

机译：在团队培训中通过协作式座席模拟进行情境学习
4. Goal Directed Design of Rewards and Training Features for Self-learning Agents in a Human-Autonomy-Teaming Environment [C] . Simon Schwerd, Sebastian Lindner, Axel Schulte International Conference on Intelligent Human Systems Integration . 2020

机译：目标针对人类自主组织环境中自学代理的奖励和培训特征的指导设计
5. Using cognitively inspired agents and information supply chains to anticipate and share information for decision making teams. [D] . Sun, Shuang. 2006

机译：使用受认知启发的代理和信息供应链来为决策团队预测和共享信息。
6. Discriminative Structure Learning of Bayesian Network Classifiers from Training Dataset and Testing Instance [O] . Limin Wang, Yang Liu, Musa Mammadov, 2019

机译：培训数据集和测试实例贝叶斯网络分类器的鉴别结构学习
7. Team Training, Team Learning, Leadership and Psychology Safety : A Study of Team Training and Team Learning Behavior during a Swedish Military Staff Exercise [O] . Hedlund, Erik, Österberg, Johan 2013

机译：团队训练，团队学习，领导力和心理安全：瑞典军事参谋演习期间的团队训练和团队学习行为研究

Allocating training instances to learning agents for team formation

摘要

著录项

相似文献

相关主题

期刊订阅