首页> 外文会议>Robot World Cup Soccer and Rescue Competitions and Conference >Learning the Sequential Coordinated Behavior of Teams from Observations
【24h】

Learning the Sequential Coordinated Behavior of Teams from Observations

机译:从观察中学习团队的顺序协调行为

获取原文

摘要

The area of agent modeling deals with the task of observing other agents and modeling their behavior, in order to predict their future behavior, coordinate with them, assist them, or counter their actions. Typically, agent modeling techniques assume the availability of a plan- or behavior-library, which encodes the full repertoire of expected observed behavior. However, recent applications areas of agent modeling raise challenges to the assumption of such a library, as agent modeling systems are increasingly used in open and/or adversarial settings, where the behavioral repertoire of the observed agents is unknown at design time. This paper focuses on the challenge of the unsupervised autonomous learning of the sequential behaviors of agents, from observations of their behavior. The techniques we present translate observations of the dynamic, complex, continuous multi-variate world state into a time-series of recognized atomic behaviors. This time-series is then analyzed to find repeating subsequences characterizing each team. We compare two alternative approaches to extracting such characteristic sequences, based on frequency counts and statistical dependencies. Our results indicate that both techniques are able to extract meaningful sequences, and do significantly better than random predictions. However, the statistical dependency approach is able to correctly reject sequences that are frequent, but are due to random co-occurrence of behaviors, rather than to a true sequential dependency between them.
机译:代理面积建模涉及观察其他代理和建模行为的任务,以预测其未来的行为,与他们协调,协助他们或抵制他们的行为。通常,代理建模技术假设计划或行为库的可用性,该库或行为库编码预期观察行为的完整reptoIre。然而,由于代理建模系统越来越多地用于打开和/或侵犯的环境,所以在设计时,观测到的药剂的行为曲目越来越多地用于对这种文库的假设提出挑战。本文侧重于无监督自主学习代理人的无监督自主学习的挑战,从他们的行为观察。我们将对动态,复杂,连续多变化世界状态的观察转化为一系列公认的原子行为的观察。然后分析此时间序列以查找特征每个团队的重复子程。我们比较基于频率计数和统计依赖性提取这种特征序列的两种替代方法。我们的结果表明,两种技术都能够提取有意义的序列,并且比随机预测显着更好。然而,统计依赖方法能够正确地拒绝频繁的序列,而是由于行为随机的共同发生,而不是它们之间的真实顺序依赖性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号