【24h】

Cost-Based Policy Mapping for Imitation

机译:基于成本的模仿策略映射

获取原文
获取原文并翻译 | 示例

摘要

Imitation represents a powerful approach for programming and autonomous learning in robot and computer systems. An important aspect of imitation is the mapping of observations to an executable control strategy. This is particularly important if the behavioral capabilities of the observed and imitating agent differ significantly. This paper presents an approach that addresses this problem by locally optimizing a cost function representing the deviation from the observed state sequence and the cost of the actions required to perform the imitation. The result are imitation strategies that can be performed by the imitating agent and that as closely as possible resemble the observations of the demonstrating agent. The performance of this approach is illustrated within the context of a simulated multi-agent environment.
机译:模仿代表了在机器人和计算机系统中进行编程和自主学习的强大方法。模仿的一个重要方面是将观察结果映射到可执行控制策略。如果所观察到的和模仿剂的行为能力明显不同,这一点尤其重要。本文提出了一种通过局部优化成本函数来解决此问题的方法,该成本函数表示与观察到的状态序列的偏差以及执行模仿所需的操作成本。结果是可以由模仿剂执行的模仿策略,并且该模仿策略尽可能类似于对展示剂的观察。在模拟的多主体环境中说明了这种方法的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号