首页> 外文会议>International Joint Conference on Neural Networks >Augmented Behavioral Cloning from Observation
【24h】

Augmented Behavioral Cloning from Observation

机译:通过观察增强行为克隆

获取原文

摘要

Imitation from observation is a computational technique that teaches an agent on how to mimic the behavior of an expert by observing only the sequence of states from the expert demonstrations. Recent approaches learn the inverse dynamics of the environment and an imitation policy by interleaving epochs of both models while changing the demonstration data. However, such approaches often get stuck into sub-optimal solutions that are distant from the expert, limiting their imitation effectiveness. We address this problem with a novel approach that overcomes the problem of reaching bad local minima by exploring: (i) a self-attention mechanism that better captures global features of the states; and (ii) a sampling strategy that regulates the observations that are used for learning. We show empirically that our approach outperforms the state-of-the-art approaches in four different environments by a large margin.
机译:观察模仿是一种计算技术,可通过仅观察专家演示中的状态序列来教给代理人如何模仿专家的行为。最近的方法是通过在改变演示数据的同时交织两个模型的历元来学习环境的逆向动力学和模仿策略。但是,这些方法经常陷入与专家相距甚远的次优解决方案中,从而限制了其模仿效果。我们通过一种新颖的方法解决了这个问题,该方法通过探索以下方面克服了达到最低的局部最小值的问题:(i)一种能够更好地捕捉国家全局特征的自我关注机制; (ii)规范用于学习的观察的抽样策略。我们凭经验表明,在四个不同的环境中,我们的方法明显优于最新方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号