【24h】

Adversarial behavioral cloning*

机译:对抗性行为克隆*

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

Imitation learning has been widely applied for autonomous robotics control. A popular IL approach is apprenticeship learning (AL) which alternates RL and inverse reinforcement learning (IRL). AL fundamentally requires a large number of environment interactions and thus takes a long time for training. We believe that IL algorithms would be more applicable to real-world problems if the number of interactions could be reduced as close to zero as possible. In this paper, we propose an IL algorithm which we call Adversarial Behavioral Cloning (ABC). Experimental results on MuJoCo physics simulator show that our algorithm achieves competitive results with a state-of-the-art AL algorithm, namely generative adversarial imitation learning (GAIL), even without any environment interactions.
机译:模仿学习已广泛应用于自主机器人控制。一种流行的 IL 方法是学徒学习 (AL),它交替使用 RL 和逆强化学习 (IRL)。从根本上说,AL需要大量的环境交互,因此需要很长时间进行训练。我们相信,如果交互次数可以尽可能接近于零,IL算法将更适用于现实世界的问题。在本文中,我们提出了一种IL算法,我们称之为对抗性行为克隆(ABC)。在MuJoCo物理模拟器上的实验结果表明,即使没有任何环境交互,我们的算法也能与最先进的AL算法,即生成对抗模仿学习(GAIL)一起取得竞争结果。

著录项

获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号