首页> 外文期刊>JMLR: Workshop and Conference Proceedings >Sample-Efficient Imitation Learning via Generative Adversarial Nets
【24h】

Sample-Efficient Imitation Learning via Generative Adversarial Nets

机译:通过生成对抗网络进行样本有效的模仿学习

获取原文
       

摘要

GAIL is a recent successful imitation learning architecture that exploits the adversarial training procedure introduced in GANs. Albeit successful at generating behaviours similar to those demonstrated to the agent, GAIL suffers from a high sample complexity in the number of interactions it has to carry out in the environment in order to achieve satisfactory performance. We dramatically shrink the amount of interactions with the environment necessary to learn well-behaved imitation policies, by up to several orders of magnitude. Our framework, operating in the model-free regime, exhibits a significant increase in sample-efficiency over previous methods by simultaneously a) learning a self-tuned adversarially-trained surrogate reward and b) leveraging an off-policy actor-critic architecture. We show that our approach is simple to implement and that the learned agents remain remarkably stable, as shown in our experiments that span a variety of continuous control tasks. Video visualisations available at: url{https://youtu.be/-nCsqUJnRKU}.
机译:GAIL是最近成功的模仿学习体系结构,它利用了GAN中引入的对抗训练程序。尽管可以成功地生成与代理所演示的行为类似的行为,但GAIL在环境中要获得令人满意的性能而必须进行的交互次数却具有较高的样本复杂性。我们将与行为良好的模仿策略学习所必需的环境交互作用的数量大大减少了几个数量级。我们的框架在无模型的体制下运作,通过同时(a)学习经过自我调整的对抗训练的替代奖励,以及(b)利用非政策性参与者批评体系,在样本效率方面比以前的方法有了显着提高。我们证明了我们的方法易于实现,并且学习到的主体仍然非常稳定,如我们的实验所显示的,该实验涵盖了各种连续控制任务。视频可视化效果位于: url {https://youtu.be/-nCsqUJnRKU}。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号