Sample-Efficient Imitation Learning via Generative Adversarial Nets

Lionel Blondé; Alexandros Kalousis

首页> 外文期刊>JMLR: Workshop and Conference Proceedings >Sample-Efficient Imitation Learning via Generative Adversarial Nets

【24h】

Sample-Efficient Imitation Learning via Generative Adversarial Nets

机译：通过生成对抗网络进行样本有效的模仿学习

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

GAIL is a recent successful imitation learning architecture that exploits the adversarial training procedure introduced in GANs. Albeit successful at generating behaviours similar to those demonstrated to the agent, GAIL suffers from a high sample complexity in the number of interactions it has to carry out in the environment in order to achieve satisfactory performance. We dramatically shrink the amount of interactions with the environment necessary to learn well-behaved imitation policies, by up to several orders of magnitude. Our framework, operating in the model-free regime, exhibits a significant increase in sample-efficiency over previous methods by simultaneously a) learning a self-tuned adversarially-trained surrogate reward and b) leveraging an off-policy actor-critic architecture. We show that our approach is simple to implement and that the learned agents remain remarkably stable, as shown in our experiments that span a variety of continuous control tasks. Video visualisations available at: url{https://youtu.be/-nCsqUJnRKU}.

机译：GAIL是最近成功的模仿学习体系结构，它利用了GAN中引入的对抗训练程序。尽管可以成功地生成与代理所演示的行为类似的行为，但GAIL在环境中要获得令人满意的性能而必须进行的交互次数却具有较高的样本复杂性。我们将与行为良好的模仿策略学习所必需的环境交互作用的数量大大减少了几个数量级。我们的框架在无模型的体制下运作，通过同时（a）学习经过自我调整的对抗训练的替代奖励，以及（b）利用非政策性参与者批评体系，在样本效率方面比以前的方法有了显着提高。我们证明了我们的方法易于实现，并且学习到的主体仍然非常稳定，如我们的实验所显示的，该实验涵盖了各种连续控制任务。视频可视化效果位于： url {https://youtu.be/-nCsqUJnRKU}。

著录项

来源
《JMLR: Workshop and Conference Proceedings》 |2018年第2009期|共11页
作者
Lionel Blondé; Alexandros Kalousis;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类人工智能理论;
关键词

相似文献

外文文献
中文文献
专利

1. Sample-Efficient Imitation Learning via Generative Adversarial Nets [J] . Lionel Blondé, Alexandros Kalousis JMLR: Workshop and Conference Proceedings . 2018,第2010期

机译：通过生成对抗网络进行样本有效的模仿学习
2. Saliency Prediction on Omnidirectional Image With Generative Adversarial Imitation Learning [J] . Mai Xu, Li Yang, Xiaoming Tao, IEEE Transactions on Image Processing . 2021,第1期

机译：具有生成对抗性模仿学习的全向图像的显着性预测
3. TrajGAIL: Generating urban vehicle trajectories using generative adversarial imitation learning [J] . Choi Seongjin, Kim Jiwon, Yeo Hwasoo Transportation research . 2021,第Jula期

机译：Trajgail：使用生成的对抗性模仿学习产生城市车辆轨迹
4. Generative Adversarial Imitation Learning with Neural Network Parameterization: Global Optimality and Convergence Rate [C] . Yufeng Zhang, Qi Cai, Zhuoran Yang, International Conference on Machine Learning . 2021

机译：具有神经网络参数化的生成对抗性模仿学习：全球最优性和收敛速度
5. Stacked Generative Adversarial Networks for Learning Additional Features of Image Segmentation Maps [D] . Burke, Matthew. 2020

机译：用于学习图像分割图的其他特征的堆叠生成的对抗网络
6. Generative Adversarial Phonology: Modeling Unsupervised Phonetic and Phonological Learning With Neural Networks [O] . Gašper Beguš 2020

机译：生成对抗语音学：用神经网络建模无监督的语音和语音学习
7. Joint Entity and Event Extraction with Generative Adversarial Imitation Learning [O] . Tongtao Zhang, Heng Ji, Avirup Sil 2019

机译：具有生成对抗性模仿学习的联合实体和事件提取

Sample-Efficient Imitation Learning via Generative Adversarial Nets

摘要

著录项

相似文献

相关主题

期刊订阅