Risk-Sensitive Generative Adversarial Imitation Learning

Jonathan Lacotte; Mohammad Ghavamzadeh; Yinlam Chow; Marco Pavone

首页> 外文期刊>JMLR: Workshop and Conference Proceedings >Risk-Sensitive Generative Adversarial Imitation Learning

【24h】

Risk-Sensitive Generative Adversarial Imitation Learning

机译：风险敏感的对抗式模仿学习

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

We study risk-sensitive imitation learning where the agent’s goal is to perform at least as well as the expert in terms of a risk profile. We first formulate our risk-sensitive imitation learning setting. We consider the generative adversarial approach to imitation learning (GAIL) and derive an optimization problem for our formulation, which we call it risk- sensitive GAIL (RS-GAIL). We then derive two different versions of our RS-GAIL optimization problem that aim at matching the risk profiles of the agent and the expert w.r.t. Jensen-Shannon (JS) divergence and Wasserstein distance, and develop risk-sensitive generative adversarial imitation learning algorithms based on these optimization problems. We evaluate the performance of our algorithms and compare them with GAIL and the risk-averse imitation learning (RAIL) algorithms in two MuJoCo and two OpenAI classical control tasks.

机译：我们研究风险敏感的模仿学习，其中代理商的目标是在风险方面至少表现出与专家相同的水平。我们首先制定风险敏感的模仿学习环境。我们考虑了模仿学习的生成对抗方法（GAIL），并为我们的公式推导了一个优化问题，我们称其为风险敏感型GAIL（RS-GAIL）。然后，我们得出RS-GAIL优化问题的两个不同版本，旨在匹配代理商和专家的风险状况。 Jensen-Shannon（JS）散度和Wasserstein距离，并基于这些优化问题开发风险敏感的生成对抗式模仿学习算法。我们评估了我们算法的性能，并将其与GAIL和风险厌恶模仿学习（RAIL）算法在两个MuJoCo和两个OpenAI经典控制任务中进行了比较。

著录项

来源
《JMLR: Workshop and Conference Proceedings》 |2018年第12期|共10页
作者
Jonathan Lacotte; Mohammad Ghavamzadeh; Yinlam Chow; Marco Pavone;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类人工智能理论;
关键词

相似文献

外文文献
中文文献
专利

1. Saliency Prediction on Omnidirectional Image With Generative Adversarial Imitation Learning [J] . Mai Xu, Li Yang, Xiaoming Tao, IEEE Transactions on Image Processing . 2021,第1期

机译：具有生成对抗性模仿学习的全向图像的显着性预测
2. TrajGAIL: Generating urban vehicle trajectories using generative adversarial imitation learning [J] . Choi Seongjin, Kim Jiwon, Yeo Hwasoo Transportation research . 2021,第Jula期

机译：Trajgail：使用生成的对抗性模仿学习产生城市车辆轨迹
3. Deterministic generative adversarial imitation learning [J] . Neurocomputing . 2020,第May7期

机译：确定性生成对抗模仿学习
4. Learning Food-arrangement Policies from Raw Images with Generative Adversarial Imitation Learning [C] . Junki Matsuoka, Yoshihisa Tsurumine, Yuhwan Kwon, International Conference on Ubiquitous Robots . 2020

机译：通过原始对抗性模仿学习从原始图像中学习食物安排政策
5. Stacked Generative Adversarial Networks for Learning Additional Features of Image Segmentation Maps [D] . Burke, Matthew. 2020

机译：用于学习图像分割图的其他特征的堆叠生成的对抗网络
6. Generative Adversarial Learning of Protein Tertiary Structures [O] . Taseef Rahman, Yuanqi Du, Liang Zhao, 2021

机译：蛋白质三级结构的生成逆境学习
7. Joint Entity and Event Extraction with Generative Adversarial Imitation Learning [O] . Tongtao Zhang, Heng Ji, Avirup Sil 2019

机译：具有生成对抗性模仿学习的联合实体和事件提取

Risk-Sensitive Generative Adversarial Imitation Learning

摘要

著录项

相似文献

相关主题

期刊订阅