首页> 外文期刊>JMLR: Workshop and Conference Proceedings >Risk-Sensitive Generative Adversarial Imitation Learning
【24h】

Risk-Sensitive Generative Adversarial Imitation Learning

机译:风险敏感的对抗式模仿学习

获取原文
           

摘要

We study risk-sensitive imitation learning where the agent’s goal is to perform at least as well as the expert in terms of a risk profile. We first formulate our risk-sensitive imitation learning setting. We consider the generative adversarial approach to imitation learning (GAIL) and derive an optimization problem for our formulation, which we call it risk- sensitive GAIL (RS-GAIL). We then derive two different versions of our RS-GAIL optimization problem that aim at matching the risk profiles of the agent and the expert w.r.t. Jensen-Shannon (JS) divergence and Wasserstein distance, and develop risk-sensitive generative adversarial imitation learning algorithms based on these optimization problems. We evaluate the performance of our algorithms and compare them with GAIL and the risk-averse imitation learning (RAIL) algorithms in two MuJoCo and two OpenAI classical control tasks.
机译:我们研究风险敏感的模仿学习,其中代理商的目标是在风险方面至少表现出与专家相同的水平。我们首先制定风险敏感的模仿学习环境。我们考虑了模仿学习的生成对抗方法(GAIL),并为我们的公式推导了一个优化问题,我们称其为风险敏感型GAIL(RS-GAIL)。然后,我们得出RS-GAIL优化问题的两个不同版本,旨在匹配代理商和专家的风险状况。 Jensen-Shannon(JS)散度和Wasserstein距离,并基于这些优化问题开发风险敏感的生成对抗式模仿学习算法。我们评估了我们算法的性能,并将其与GAIL和风险厌恶模仿学习(RAIL)算法在两个MuJoCo和两个OpenAI经典控制任务中进行了比较。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号