Uncertainty-Aware Data Aggregation for Deep Imitation Learning

机译：用于深度模仿学习的不确定性数据聚合

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Estimating statistical uncertainties allows autonomous agents to communicate their confidence during task execution and is important for applications in safety-critical domains such as autonomous driving. In this work, we present the uncertainty-aware imitation learning (UAIL) algorithm for improving end-to-end control systems via data aggregation. UAIL applies Monte Carlo Dropout to estimate uncertainty in the control output of end-to-end systems, using states where it is uncertain to selectively acquire new training data. In contrast to prior data aggregation algorithms that force human experts to visit sub-optimal states at random, UAIL can anticipate its own mistakes and switch control to the expert in order to prevent visiting a series of sub-optimal states. Our experimental results from simulated driving tasks demonstrate that our proposed uncertainty estimation method can be leveraged to reliably predict infractions. Our analysis shows that UAIL outperforms existing data aggregation algorithms on a series of benchmark tasks.

机译：估计统计不确定性使自治代理可以在任务执行期间传达其信心，这对于诸如安全驾驶等安全关键领域的应用非常重要。在这项工作中，我们提出了不确定性感知模仿学习（UAIL）算法，用于通过数据聚合来改善端到端控制系统。 UAIL使用不确定是否有选择地获取新训练数据的状态，使用Monte Carlo Dropout来估计端到端系统的控制输出中的不确定性。与迫使人类专家随机访问次优状态的现有数据聚合算法相反，UAIL可以预见自己的错误，并将控制权切换给专家，以防止访问一系列次优状态。我们从模拟驾驶任务获得的实验结果表明，我们提出的不确定性估计方法可用于可靠地预测违规情况。我们的分析表明，在一系列基准测试任务上，UAIL的性能优于现有的数据聚合算法。

著录项

来源
《International Conference on Robotics and Automation》|2019年|761-767|共7页
会议地点 Montreal(CA)
作者
Yuchen Cui; David Isele; Scott Niekum; Kikuo Fujimura;
展开▼
作者单位

University of Texas at Austin Austin TX 78712 USA;

Honda Research Institute USA CA 94043 USA;

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Uncertainty; Data aggregation; Task analysis; Switches; Estimation; Data models;

机译：不确定;数据汇总；任务分析；开关；估计；资料模型;

相似文献

外文文献
中文文献
专利

1. Deep imitation reinforcement learning with expert demonstration data [J] . Menglong Yi, Xin Xu, Yujun Zeng, The Journal of Engineering . 2018,第16期

机译：通过专家演示数据进行深度模仿强化学习
2. An uncertainty-aware deep reinforcement learning framework for residential air conditioning energy management [J] . Lork Clement, Li Wen-Tai, Qin Yan, Applied Energy . 2020,第Octa15期

机译：住宅空调能源管理的不确定感知深增强学习框架
3. DR vertical bar GRADUATE: Uncertainty-aware deep learning-based diabetic retinopathy grading in eye fundus images [J] . Medical image analysis . 2020,第期

机译：垂直条毕业生：眼底图像中的不确定感知深度学习的糖尿病视网膜病变分级
4. Uncertainty-Aware Data Aggregation for Deep Imitation Learning [C] . Yuchen Cui, David Isele, Scott Niekum, International Conference on Robotics and Automation . 2019

机译：深度模仿学习的不确定性感知数据聚合
5. On and Off-policy Deep Imitation Learning for Robotics [D] . Laskey, Michael. 2018

机译：机器人学的禁止政策深度模仿学习
6. Learning for a Robot: Deep Reinforcement Learning Imitation Learning Transfer Learning [O] . Jiang Hua, Liangcai Zeng, Gongfa Li, 2021

机译：学习机器人：深增强学习仿制学习转移学习
7. Uncertainty-Aware Imitation Learning using Kernelized Movement Primitives [O] . Joao Silverio, Yanlong Huang, Fares J. Abu-Dakka, 2019

机译：不确定性意识到使用内核运动原语的仿制学习

Uncertainty-Aware Data Aggregation for Deep Imitation Learning

摘要

著录项

相似文献

相关主题

期刊订阅