首页> 外文会议>International Conference on Robotics and Automation >Uncertainty-Aware Data Aggregation for Deep Imitation Learning
【24h】

Uncertainty-Aware Data Aggregation for Deep Imitation Learning

机译:用于深度模仿学习的不确定性数据聚合

获取原文

摘要

Estimating statistical uncertainties allows autonomous agents to communicate their confidence during task execution and is important for applications in safety-critical domains such as autonomous driving. In this work, we present the uncertainty-aware imitation learning (UAIL) algorithm for improving end-to-end control systems via data aggregation. UAIL applies Monte Carlo Dropout to estimate uncertainty in the control output of end-to-end systems, using states where it is uncertain to selectively acquire new training data. In contrast to prior data aggregation algorithms that force human experts to visit sub-optimal states at random, UAIL can anticipate its own mistakes and switch control to the expert in order to prevent visiting a series of sub-optimal states. Our experimental results from simulated driving tasks demonstrate that our proposed uncertainty estimation method can be leveraged to reliably predict infractions. Our analysis shows that UAIL outperforms existing data aggregation algorithms on a series of benchmark tasks.
机译:估计统计不确定性使自治代理可以在任务执行期间传达其信心,这对于诸如安全驾驶等安全关键领域的应用非常重要。在这项工作中,我们提出了不确定性感知模仿学习(UAIL)算法,用于通过数据聚合来改善端到端控制系统。 UAIL使用不确定是否有选择地获取新训练数据的状态,使用Monte Carlo Dropout来估计端到端系统的控制输出中的不确定性。与迫使人类专家随机访问次优状态的现有数据聚合算法相反,UAIL可以预见自己的错误,并将控制权切换给专家,以防止访问一系列次优状态。我们从模拟驾驶任务获得的实验结果表明,我们提出的不确定性估计方法可用于可靠地预测违规情况。我们的分析表明,在一系列基准测试任务上,UAIL的性能优于现有的数据聚合算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号