首页> 外文会议>IEEE Conference on Software Testing, Verification and Validation >Fail-Safe Execution of Deep Learning based Systems through Uncertainty Monitoring
【24h】

Fail-Safe Execution of Deep Learning based Systems through Uncertainty Monitoring

机译:通过不确定监测失败基于深度学习的系统的执行

获取原文

摘要

Modern software systems rely on Deep Neural Networks (DNN) when processing complex, unstructured inputs, such as images, videos, natural language texts or audio signals. Provided the intractably large size of such input spaces, the intrinsic limitations of learning algorithms and the ambiguity about the expected predictions for some of the inputs, not only there is no guarantee that DNN’s predictions are always correct, but rather developers must safely assume a low, though not negligible, error probability. A fail-safe Deep Learning based System (DLS) is one equipped to handle DNN faults by means of a supervisor, capable of recognizing predictions that should not be trusted and that should activate a healing procedure bringing the DLS to a safe state.In this paper, we propose an approach to use DNN uncertainty estimators to implement such supervisor. We first discuss advantages and disadvantages of existing approaches to measure uncertainty for DNNs and propose novel metrics for the empirical assessment of the supervisor that rely on such approaches. We then describe our publicly available tool UNCERTAINTY-WIZARD, which allows transparent estimation of uncertainty for regular tf.keras DNNs. Lastly, we discuss a large-scale study conducted on four different subjects to empirically validate the approach, reporting the lessons-learned as guidance for software engineers who intend to monitor uncertainty for fail-safe execution of DLS.
机译:现代软件系统在处理复杂的非结构化输入时依赖深神经网络(DNN),例如图像,视频,自然语言文本或音频信号。提供了这种输入空间的难以接触的大尺寸,学习算法的内在限制和关于一些输入的预期预测的歧义,不仅没有保证DNN的预测总是正确的,而且开发人员必须安全地假设低电平虽然不可忽略,但误差概率并不可忽略。一个失败的深度学习的系统(DLS)是装备通过监督员处理DNN故障的系统(DLS),能够识别不受信任的预测,并且应该激活将DLS带到安全状态的治疗程序。这纸张,我们提出了一种使用DNN不确定性估计的方法来实施此类主管。我们首先讨论现有方法的优缺点,以衡量DNN的不确定性,并提出依赖此类方法的监督员的实证评估的新型度量。然后,我们描述了我们公开的工具不确定性巫师,这允许透明估计常规TF.keras DNN的不确定性。最后,我们讨论了在四个不同的科目中进行的大规模研究,以便经验验证方法,将经验教训报告为打算监测失败安全执行不确定性的软件工程师的指导。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号