...
首页> 外文期刊>Journal of chemical information and modeling >Reliable Prediction Errors for Deep Neural Networks Using Test-Time Dropout
【24h】

Reliable Prediction Errors for Deep Neural Networks Using Test-Time Dropout

机译:使用测试时间辍学的深度神经网络可靠的预测误差

获取原文
获取原文并翻译 | 示例
           

摘要

While the use of deep learning in drug discovery is gaining increasing attention, the lack of methods to compute reliable errors in prediction for Neural Networks prevents their application to guide decision making in domains where identifying unreliable predictions is essential, e.g., precision medicine. Here, we present a framework to compute reliable errors in prediction for Neural Networks using Test Time Dropout and Conformal Prediction. Specifically, the algorithm consists of training a single Neural Network using dropout, and then applying it N times to both the validation and test sets, also employing dropout in this step. Therefore, for each instance in the validation and test sets an ensemble of predictions are generated. The residuals and absolute errors in prediction for the validation set are then used to compute prediction errors for the test set instances using Conformal Prediction. We show using 24 bioactivity data sets from ChEMBL 23 that Dropout Conformal Predictors are valid (i.e., the fraction of instances whose true value lies within the predicted interval strongly correlates with the confidence level) and efficient, as the predicted confidence intervals span a narrower set of values than those computed with Conformal Predictors generated using Random Forest (RF) models. Lastly, we show in retrospective virtual screening experiments that dropout and RF-based Conformal Predictors lead to comparable retrieval rates of active compounds. Overall, we propose a computationally efficient framework (as only N extra forward passes are required in addition to training a single network) to harness Test-Time Dropout and the Conformal Prediction framework, which is generally applicable to generate reliable prediction errors for Deep Neural Networks in drug discovery and beyond.
机译:虽然在药物发现中使用深度学习正在增加越来越关注,但是缺乏在针对神经网络预测中计算可靠误差的方法可以防止其应用来指导在识别不可靠预测的域中的决策,例如精确的药物。在这里,我们介绍了一个框架,用于计算使用测试时间辍学和共形预测对神经网络预测的可靠误差。具体而言,该算法包括使用辍学措施训练单个神经网络,然后将其施加到验证和测试集,也在此步骤中采用丢失。因此,对于验证和测试集中的每个实例,生成预测的集合。然后,使用对验证集预测的残差和绝对误差来计算使用保形预测来计算测试集实例的预测误差。我们以来自ChemB13的24生物活性数据集来显示丢弃保形预制装置有效的(即,真实值在预测间隔内的实例的分数与置信水平强烈相关),并且有效,因为预测的置信区间跨越较窄的集合比使用随机林(RF)模型产生的共形预测器计算的值的值。最后,我们在回顾性的虚拟筛选实验中显示出辍学和基于RF的共形预测因子导致可比的活性化合物的检索速率。总的来说,我们提出了一种计算上有效的框架(除了训练单个网络之外,还需要N超前通过)来利用测试时间辍学和共形预测框架,这通常适用于为深神经网络产生可靠的预测误差在药物发现和超越。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号