首页> 外文会议>10th IEEE International Conference on Data Mining Workshops >An Empirical Comparison of Platt Calibration and Inductive Confidence Machines for Predictions in Drug Discovery
【24h】

An Empirical Comparison of Platt Calibration and Inductive Confidence Machines for Predictions in Drug Discovery

机译:Platt校准和归纳置信机对药物发现预测的经验比较

获取原文

摘要

During the early phase of drug discovery, machine learning methods are often utilized to select compounds to send for experimental screening. In order to accomplish this goal, any method that can provide estimates of error rate for a given set of predictions is an extremely valuable tool. In this paper we compare Platt Calibration Algorithm and recently introduced Conformal Algorithm to control the error rate in the sense of precision while preserving the ability to identify as many compounds as possible (recall) that are highly likely to be bio-active in a certain context. We empirically evaluate and compare the performance of Plattȁ9;s Calibration and offline Mondrian ICM in the context of SVM-based classification on 75 distinct classification problems. We perform this evaluation in the real world setting where the true class labels of compounds are unknown at the time of prediction and are only revealed after the biological experiment is completed. Our empirical results show that under this setting, offline Mondrian ICM and Platt Calibration are not able to bound precision rates very well on an absolute basis. Comparatively, Mondrian ICM, even though not theoretically designed to control precision directly, compares favorably with Platt Calibration for this task.
机译:在药物发现的早期阶段,机器学习方法经常被用来选择化合物进行实验筛选。为了实现此目标,可以为给定的一组预测提供错误率估计的任何方法都是非常有价值的工具。在本文中,我们比较了Platt校准算法,并在最近引入了保形算法,以从精度的角度控制错误率,同时保留了在特定情况下识别出尽可能多的具有生物活性的化合物(召回)的能力。 。我们在75个不同分类问题的基于SVM的分类中,根据经验评估和比较Plattȁ9; s Calibration和离线Mondrian ICM的性能。我们在真实的环境中执行此评估,在该环境中,化合物的真实类别标签在预测时是未知的,并且仅在生物学实验完成后才显示出来。我们的经验结果表明,在这种情况下,离线Mondrian ICM和Platt Calibration绝对无法很好地限制精确率。相比之下,Mondrian ICM尽管在理论上并未设计为直接控制精度,但在完成此任务方面与Platt Calibration相比还是比较有利的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号