首页> 美国卫生研究院文献>Journal of Cheminformatics >Efficiency of different measures for defining the applicability domain of classification models
【2h】

Efficiency of different measures for defining the applicability domain of classification models

机译:定义分类模型适用范围的不同措施的效率

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

The goal of defining an applicability domain for a predictive classification model is to identify the region in chemical space where the model’s predictions are reliable. The boundary of the applicability domain is defined with the help of a measure that shall reflect the reliability of an individual prediction. Here, the available measures are differentiated into those that flag unusual objects and which are independent of the original classifier and those that use information of the trained classifier. The former set of techniques is referred to as novelty detection while the latter is designated as confidence estimation. A review of the available confidence estimators shows that most of these measures estimate the probability of class membership of the predicted objects which is inversely related to the error probability. Thus, class probability estimates are natural candidates for defining the applicability domain but were not comprehensively included in previous benchmark studies. The focus of the present study is to find the best measure for defining the applicability domain for a given binary classification technique and to determine the performance of novelty detection versus confidence estimation. Six different binary classification techniques in combination with ten data sets were studied to benchmark the various measures. The area under the receiver operating characteristic curve (AUC ROC) was employed as main benchmark criterion. It is shown that class probability estimates constantly perform best to differentiate between reliable and unreliable predictions. Previously proposed alternatives to class probability estimates do not perform better than the latter and are inferior in most cases. Interestingly, the impact of defining an applicability domain depends on the observed area under the receiver operator characteristic curve. That means that it depends on the level of difficulty of the classification problem (expressed as AUC ROC) and will be largest for intermediately difficult problems (range AUC ROC 0.7–0.9). In the ranking of classifiers, classification random forests performed best on average. Hence, classification random forests in combination with the respective class probability estimate are a good starting point for predictive binary chemoinformatic classifiers with applicability domain.>Graphical abstract.
机译:定义预测分类模型的适用范围的目的是识别化学空间中该模型的预测可靠的区域。适用性域的边界是借助一种措施来定义的,该措施应反映单个预测的可靠性。在这里,可用的度量分为标记异常对象的度量和独立于原始分类器的度量,以及使用经过训练的分类器信息的度量。前一组技术称为新颖性检测,而后一组技术称为置信度估计。对可用置信估计量的回顾表明,这些度量中的大多数都估计了预测对象的类成员身份的概率,该概率与错误概率成反比。因此,分类概率估计是定义适用性域的自然候选者,但并未全面纳入先前的基准研究中。本研究的重点是找到最佳方法,以定义给定二进制分类技术的适用范围,并确定新颖性检测与置信度估计的性能。研究了六种不同的二进制分类技术以及十个数据集,以对各种度量进行基准测试。接收机工作特性曲线下的面积(AUC ROC)被用作主要的基准标准。结果表明,类别概率估计值始终表现最佳,以区分可靠的预测和不可靠的预测。先前提出的类别概率估计值的替代方案并不比后者更好,并且在大多数情况下均较差。有趣的是,定义适用范围的影响取决于接收方操作员特征曲线下的观察区域。这意味着它取决于分类问题的难度级别(表示为AUC ROC),并且对于中等难度的问题最大(范围为AUC ROC 0.7-0.9)。在分类器的排名中,分类随机森林平均表现最佳。因此,将分类随机森林与相应的类别概率估计值相结合是具有适用性域的预测性二进制化学信息分类器的一个很好的起点。<!-fig ft0-> <!-fig @ position =“ anchor” mode = article f4-> <!-fig mode =“ anchored” f5-> >图形摘要<!-fig / graphic | fig / alternatives / graphic mode =“ anchored” m1-> <!-标题a7->。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号