...
首页> 外文期刊>Chemometrics and Intelligent Laboratory Systems >Validation of soft classification models using partial class memberships: An extended concept of sensitivity & co. applied to grading of astrocytoma tissues
【24h】

Validation of soft classification models using partial class memberships: An extended concept of sensitivity & co. applied to grading of astrocytoma tissues

机译:使用部分类成员资格验证软分类模型:敏感性和联合的扩展概念。用于星形细胞瘤组织分级

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

We use partial class memberships in soft classification to model uncertain labeling and mixtures of classes. Partial class memberships are not restricted to predictions, but may also occur in reference labels (ground truth, gold standard diagnosis) for training and validation data. Classifier performance is usually expressed as fractions of the confusion matrix, like sensitivity, specificity, negative and positive predictive values. We extend this concept to soft classification and discuss the bias and variance properties of the extended performance measures. Ambiguity in reference labels translates to differences between best-case, expected and worst-case performance. We show a second set of measures comparing expected and ideal performance which is closely related to regression performance, namely the root mean squared error RMSE and the mean absolute error MAE. All calculations apply to classical crisp as well as to soft classification (partial class memberships as well as one-class classifiers). The proposed performance measures allow to test classifiers with actual borderline cases. In addition, hardening of e.g. posterior probabilities into class labels is not necessary, avoiding the corresponding information loss and increase in variance. We implemented the proposed performance measures in R package "softclassval" which is available from CRAN and at softclassval.r-forge.r-project.org. Our reasoning as well as the importance of partial memberships for chemometric classification is illustrated by a real-word application: astrocytoma brain tumor tissue grading (80 patients, 37,000 spectra) for finding surgical excision borders. As borderline cases are the actual target of the analytical technique, samples which are diagnosed to be borderline cases must be included in the validation.
机译:我们在软分类中使用部分类别成员资格来对不确定的标签和类别混合进行建模。部分班级成员资格不仅限于预测,还可能出现在用于培训和验证数据的参考标签(真实情况,黄金标准诊断)中。分类器性能通常表示为混淆矩阵的分数,例如敏感性,特异性,阴性和阳性预测值。我们将此概念扩展到软分类,并讨论扩展性能度量的偏差和方差性质。参考标签中的歧义会导致最佳情况,预期情况和最坏情况之间的差异。我们展示了比较预期性能和理想性能的第二组度量,它与回归性能密切相关,即均方根误差RMSE和均值绝对误差MAE。所有计算均适用于经典脆性以及软分类(部分类成员资格和一类分类器)。拟议的绩效衡量标准允许使用实际的临界情况测试分类器。另外,硬化例如。不必将后验概率放入类别标签中,从而避免了相应的信息丢失和方差增加。我们在C包和softclassval.r-forge.r-project.org中提供的R包“ softclassval”中实施了建议的性能指标。我们的推理以及部分成员资格对于化学计量学分类的重要性通过一个真实的字词应用得以说明:星形细胞瘤脑肿瘤组织分级(80位患者,37,000个光谱),用于寻找手术切除边界。由于临界情况是分析技术的实际目标,因此在验证中必须包括被诊断为临界情况的样品。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号