首页> 外文期刊>Bioinformatics >Multiple-rule bias in the comparison of classification rules
【24h】

Multiple-rule bias in the comparison of classification rules

机译:分类规则比较中的多规则偏差

获取原文
获取原文并翻译 | 示例
       

摘要

Motivation: There is growing discussion in the bioinformatics community concerning overoptimism of reported results. Two approaches contributing to overoptimism in classification are (i) the reporting of results on datasets for which a proposed classification rule performs well and (ii) the comparison of multiple classification rules on a single dataset that purports to show the advantage of a certain rule.Results: This article provides a careful probabilistic analysis of the second issue and the 'multiple-rule bias', resulting from choosing a classification rule having minimum estimated error on the dataset. It quantifies this bias corresponding to estimating the expected true error of the classification rule possessing minimum estimated error and it characterizes the bias from estimating the true comparative advantage of the chosen classification rule relative to the others by the estimated comparative advantage on the dataset. The analysis is applied to both synthetic and real data using a number of classification rules and error estimators.
机译:动机:关于报告结果的过度乐观,生物信息学界越来越多的讨论。导致分类过分乐观的两种方法是(i)报告建议的分类规则表现良好的数据集上的结果,以及(ii)在单个数据集上比较多个分类规则的比较,这些数据看起来证明了某个规则的优势。结果:本文对第二个问题和“多个规则偏差”进行了仔细的概率分析,这是由于选择了数据集上估计误差最小的分类规则而导致的。它量化了与估计具有最小估计误差的分类规则的预期真实误差相对应的偏差,并且通过根据数据集上的估计比较优势来估计所选分类规则相对于其他分类规则的真实比较优势来表征偏差。使用许多分类规则和误差估计器,可以将分析应用于合成数据和真实数据。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号