【24h】

A NOVEL CLASSIFICATION METHOD OF MICROARRAY WITH RELIABILITY AND CONFIDENCE

机译:具有可靠性和置信度的微阵列的新分类方法

获取原文

摘要

Most of state-of-the-art machine learning algorithms cannot provide a reliable measure of their classifications and predictions. This paper addresses the importance of reliability and confidence for classification, and presents a novel method based on a combination of the unexcelled ensemble method, random forest (RF), and transductive confidence machine (TCM) which we call TCM-RF. The new algorithm hedges the predictions of RF and gives a well-calibrated region prediction by using the proximity matrix generated with RF as a nonconformity measure of examples. The new method takes advantage of RF and possesses a more precise and robust nonconformity measure. It can deal with redundant and noisy data with mixed types of variables, and is less sensitive to parameter settings. Experiments on benchmark datasets show it is more effective and robust than other TCMs. Further study on a real-world lymphoma microarray dataset shows its superiority over SVM with the ability of controlling the risk of error.
机译:大多数最新的机器学习算法无法提供对其分类和预测的可靠衡量。本文讨论了可靠性和置信度对于分类的重要性,并提出了一种基于出色的集成方法,随机森林(RF)和转导置信度机器(TCM)的新方法,我们将其称为TCM-RF。新算法对RF的预测进行套期保值,并通过将RF生成的邻近矩阵用作示例的不合格度量来给出经过良好校准的区域预测。该新方法利用了RF的优势,并具有更精确和鲁棒的不合格措施。它可以处理混合类型变量的冗余数据和嘈杂数据,并且对参数设置不那么敏感。在基准数据集上进行的实验表明,它比其他TCM更有效,更强大。对真实淋巴瘤微阵列数据集的进一步研究表明,它具有优于SVM的优势,并且具有控制错误风险的能力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号