...
首页> 外文期刊>Genetics and Molecular Research >Multiclass microarray data classification based on confidence evaluation
【24h】

Multiclass microarray data classification based on confidence evaluation

机译:基于置信度评估的多类微阵列数据分类

获取原文
           

摘要

Microarray technology is becoming a powerful tool for clinical diagnosis, as it has potential to discover gene expression patterns that are characteristic for a particular disease. To date, this possibility has received much attention in the context of cancer research, especially in tumor classification. However, most published articles have concentrated on the development of binary classification methods while neglected ubiquitous multiclass problems. Unfortunately, only a few multiclass classification approaches have had poor predictive accuracy. In an effort to improve classification accuracy, we developed a novel multiclass microarray data classification method. First, we applied a “one versus rest-support vector machine” to classify the samples. Then the classification confidence of each testing sample was evaluated according to its distribution in feature space and some with poor confidence were extracted. Next, a novel strategy, which we named as “class priority estimation method based on centroid distance”, was used to make decisions about categories for those poor confidence samples. This approach was tested on seven benchmark multiclass microarray datasets, with encouraging results, demonstrating effectiveness and feasibility.
机译:微阵列技术正在成为临床诊断的有力工具,因为它具有发现特定疾病特征基因表达模式的潜力。迄今为止,在癌症研究的背景下,尤其是在肿瘤分类中,这种可能性受到了广泛的关注。但是,大多数已发表的文章都集中在二进制分类方法的开发上,而忽略了普遍存在的多类问题。不幸的是,只有几种多类分类方法的预测准确性较差。为了提高分类准确性,我们开发了一种新颖的多类微阵列数据分类方法。首先,我们应用了“一个与支持向量机对比”一词对样本进行分类。然后根据每个测试样本在特征空间中的分布来评估其分类置信度,并提取一些置信度较差的样本。接下来,我们将一种称为“基于质心距离的类别优先级估计方法”的新颖策略用于针对那些较差置信度样本做出类别决策。该方法在七个基准多类微阵列数据集上进行了测试,结果令人鼓舞,证明了其有效性和可行性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号