首页> 外文会议>Bioinformatics Research and Applications; Lecture Notes in Bioinformatics; 4463 >A Support Vector Machine Ensemble for Cancer Classification Using Gene Expression Data
【24h】

A Support Vector Machine Ensemble for Cancer Classification Using Gene Expression Data

机译:使用基因表达数据进行癌症分类的支持向量机集成

获取原文
获取原文并翻译 | 示例

摘要

In this paper, we propose a support vector machine (SVM) ensemble classification method. Firstly, dataset is preprocessed by Wilcoxon rank sum test to filter irrelevant genes. Then one SVM is trained using the training set, and is tested by the training set itself to get prediction results. Those samples with error prediction result or low confidence are selected to train the second SVM, and also the second SVM is tested again. Similarly, the third SVM is obtained using those samples, which cannot be correctly classified using the second SVM with large confidence. The three SVMs form SVM ensemble classifier. Finally, the testing set is fed into the ensemble classifier. The final test prediction results can be got by majority voting. Experiments are performed on two standard benchmark datasets: Breast Cancer, ALL/AML Leukemia. Experimental results demonstrate that the proposed method can reach the state-of-the-art performance on classification.
机译:在本文中,我们提出了一种支持向量机(SVM)集成分类方法。首先,通过Wilcoxon秩和检验对数据集进行预处理,以过滤不相关的基因。然后,使用训练集对一个SVM进行训练,并通过训练集本身对其进行测试,以获得预测结果。选择那些具有错误预测结果或低置信度的样本来训练第二个SVM,然后再次测试第二个SVM。类似地,使用那些样本无法获得第三个SVM,而使用第二个SVM无法正确地对它们进行分类。这三个SVM构成SVM集成分类器。最后,将测试集输入到集成分类器中。最终的测试预测结果可以通过多数表决获得。实验在两个标准基准数据集上进行:乳腺癌,ALL / AML白血病。实验结果表明,该方法可以达到分类的最新性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号