...
首页> 外文期刊>Genomics, proteomics & bioinformatics >Hybrid Method Based on Information Gain and Support Vector Machine for Gene Selection in Cancer Classification
【24h】

Hybrid Method Based on Information Gain and Support Vector Machine for Gene Selection in Cancer Classification

机译:基于信息增益和支持向量机的癌症分类基因选择混合方法

获取原文

摘要

It remains a great challenge to achieve sufficient cancer classification accuracy with the entire set of genes, due to the high dimensions, small sample size, and big noise of gene expression data. We thus proposed a hybrid gene selection method, Information Gain-Support Vector Machine (IG-SVM) in this study. IG was initially employed to filter irrelevant and redundant genes. Then, further removal of redundant genes was performed using SVM to eliminate the noise in the datasets more effectively. Finally, the informative genes selected by IG-SVM served as the input for the LIBSVM classifier. Compared to other related algorithms, IG-SVM showed the highest classification accuracy and superior performance as evaluated using five cancer gene expression datasets based on a few selected genes. As an example, IG-SVM achieved a classification accuracy of 90.32% for colon cancer, which is difficult to be accurately classified, only based on three genes including CSRP1 , MYL9 , and GUCA2B .
机译:由于基因表达数据的尺寸大,样本量小和噪声大,要在整个基因组中实现足够的癌症分类准确性仍然是一个巨大的挑战。因此,我们在本研究中提出了一种杂种基因选择方法,即信息增益支持向量机(IG-SVM)。 IG最初用于过滤无关和冗余的基因。然后,使用SVM进一步去除冗余基因,以更有效地消除数据集中的噪声。最后,由IG-SVM选择的信息基因可作为LIBSVM分类器的输入。与其他相关算法相比,IG-SVM使用基于少数几个选定基因的五个癌症基因表达数据集进行评估,显示出最高的分类准确性和出色的性能。例如,仅基于CSRP1,MYL9和GUCA2B这三个基因,IG-SVM对结肠癌的分类精度达到90.32%,很难准确分类。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号