首页> 美国卫生研究院文献>Genomics, Proteomics Bioinformatics >Hybrid Method Based on Information Gain and Support Vector Machine for Gene Selection in Cancer Classification
【2h】

Hybrid Method Based on Information Gain and Support Vector Machine for Gene Selection in Cancer Classification

机译:基于信息增益和支持向量机的癌症分类基因选择混合方法

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

It remains a great challenge to achieve sufficient cancer classification accuracy with the entire set of genes, due to the high dimensions, small sample size, and big noise of gene expression data. We thus proposed a hybrid gene selection method, Information Gain-Support Vector Machine (IG-SVM) in this study. IG was initially employed to filter irrelevant and redundant genes. Then, further removal of redundant genes was performed using SVM to eliminate the noise in the datasets more effectively. Finally, the informative genes selected by IG-SVM served as the input for the LIBSVM classifier. Compared to other related algorithms, IG-SVM showed the highest classification accuracy and superior performance as evaluated using five cancer gene expression datasets based on a few selected genes. As an example, IG-SVM achieved a classification accuracy of 90.32% for colon cancer, which is difficult to be accurately classified, only based on three genes including CSRP1, MYL9, and GUCA2B.
机译:由于基因表达数据的尺寸大,样本量小和噪声大,要在整个基因组中实现足够的癌症分类准确性仍然是一个巨大的挑战。因此,在本研究中,我们提出了一种混合的基因选择方法,即信息增益支持向量机(IG-SVM)。 IG最初用于过滤无关和冗余的基因。然后,使用SVM进一步去除冗余基因,以更有效地消除数据集中的噪声。最后,由IG-SVM选择的信息基因可作为LIBSVM分类器的输入。与其他相关算法相比,IG-SVM使用基于几个选定基因的五个癌症基因表达数据集进行评估,显示出最高的分类准确性和出色的性能。例如,仅基于CSRP1,MYL9和GUCA2B这三个基因,IG-SVM对结肠癌的分类精度达到90.32%,很难准确分类。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号