首页> 外文期刊>Biochemical and Biophysical Research Communications >SVM-T-RFE: A novel gene selection algorithm for identifying metastasis-related genes in colorectal cancer using gene expression profiles
【24h】

SVM-T-RFE: A novel gene selection algorithm for identifying metastasis-related genes in colorectal cancer using gene expression profiles

机译:SVM-T-RFE:一种新颖的基因选择算法,可使用基因表达谱识别大肠癌中与转移相关的基因

获取原文
获取原文并翻译 | 示例
           

摘要

Although metastasis is the principal cause of death cause for colorectal cancer (CRC) patients, the molecular mechanisms underlying CRC metastasis are still not fully understood. In an attempt to identify metastasis-related genes in CRC, we obtained gene expression profiles of 55 early stage primary CRCs, 56 late stage primary CRCs, and 34 metastatic CRCs from the expression project in Oncology (http://www.intgen.org/expo/). We developed a novel gene selection algorithm (SVM-T-RFE), which extends support vector machine recursive feature elimination (SVM-RFE) algorithm by incorporating . T-statistic. We achieved highest classification accuracy (100%) with smaller gene subsets (10 and 6, respectively), when classifying between early and late stage primary CRCs, as well as between metastatic CRCs and late stage primary CRCs. We also compared the performance of SVM-T-RFE and SVM-RFE gene selection algorithms on another large-scale CRC dataset and the five public microarray datasets. SVM-T-RFE bestowed SVM-RFE algorithm in identifying more differentially expressed genes, and achieving highest prediction accuracy using equal or smaller number of selected genes. A fraction of selected genes have been reported to be associated with CRC development or metastasis.
机译:尽管转移是结直肠癌(CRC)患者死亡的主要原因,但仍未完全了解CRC转移的分子机制。为了鉴定CRC中与转移相关的基因,我们从Oncology(http://www.intgen.org)的表达项目中获得了55个早期原发CRC,56个晚期原发CRC和34个转移性CRC的基因表达谱。 / expo /)。我们开发了一种新颖的基因选择算法(SVM-T-RFE),该算法通过合并扩展了支持向量机递归特征消除(SVM-RFE)算法。 T统计。在早期和晚期原发性CRC之间以及转移性CRC与晚期原发性CRC之间进行分类时,我们以较小的基因子集(分别为10和6)获得了最高的分类准确度(100%)。我们还比较了SVM-T-RFE和SVM-RFE基因选择算法在另一个大型CRC数据集和五个公共微阵列数据集上的性能。 SVM-T-RFE赋予SVM-RFE算法以识别更多差异表达的基因,并使用相等或更少数量的选定基因实现最高的预测准确性。据报道,部分选定的基因与CRC的发展或转移有关。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号