首页> 外文期刊>Computational Biology and Bioinformatics, IEEE/ACM Transactions on >Prediction of Cancer Class with Majority Voting Genetic Programming Classifier Using Gene Expression Data
【24h】

Prediction of Cancer Class with Majority Voting Genetic Programming Classifier Using Gene Expression Data

机译:基于基因表达数据的多数投票遗传规划分类器对癌症分类的预测

获取原文
获取原文并翻译 | 示例
       

摘要

In order to get a better understanding of different types of cancers and to find the possible biomarkers for diseases, recently, many researchers are analyzing the gene expression data using various machine learning techniques. However, due to a very small number of training samples compared to the huge number of genes and class imbalance, most of these methods suffer from overfitting. In this paper, we present a majority voting genetic programming classifier (MVGPC) for the classification of microarray data. Instead of a single rule or a single set of rules, we evolve multiple rules with genetic programming (GP) and then apply those rules to test samples to determine their labels with majority voting technique. By performing experiments on four different public cancer data sets, including multiclass data sets, we have found that the test accuracies of MVGPC are better than those of other methods, including AdaBoost with GP. Moreover, some of the more frequently occurring genes in the classification rules are known to be associated with the types of cancers being studied in this paper.
机译:为了更好地理解不同类型的癌症并找到可能的疾病生物标记,最近,许多研究人员正在使用各种机器学习技术来分析基因表达数据。但是,由于与大量基因和类别不平衡相比,训练样本的数量非常少,因此这些方法大多数都存在过度拟合的问题。在本文中,我们提出了一种用于微阵列数据分类的多数投票遗传规划分类器(MVGPC)。代替单个规则或单个规则集,我们通过遗传编程(GP)演化了多个规则,然后将这些规则应用于测试样本,以采用多数表决技术确定其标签。通过对包括多类数据集在内的四个不同的公共癌症数据集进行实验,我们发现MVGPC的测试精度优于包括AdaBoost和GP在内的其他方法。此外,分类规则中一些较常见的基因与本文研究的癌症类型有关。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号