【24h】

A Novel Ensemble Approach for Cancer Data Classification

机译:癌症数据分类的新型集成方法

获取原文
获取原文并翻译 | 示例

摘要

Micorarray data are often extremely asymmetric in dimensionality, such as thousands or even tens of thousands of genes and a few hundreds of samples. Such extreme asymmetry between the dimensionality of genes and samples presents several challenges to conventional clustering and classification methods. In this paper, a novel ensemble method based on correlation analysis is proposed. Firstly, in order to extract useful features and reduce dimensionality, different feature selection methods based on correlation analysis are used to form different feature subsets. Then a pool of candidate base classifiers is generated to learn the subsets which are re-sampling from the different feature subsets. At last, appropriate classifiers are selected to construct the classification committee using EDA (Estimation of Distribution Algorithms) algorithm. Experiments show that the proposed method produces the best recognition rates on two benchmark databases.
机译:微阵列数据通常在维度上极为不对称,例如成千上万的基因和几百个样本。基因和样本的维数之间的这种极端不对称给传统的聚类和分类方法带来了一些挑战。提出了一种基于相关分析的集成方法。首先,为了提取有用的特征并降低维数,基于相关分析的不同特征选择方法被用来形成不同的特征子集。然后,生成候选基本分类器池,以从不同的特征子集中学习要重新采样的子集。最后,选择合适的分类器以利用EDA(Estimation of Distribution Algorithms)算法构建分类委员会。实验表明,该方法在两个基准数据库上产生了最佳的识别率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号