...
首页> 外文期刊>BioMed research international >Recognition of Multiple Imbalanced Cancer Types Based on DNA Microarray Data Using Ensemble Classifiers
【24h】

Recognition of Multiple Imbalanced Cancer Types Based on DNA Microarray Data Using Ensemble Classifiers

机译:基于DNA微阵列数据的集成分类器识别多种失衡癌症类型

获取原文

摘要

DNA microarray technology can measure the activities of tens of thousands of genes simultaneously, which provides an efficient way to diagnose cancer at the molecular level. Although this strategy has attracted significant research attention, most studies neglect an important problem, namely, that most DNA microarray datasets are skewed, which causes traditional learning algorithms to produce inaccurate results. Some studies have considered this problem, yet they merely focus on binary-class problem. In this paper, we dealt with multiclass imbalanced classification problem, as encountered in cancer DNA microarray, by using ensemble learning. We utilized one-against-all coding strategy to transform multiclass to multiple binary classes, each of them carrying out feature subspace, which is an evolving version of random subspace that generates multiple diverse training subsets. Next, we introduced one of two different correction technologies, namely, decision threshold adjustment or random undersampling, into each training subset to alleviate the damage of class imbalance. Specifically, support vector machine was used as base classifier, and a novel voting rule called counter voting was presented for making a final decision. Experimental results on eight skewed multiclass cancer microarray datasets indicate that unlike many traditional classification approaches, our methods are insensitive to class imbalance.
机译:DNA微阵列技术可以同时测量数万个基因的活性,这提供了在分子水平诊断癌症的有效方法。尽管此策略已引起了广泛的研究关注,但大多数研究都忽略了一个重要问题,即大多数DNA微阵列数据集存在偏差,这导致传统的学习算法产生不准确的结果。一些研究已经考虑了这个问题,但他们只关注二进制类问题。在本文中,我们通过使用集成学习处理了癌症DNA微阵列中遇到的多类不平衡分类问题。我们使用了一种从头到尾的编码策略,将多类转换成多个二进制类,每个类都执行特征子空间,这是随机子空间的演进版本,可生成多个不同的训练子集。接下来,我们在每个训练子集中引入了两种不同的校正技术之一,即决策阈值调整或随机欠采样,以减轻班级失衡的损害。具体来说,使用支持向量机作为基础分类器,提出了一种新的投票规则,称为反投票,以做出最终决定。对八个倾斜的多类癌症微阵列数据集的实验结果表明,与许多传统分类方法不同,我们的方法对类不平衡不敏感。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号