首页> 外文期刊>Engineering Applications of Artificial Intelligence >Ensemble-based active learning using fuzzy-rough approach for cancer sample classification
【24h】

Ensemble-based active learning using fuzzy-rough approach for cancer sample classification

机译:基于集成的基于模糊粗糙集的主动学习用于癌症样本分类

获取原文
获取原文并翻译 | 示例
           

摘要

Background and Objective: Classification of cancer from gene expression data is one of the major research areas in the field of machine learning and medical science. Generally, conventional supervised methods are not able to produce desired classification accuracy due to inadequate training samples present in gene expression data to train the system. Ensemble-based active learning technique in this situation can be effective as it determines few informative samples by all the base classifiers and ensemble the decisions of all the base classifiers to get the most informative samples. Most informative samples are labeled by the subject experts and those are added to the training set, which can improve the classification accuracy. Method: We propose a novel ensemble-based active learning using fuzzy-rough approach for cancer sample classification from microarray gene expression data. The proposed method is able to deal with the uncertainty, overlap and indiscernibility usually present in the subtype classes of the gene expression data and can improve the accuracy of the individual base classifier in presence of limited training samples. Results: The proposed method is validated using eight microarray gene expression datasets. The performance of the proposed method in terms of classification accuracy, precision, recall, F_1-measures and kappa is compared with six other methods. The improvements in accuracy achieved by the proposed method compared to its nearest competitive methods are 2.96%, 9.34%, 0.93%, 3.69%, 7.2% and 4.53% respectively for Colon cancer, Prostate cancer, SRBCT, Ovarian cancer, DLBCL and Central nervous system datasets. Results of the paired t-test justify the statistical relevance of the results in favor of the proposed method for most of the datasets. Conclusion: The proposed method is an effective general purpose ensemble-based active learning adopting the fuzzy-rough concept and therefore can be applied for other classification problem in future.
机译:背景与目的:从基因表达数据分类癌症是机器学习和医学领域的主要研究领域之一。通常,由于在基因表达数据中存在不足以训练系统的训练样本,常规的监督方法不能产生期望的分类精度。在这种情况下,基于集成的主动学习技术可能是有效的,因为它由所有基本分类器确定的信息样本很少,并且集合所有基本分类器的决策以获取最多的信息样本。大多数信息样本由主题专家标记,然后将其添加到训练集中,这可以提高分类准确性。方法:我们提出了一种基于整体的主动学习,使用模糊粗糙方法从微阵列基因表达数据中对癌症样本进行分类。所提出的方法能够处理基因表达数据的亚型类别中通常存在的不确定性,重叠性和不可区分性,并且能够在训练样本有限的情况下提高单个碱基分类器的准确性。结果:使用8个微阵列基因表达数据集验证了该方法的有效性。将该方法在分类准确性,精确度,查全率,F_1度量和kappa方面的性能与其他六种方法进行了比较。与最接近的竞争方法相比,该方法在结肠癌,前列腺癌,SRBCT,卵巢癌,DLBCL和中枢神经系统中的准确性分别提高了2.96%,9.34%,0.93%,3.69%,7.2%和4.53%。系统数据集。配对t检验的结果证明了结果的统计相关性,有利于大多数数据集的拟议方法。结论:该方法是一种有效的基于模糊集合概念的基于通用集成的主动学习方法,因此可以在将来应用于其他分类问题。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号