...
首页> 外文期刊>Computational Biology and Bioinformatics, IEEE/ACM Transactions on >An Improved Ensemble Learning Methodfor Classifying High-Dimensionaland Imbalanced Biomedicine Data
【24h】

An Improved Ensemble Learning Methodfor Classifying High-Dimensionaland Imbalanced Biomedicine Data

机译:一种改进的集成学习方法,用于分类高维不平衡生物医学数据

获取原文
获取原文并翻译 | 示例
           

摘要

Training classifiers on skewed data can be technically challenging tasks, especially if the data is high-dimensional simultaneously, the tasks can become more difficult. In biomedicine field, skewed data type often appears. In this study, we try to deal with this problem by combining asymmetric bagging ensemble classifier (asBagging) that has been presented in previous work and an improved random subspace (RS) generation strategy that is called feature subspace (FSS). Specifically, FSS is a novel method to promote the balance level between accuracy and diversity of base classifiers in asBagging. In view of the strong generalization capability of support vector machine (SVM), we adopt it to be base classifier. Extensive experiments on four benchmark biomedicine data sets indicate that the proposed ensemble learning method outperforms many baseline approaches in terms of Accuracy, F-measure, G-mean and AUC evaluation criterions, thus it can be regarded as an effective and efficient tool to deal with high-dimensional and imbalanced biomedical data.
机译:在偏斜数据上训练分类器在技术上可能是具有挑战性的任务,尤其是如果数据同时是高维的,则任务可能会变得更加困难。在生物医学领域,经常会出现偏斜的数据类型。在这项研究中,我们尝试通过结合先前工作中介绍的非对称装袋集成分类器(asBagging)和称为特征子空间(FSS)的改进的随机子空间(RS)生成策略来解决此问题。具体而言,FSS是一种新方法,可提高asBagging中基本分类器的准确性和多样性之间的平衡水平。鉴于支持向量机(SVM)的强大泛化能力,我们将其用作基础分类器。在四个基准生物医学数据集上进行的大量实验表明,所提出的整体学习方法在准确性,F量度,G均值和AUC评估标准方面优于许多基线方法,因此可以被视为一种有效的应对方法高维和不平衡的生物医学数据。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号