首页> 外文会议>Bioinformatics and Biomedicine, 2009. BIBM '09 >Ensemble Learning Based on Active Example Selection for Solving Imbalanced Data Problem in Biomedical Data
【24h】

Ensemble Learning Based on Active Example Selection for Solving Imbalanced Data Problem in Biomedical Data

机译:基于主动样本选择的集成学习解决生物医学数据不平衡问题

获取原文

摘要

The imbalanced data problem is popular in biomedical classification tasks. Since trained classifiers using imbalanced data are mostly derived from the majority class, their prediction performance is poor for the minority class. In this paper, we propose a novel ensemble learning method based on an active example selection algorithm to resolve the imbalanced data problem. To compensate a possible sub-optimal classifier, our proposed ensemble learning methods aggregates classifiers built by the active example selection algorithm. We implement this ensemble learning method based on the active example selection algorithm using incremental naïve Bayes classifiers. Our empirical results show that we greatly improve the performance of classification models trained by five real world imbalanced biomedical data. The proposed ensemble learning methods outperforms other approaches by 0.03~0.15 in terms of AUC which solve imbalanced data problem.
机译:数据不平衡问题在生物医学分类任务中很普遍。由于使用不平衡数据的训练有素的分类器大部分来自多数类别,因此对于少数类别其预测性能较差。在本文中,我们提出了一种基于主动示例选择算法的集成学习方法,以解决数据不平衡问题。为了补偿可能的次优分类器,我们提出的集成学习方法聚合了由主动示例选择算法构建的分类器。我们使用增量朴素贝叶斯分类器,基于活动示例选择算法,实现了这种整体学习方法。我们的经验结果表明,我们极大地提高了由五个现实世界中不平衡的生物医学数据训练出的分类模型的性能。所提出的集成学习方法在解决数据不平衡问题的AUC方面优于其他方法0.03〜0.15。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号