首页> 外文会议>International Conference on Bioinformatics and Biomedicine >Ensemble Learning Based on Active Example Selection for Solving Imbalanced Data Problem in Biomedical Data
【24h】

Ensemble Learning Based on Active Example Selection for Solving Imbalanced Data Problem in Biomedical Data

机译:基于主动示例选择的集合学习,用于解决生物医学数据中的不平衡数据问题

获取原文
获取外文期刊封面目录资料

摘要

The imbalanced data problem is popular in biomedical classification tasks. Since trained classifiers using imbalanced data are mostly derived from the majority class, their prediction performance is poor for the minority class. In this paper, we propose a novel ensemble learning method based on an active example selection algorithm to resolve the imbalanced data problem. To compensate a possible sub-optimal classifier, our proposed ensemble learning methods aggregates classifiers built by the active example selection algorithm. We implement this ensemble learning method based on the active example selection algorithm using incremental naive Bayes classifiers. Our empirical results show that we greatly improve the performance of classification models trained by five real world imbalanced biomedical data. The proposed ensemble learning methods outperforms other approaches by 0.03~0.15 in terms of AUC which solve imbalanced data problem.
机译:不平衡数据问题在生物医学分类任务中是流行的。由于使用不平衡数据的经过培训的分类器主要来自大多数类,因此它们的预测性能对于少数民族类别差。在本文中,我们提出了一种基于活动示例选择算法的新型集合学习方法来解决不平衡数据问题。为了补偿可能的子最优分类器,我们所提出的集合学习方法聚合由活动示例选择算法构建的分类器。我们使用增量Naive Bayes分类器基于主动示例选择算法实现该集合学习方法。我们的经验结果表明,我们大大提高了五个现实世界不平衡生物医学数据培训的分类模型的性能。拟议的集合学习方法在解决不平衡数据问题的AUC方面优于0.03〜0.15的其他方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号