首页> 外文会议>International Workshop on Multiple Classifier Systems(MCS 2007); 20070523-25; Prague(CZ) >Exploiting Diversity in Ensembles: Improving the Performance on Unbalanced Datasets
【24h】

Exploiting Diversity in Ensembles: Improving the Performance on Unbalanced Datasets

机译:利用集成中的多样性:提高不平衡数据集的性能

获取原文
获取原文并翻译 | 示例

摘要

Ensembles are often capable of greater predictive performance than any of their individual classifiers. Despite the need for classifiers to make different kinds of errors, the majority voting scheme, typically used, treats each classifier as though it contributed equally to the group's performance. This can be particularly limiting on unbalanced datasets, as one is more interested in complementing classifiers that can assist in improving the true positive rate without signicantly increasing the false positive rate. Therefore, we implement a genetic algorithm based framework to weight the contribution of each classifier by an appropriate fitness function, such that the classifiers that complement each other on the unbalanced dataset are preferred, resulting in significantly improved performances. The proposed framework can be built on top of any collection of classifiers with different fitness functions.
机译:集成通常比其任何单个分类器都具有更好的预测性能。尽管需要分类器做出不同类型的错误,但通常使用的多数表决方案将每个分类器视为对分类器的贡献均等。这尤其可能限制不平衡的数据集,因为人们对补充分类器更为感兴趣,这些分类器可以帮助提高真实阳性率而不会显着增加错误阳性率。因此,我们实现了一个基于遗传算法的框架,以通过适当的适应度函数加权每个分类器的贡献,从而使在不平衡数据集上互补的分类器成为首选,从而显着提高了性能。所提出的框架可以建立在具有不同适应度函数的任何分类器集合之上。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号