首页> 外文期刊>Frontiers of computer science in China >Evolutionary under-sampling based bagging ensemble method for imbalanced data classification
【24h】

Evolutionary under-sampling based bagging ensemble method for imbalanced data classification

机译:基于演化欠采样的装袋集成方法用于不平衡数据分类

获取原文
获取原文并翻译 | 示例
       

摘要

In the class imbalanced learning scenario, traditional machine learning algorithms focusing on optimizing the overall accuracy tend to achieve poor classification performance especially for the minority class in which we are most interested. To solve this problem, many effective approaches have been proposed. Among them, the bagging ensemble methods with integration of the under-sampling techniques have demonstrated better performance than some other ones including the bagging ensemble methods integrated with the over-sampling techniques, the cost-sensitive methods, etc. Although these under-sampling techniques promote the diversity among the generated base classifiers with the help of random partition or sampling for the majority class, they do not take any measure to ensure the individual classification performance, consequently affecting the achievability of better ensemble performance. On the other hand, evolutionary under-sampling EUS as a novel under-sampling technique has been successfully applied in searching for the best majority class subset for training a good-performance nearest neighbor classifier. Inspired by EUS, in this paper, we try to introduce it into the under-sampling bagging framework and propose an EUS based bagging ensemble method EUS-Bag by designing a new fitness function considering three factors to make EUS better suited to the framework. With our fitness function, EUS-Bag could generate a set of accurate and diverse base classifiers. To verify the effectiveness of EUS-Bag, we conduct a series of comparison experiments on 22 two-class imbalanced classification problems. Experimental results measured using recall, geometric mean and AUC all demonstrate its superior performance.
机译:在班级不平衡的学习场景中,专注于优化整体准确性的传统机器学习算法往往会实现较差的分类性能,尤其是对于我们最感兴趣的少数班级。为了解决这个问题,已经提出了许多有效的方法。其中,集成了欠采样技术的装袋合奏方法表现出比包括过采样技术的集成装袋法,成本敏感方法等其他方法更好的性能。尽管这些欠采样技术借助随机划分或多数类抽样来促进生成的基本分类器之间的多样性,它们没有采取任何措施来确保个体分类性能,因此影响了更好的整体演奏的可实现性。另一方面,作为一种新型欠采样技术的进化欠采样EUS已成功应用于寻找最佳多数类子集以训练性能良好的最近邻分类器。受EUS的启发,本文尝试将EUS引入欠采样装袋框架,并通过考虑三个因素设计一种新的适应度函数,提出一种基于EUS的装袋集成方法EUS-Bag,以使EUS更适合该框架。借助我们的适应度函数,EUS-Bag可以生成一组准确而多样的基础分类器。为了验证EUS-Bag的有效性,我们对22个两类不平衡分类问题进行了一系列比较实验。使用召回率,几何平均值和AUC测得的实验结果均证明了其优越的性能。

著录项

  • 来源
    《Frontiers of computer science in China》 |2018年第2期|331-350|共20页
  • 作者单位

    College of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics, Nanjing 210016,China,National Key Lab of ATFM, Nanjing University of Aeronautics and Astronautics, Nanjing 211106, China;

    College of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics, Nanjing 210016,China,National Key Lab of ATFM, Nanjing University of Aeronautics and Astronautics, Nanjing 211106, China;

    College of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics, Nanjing 210016,China;

    National Key Lab of ATFM, Nanjing University of Aeronautics and Astronautics, Nanjing 211106, China;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    class imbalanced problem; under-sampling; bagging; evolutionary under-sampling; ensemble learning; machine learning; data mining;

    机译:阶级失衡问题;欠采样;套袋进化欠采样;整体学习;机器学习数据挖掘;
  • 入库时间 2022-08-17 23:17:52

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号