...
首页> 外文期刊>Cybernetics, IEEE Transactions on >Evolutionary Cluster-Based Synthetic Oversampling Ensemble (ECO-Ensemble) for Imbalance Learning
【24h】

Evolutionary Cluster-Based Synthetic Oversampling Ensemble (ECO-Ensemble) for Imbalance Learning

机译:基于进化聚类的不平衡学习综合过采样集合(ECO集合)

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Class imbalance problems, where the number of samples in each class is unequal, is prevalent in numerous real world machine learning applications. Traditional methods which are biased toward the majority class are ineffective due to the relative severity of misclassifying rare events. This paper proposes a novel evolutionary cluster-based oversampling ensemble framework, which combines a novel cluster-based synthetic data generation method with an evolutionary algorithm (EA) to create an ensemble. The proposed synthetic data generation method is based on contemporary ideas of identifying oversampling regions using clusters. The novel use of EA serves a twofold purpose of optimizing the parameters of the data generation method while generating diverse examples leveraging on the characteristics of EAs, reducing overall computational cost. The proposed method is evaluated on a set of 40 imbalance datasets obtained from the University of California, Irvine, database, and outperforms current state-of-the-art ensemble algorithms tackling class imbalance problems.
机译:在许多现实世界的机器学习应用程序中普遍存在类不平衡问题,其中每个类中的样本数不相等。由于对稀有事件进行错误分类的相对严重性,偏向多数类的传统方法无效。本文提出了一种新颖的基于聚类的进化过采样集成框架,该框架将基于聚类的新型合成数据生成方法与进化算法(EA)相结合来创建一个集成。所提出的合成数据生成方法基于使用聚类识别过采样区域的当代思想。 EA的新颖用途具有双重目的:优化数据生成方法的参数,同时利用EA的特性生成各种示例,从而降低总体计算成本。在从加州大学欧文分校的数据库中获得的40个失衡数据集上对提出的方法进行了评估,其性能优于解决类失衡问题的最新集成算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号