首页> 外文期刊>Mathematical Problems in Engineering >Classifying Imbalanced Data Sets by a Novel RE-Sample and Cost-Sensitive Stacked Generalization Method
【24h】

Classifying Imbalanced Data Sets by a Novel RE-Sample and Cost-Sensitive Stacked Generalization Method

机译:通过一种新颖的RE样本和成本敏感的堆叠泛化方法对不平衡数据集进行分类

获取原文
获取原文并翻译 | 示例
       

摘要

Learning with imbalanced data sets is considered as one of the key topics in machine learning community. Stacking ensemble is an efficient algorithm for normal balance data sets. However, stacking ensemble was seldom applied in imbalance data. In this paper, we proposed a novel RE-sample and Cost-Sensitive Stacked Generalization (RECSG) method based on 2-layer learning models. The first step is Level 0 model generalization including data preprocessing and base model training. The second step is Level 1 model generalization involving cost-sensitive classifier and logistic regression algorithm. In the learning phase, preprocessing techniques can be embedded in imbalance data learning methods. In the cost-sensitive algorithm, cost matrix is combined with both data characters and algorithms. In the RECSG method, ensemble algorithm is combined with imbalance data techniques. According to the experiment results obtained with 17 public imbalanced data sets, as indicated by various evaluation metrics (AUC, Geo Mean, and AGeoMean), the proposed method showed the better classification performances than other ensemble and single algorithms. The proposed method is especially more efficient when the performance of base classifier is low. All these demonstrated that the proposed method could be applied in the class imbalance problem.
机译:使用不平衡数据集进行学习被视为机器学习社区中的关键主题之一。堆叠合奏是一种用于法线余额数据集的有效算法。但是,在不平衡数据中很少应用堆叠合奏。在本文中,我们提出了一种新颖的基于2层学习模型的RE样本和成本敏感堆栈泛化(RECSG)方法。第一步是0级模型概括,包括数据预处理和基础模型训练。第二步是涉及成本敏感分类器和逻辑回归算法的1级模型概括。在学习阶段,可以将预处理技术嵌入不平衡数据学习方法中。在成本敏感算法中,成本矩阵与数据特征和算法结合在一起。在RECSG方法中,集成算法与不平衡数据技术相结合。根据各种评估指标(AUC,Geo Mean和AGeoMean)显示的从17个公共不平衡数据集获得的实验结果,提出的方法显示出比其他整体和单个算法更好的分类性能。当基本分类器的性能较低时,该方法特别有效。所有这些证明了所提出的方法可以应用于类不平衡问题。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号