首页> 外文期刊>Knowledge-Based Systems >An empirical comparison on state-of-the-art multi-class imbalance learning algorithms and a new diversified ensemble learning scheme
【24h】

An empirical comparison on state-of-the-art multi-class imbalance learning algorithms and a new diversified ensemble learning scheme

机译:最新的多类不平衡学习算法与新型多元化集成学习方案的经验比较

获取原文
获取原文并翻译 | 示例
       

摘要

Class-imbalance learning is one of the most challenging problems in machine learning. As a new and important direction in this field, multi-class imbalanced data classification has attracted a great many research focus in recent years. In this paper, we first make a very comprehensive review on state-of-the-art classification algorithms for multi-class imbalanced data. Moreover, we propose a new multi-class imbalance classification algorithm, which is hereafter referred to as the Diversified Error Correcting Output Codes (DECOC) method. The main idea of DECOC is to combine the improved ECOC (Error Correcting Output Codes) method for tackling class imbalance, and the diversified ensemble learning framework, which finds the best classification algorithm (out of many heterogeneous classification algorithms) for each individual sub-dataset resampled from the original data. We conduct experiments on 19 public datasets to empirically compare the performance of DECOC with 17 state-of-the-art multi-class imbalance learning algorithms, using 4 different accuracy measures: overall accuracy, Geometric mean, F-measure, and Area Under Curve. Experimental results demonstrate that DECOC achieves significantly better accuracy performance than the other 17 algorithms on these accuracy metrics. To advance research in this field, we make all the source codes of DECOC and the above-mentioned 17 state-of-the-art algorithms for imbalanced data classification be available at GitHub:https://github.com/chongshengzhang/Multi_Imbalance.
机译:类不平衡学习是机器学习中最具挑战性的问题之一。作为该领域的一个新的重要方向,近几年来,多类不平衡数据分类吸引了许多研究重点。在本文中,我们首先对多类不平衡数据的最新分类算法进行非常全面的回顾。此外,我们提出了一种新的多类不平衡分类算法,此算法在下文中称为“多种纠错输出码”(DECOC)方法。 DECOC的主要思想是将改进的ECOC(纠错输出代码)方法用于解决类不平衡问题,并将多元化的集成学习框架相结合,从而为每个单独的子数据集找到最佳的分类算法(在许多异构分类算法中)从原始数据中重新采样。我们在19个公共数据集上进行了实验,以实验方式将DECOC与17种最新的多类不平衡学习算法的性能进行比较,并使用4种不同的精度度量:整体精度,几何平均值,F度量和曲线下面积。实验结果表明,在这些准确度指标上,DECOC的准确度性能明显优于其他17种算法。为了推动这一领域的研究,我们在GitHub上提供了DECOC的所有源代码和上述17种用于不平衡数据分类的最新算法:https://github.com/chongshengzhang/Multi_Imbalance。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号