首页> 外文期刊>Procedia Computer Science >Classifier Ensemble Design for Imbalanced Data Classification: A Hybrid Approach
【24h】

Classifier Ensemble Design for Imbalanced Data Classification: A Hybrid Approach

机译:不平衡数据分类的分类器集合设计:一种混合方法

获取原文
           

摘要

Imbalanced learning for classification problems is the active area of research in machine learning. Many classification systems like image retrieval and credit scoring systems have imbalanced distribution of training data sets which causes performance degradation of the classifier. Re-sampling of imbalanced data is commonly used to handle imbalanced distribution as it is independent of the classifier being used. But sometimes they can remove necessary data of the class or can cause over-fitting. Classifier Ensembles have recently achieved more attention as effective technique to handle skewed data. The focus of the work is to gain advantages of both data level and classifier ensemble approach in order to improve the classification performance. We present a novel approach that initially applies pre-processing to the imbalanced dataset in order to reduce the imbalance between the classes. The pre-processed data is provided as training dataset to the classifier ensemble that introduces diversity by using different training datasets as well as different classifier models. The experimentation conducted on the eight imbalanced datasets from KEEL repository helps to prove the significance of the proposed method. A comparative analysis shows the performance improvement in terms of Area under ROC Curve (AUC).
机译:分类问题的不平衡学习是机器学习研究的活跃领域。许多分类系统(例如图像检索和信用评分系统)的训练数据集分布不平衡,这会导致分类器的性能下降。不平衡数据的重新采样通常用于处理不平衡分布,因为它与所使用的分类器无关。但是有时它们可​​能会删除该类的必要数据,或者会导致过度拟合。分类器集成作为一种处理倾斜数据的有效技术,最近已引起更多关注。工作的重点是获得数据级别和分类器集成方法的优势,以提高分类性能。我们提出了一种新颖的方法,该方法最初将预处理应用于不平衡数据集,以减少类之间的不平衡。预处理后的数据作为训练数据集提供给分类器集合,分类器集合通过使用不同的训练数据集和不同的分类器模型来引入多样性。对来自KEEL存储库的八个不平衡数据集进行的实验有助于证明该方法的重要性。对比分析显示,在ROC曲线下面积(AUC)方面,性能有所提高。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号