首页> 外文会议>2017 2nd International Conference on Computational Systems and Information Technology for Sustainable Solution >Hybrid Methods for Class Imbalance Learning Employing Bagging with Sampling Techniques
【24h】

Hybrid Methods for Class Imbalance Learning Employing Bagging with Sampling Techniques

机译:运用套袋技术的班级不平衡学习的混合方法

获取原文
获取原文并翻译 | 示例

摘要

Class imbalance classification has become a dominant problem in supervised learning. The bias of majority class instances dominates in quantity over minority class instances in imbalanced datasets, which produce the suboptimal classification results for classifying the minority class instances. In the last decade, several methods including sampling techniques, cost-sensitive learning, and ensemble methods have been introduced for dealing with class imbalance classification. Among all the methods, the ensemble method performs better in compare with sampling and cost-sensitive learning. The ensemble learning uses sampling technique (either under-sampling or oversampling) with bagging or boosting algorithms. However, which sampling techniques will work better with ensemble learning to improve class imbalance is extremely depend on problem domains. In this paper, we propose two bagging based methods: (a) ADASYNBagging, and (b) RSYNBagging for dealing with imbalanced classification. The ADASYNBagging uses ADASYN based over-sample technique with bagging algorithm. On the contrary, the RSYNBagging uses random under-sampling and ADASYN based over-sample technique with bagging algorithm. RSYNBagging utilizes both under-sampling and over-sampling in alternate iterations and thus incorporates the advantages of both techniques without introducing any extra parameter to tune or increasing time complexity. We have tested the performance of our proposed ADASYNBagging and RSYNBagging methods against existing best performing methods Underbagging, SMOTEBagging on 11 benchmark imbalanced datasets and the initial results are strongly encouraging.
机译:班级不平衡分类已成为监督学习中的主要问题。在不平衡数据集中,多数类实例的偏见在数量上要优于少数类实例,这为分类少数类实例产生了次优的分类结果。在过去的十年中,已经引入了几种方法来处理类别不平衡分类,包括抽样技术,成本敏感型学习和集成方法。在所有方法中,与抽样和成本敏感型学习相比,集成方法表现更好。集成学习使用带有装袋算法或增强算法的采样技术(欠采样或过采样)。但是,哪种采样技术可以更好地与集成学习一起使用,以改善班级的不平衡状况,这在很大程度上取决于问题领域。在本文中,我们提出了两种基于装袋的方法:(a)ADASYNBagging和(b)RSYNBagging用于处理不平衡分类。 ADASYNBagging使用基于ADASYN的过采样技术和装袋算法。相反,RSYNBagging使用随机欠采样和基于ADASYN的过采样技术以及袋装算法。 RSYNBagging在交替迭代中同时使用了欠采样和过采样,因此结合了这两种技术的优点,而无需引入任何额外参数来调整或增加时间复杂度。我们已经针对11个基准不平衡数据集测试了我们提出的ADASYNBagging和RSYNBagging方法相对于现有最佳性能方法Underbag,SMOTEBagging的性能,并且初步结果令人鼓舞。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号