首页> 外文期刊>Multimedia Tools and Applications >KA-Ensemble: towards imbalanced image classification ensembling under-sampling and over-sampling
【24h】

KA-Ensemble: towards imbalanced image classification ensembling under-sampling and over-sampling

机译:KA-Ensemble:迈向非衡度的图像分类合并,在抽样和过度采样

获取原文
获取原文并翻译 | 示例
           

摘要

Imbalanced learning has become a research emphasis in recent years because of the growing number of class-imbalance classification problems in real applications. It is particularly challenging when the imbalanced rate is very high. Sampling, including under-sampling and over-sampling, is an intuitive and popular way in dealing with class-imbalance problems, which tries to regroup the original dataset and is also proved to be efficient. The main deficiency is that under-sampling methods usually ignore many majority class examples while over-sampling methods may easily cause over-fitting problem. In this paper, we propose a new algorithm dubbed KA-Ensemble ensembling under-sampling and over-sampling to overcome this issue. Our KA-Ensemble explores EasyEnsemble framework by under-sampling the majority class randomly and over-sampling the minority class via kernel based adaptive synthetic (Kernel-ADASYN) at meanwhile, yielding a group of balanced datasets to train corresponding classifiers separately, and the final result will be voted by all these trained classifiers. Through combining under-sampling and over-sampling in this way, KA-Ensemble is good at solving class-imbalance problems with large imbalanced rate. We evaluated our proposed method with state-of-the-art sampling methods on 9 image classification datasets with different imbalanced rates ranging from less than 2 to more than 15, and the experimental results show that our KA-Ensemble performs better in terms of accuracy (ACC), F-Measure, G-Mean, and area under curve (AUC). Moreover, it can be used in both dichotomy and multi-classification on both image classification and other class-imbalance problems.
机译:由于实际应用中的越来越多的类别不平衡分类问题,但近年来,学习已经成为一项重点。当不平衡的速率非常高时,它特别具有挑战性。在包括抽样和过度采样的采样是一种在处理类别不平衡问题的直观和流行的方式,这试图重新组合原始数据集,并且也被证明是有效的。主要缺陷是,在取样方法中通常忽略许多多数类示例,而过采样方法可能很容易导致过度拟合问题。在本文中,我们提出了一种新的算法被称为KA-Seaneleble合奏的底层,并过度采样来克服这个问题。我们的ka-senemble通过随机地通过内核基于的自适应合成(Kernel-Adasyn)随机上采样少数级别,通过内核基于群体类进行了对少数群体类进行了采样而探讨了Easy ensemble框架,同时产生了一组平衡数据集,分别培训相应的分类器,并最终结果将由所有这些训练有素的分类器投票。通过以这种方式结合取样和过度采样,KA-Ensemble擅长解决具有较大不平衡率的类别不平衡问题。我们在9个图像分类数据集上评估了我们所提出的方法,以9个图像分类数据集,不同的不平衡率范围不到2到15多个,实验结果表明,我们的KA集合在准确性方面表现更好(ACC),F测量,G均值和曲线下的区域(AUC)。此外,它可以用于二分法和多分类的图像分类和其他类别不平衡问题。

著录项

  • 来源
    《Multimedia Tools and Applications》 |2020年第22期|14871-14888|共18页
  • 作者单位

    Department of Electronic Engineering College of Information Science and Engineering Ocean University of China Qingdao 266100 China;

    Shandong Key Laboratory of Digital Medicine and Computer Assisted Surgery The Affiliated Hospital of Qingdao University Qingdao 266003 China;

    Department of Electronic Engineering College of Information Science and Engineering Ocean University of China Qingdao 266100 China;

    Department of Electronic Engineering College of Information Science and Engineering Ocean University of China Qingdao 266100 China;

    Department of Electronic Engineering College of Information Science and Engineering Ocean University of China Qingdao 266100 China Department of Mathematics. School of Science and Engineering University of Dundee Dundee DD1 4HN UK;

    Department of Electronic Engineering College of Information Science and Engineering Ocean University of China Qingdao 266100 China;

    College of Mechanical and Electrical Engineering Qingdao Agricultural University Qingdao 266109 China;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Class-imbalance learning; Under-sampling; Over-sampling; Ensemble learning; Image classification;

    机译:类别不平衡学习;欠抽样;过度抽样;合奏学习;图像分类;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号