首页> 外文期刊>Information Sciences: An International Journal >SMOTE-IPF: Addressing the noisy and borderline examples problem in imbalanced classification by a re-sampling method with filtering
【24h】

SMOTE-IPF: Addressing the noisy and borderline examples problem in imbalanced classification by a re-sampling method with filtering

机译:SMOTE-IPF:通过带过滤的重采样方法解决不平衡分类中的嘈杂和边界示例问题

获取原文
获取原文并翻译 | 示例
           

摘要

Classification datasets often have an unequal class distribution among their examples. This problem is known as imbalanced classification. The Synthetic Minority Over-sampling Technique (SMOTE) is one of the most well-know data pre-processing methods to cope with it and to balance the different number of examples of each class. However, as recent works claim, class imbalance is not a problem in itself and performance degradation is also associated with other factors related to the distribution of the data. One of these is the presence of noisy and borderline examples, the latter lying in the areas surrounding class boundaries. Certain intrinsic limitations of SMOTE can aggravate the problem produced by these types of examples and current generalizations of SMOTE are not correctly adapted to their treatment.
机译:分类数据集之间的类分布通常不相等。此问题称为不平衡分类。综合少数族裔过采样技术(SMOTE)是最知名的数据预处理方法之一,可以应对这种情况并平衡每个类别的不同示例数量。但是,正如最近的工作所声称的那样,类不平衡本身并不是问题,而且性能下降还与其他与数据分布有关的因素有关。其中之一是存在嘈杂的边界示例,后者位于阶级边界周围的区域。 SMOTE的某些固有局限性会加剧由这些类型的示例产生的问题,并且当前对SMOTE的概括未正确地适用于其处理。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号