首页> 外文会议>International Conference on Computational Performance Evaluation >A Review on Solution to Class Imbalance Problem: Undersampling Approaches
【24h】

A Review on Solution to Class Imbalance Problem: Undersampling Approaches

机译:类不平衡问题的解决方案回顾:欠采样方法

获取原文

摘要

The classification task carries a significant role in the field of effective data mining and numerous classification models are proposed over the years to carry out the job. However, standard classification models are sensitive to the underlying characteristics of the datasets. When employed to a dataset with skewed class distribution, standard classification models tend to misclassify the rare instances as it gets biased towards the majority patterns. This is where the issue of class imbalance makes it mark and causes to significantly degrade the performance of the standard classifiers. Among the several reported solutions for class imbalance issue, undersampling approaches are quite prevalent which offers to balance the class distribution by discarding insignificant majority instances. In this paper, an insight of class imbalance issue is presented in regard of its impact on classification models, the reported solutions and the effectiveness of the undersampling approaches in solving the issue.
机译:分类任务在有效的数据挖掘领域中起着重要作用,多年来,人们提出了许多分类模型来执行这项工作。但是,标准分类模型对数据集的基本特征很敏感。当将标准分类模型应用于具有偏斜的类分布的数据集时,由于它偏向多数模式而趋向于对稀有实例进行错误分类。这是类不平衡问题的标记,并导致显着降低标准分类器的性能。在针对类别不平衡问题的几种已报告解决方案中,欠采样方法非常普遍,可以通过丢弃无关紧要的多数实例来平衡类别分布。本文从类不平衡问题对分类模型的影响,所报告的解决方案以及欠采样方法解决该问题的有效性方面给出了见解。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号