首页> 外文会议>IFIP TC8 international conference on computer information systems and industrial management >Granular Computing and Parameters Tuning in Imbalanced Data Preprocessing
【24h】

Granular Computing and Parameters Tuning in Imbalanced Data Preprocessing

机译:不平衡数据预处理中的粒度计算和参数调整

获取原文

摘要

Selective preprocessing, representing data-level approach to the imbalanced data problem, is one of the most successful methods. This paper introduces novel algorithm combining this kind of technique with the filtering phase. The information granules are formed to distinguish specific types of positive examples that should be adequately treated. Three modes of oversampling, dedicated to minority class instances placed in specific areas of the feature space, are available. The rough set theory is applied to filter and remove inconsistencies from the generated positive samples. The experimental study shows that proposed method in most cases obtains better or similar performance of standard classifiers, such as C4.5 decision tree, in comparison with other techniques. Additionally, multiple values of algorithm's parameters are evaluated. It is experimentally proven that two of the examined parameters values are the most appropriate to various applications. However, the automatic parameters tuning, based on the specific requirements of different data distributions, is recommended.
机译:选择性预处理代表了解决不平衡数据问题的数据级方法,是最成功的方法之一。本文介绍了一种将这种技术与滤波阶段相结合的新颖算法。形成信息颗粒以区分应适当处理的阳性样本的具体类型。提供了三种过采样模式,专用于放置在要素空间特定区域中的少数类实例。粗糙集理论适用于从生成的正样本中过滤和消除不一致性。实验研究表明,与其他技术相比,该方法在大多数情况下可以获得更好或相似的标准分类器性能,例如C4.5决策树。另外,评估算法参数的多个值。实验证明,所检查的两个参数值最适合各种应用。但是,建议根据不同数据分布的特定要求进行自动参数调整。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号