首页> 外文会议>International Conference on Hybrid Artificial Intelligence Systems >Radial-Based Approach to Imbalanced Data Oversampling
【24h】

Radial-Based Approach to Imbalanced Data Oversampling

机译:基于径向的数据超采样方法

获取原文

摘要

The difficulty of the many practical decision problem lies in the nature of analyzed data. One of the most important real data characteristic is imbalance among examples from different classes. Despite more than two decades of research, imbalanced data classification is still one of the vital challenges to be addressed. The traditional classification algorithms display strongly biased performance on imbalanced datasets. One of the most popular way to deal with such a problem is to modify the learning set to decrease disproportion between objects from different classes using over- or undersampling approaches. In this work a novel preprocessing technique for imbalanced datasets is presented, which takes into consideration the mutual density class distribution. The proposed approach has been evaluated on the basis of the computer experiments carried out on the benchmark datasets. Their results seem to confirm the usefulness of the proposed concept in comparison to the state-of-art methods.
机译:许多实际决策问题的难度在于分析数据的性质。来自不同类别的示例中,最重要的真实数据特性之一是不平衡的。尽管有两十年的研究,但数据分类的不平衡仍然是所解决的重要挑战之一。传统的分类算法在不平衡数据集中显示了强烈偏置的性能。处理此类问题的最受欢迎方式之一是修改学习集,使用过采样或未采样方法来减少来自不同类的对象之间的歧化。在这项工作中,提出了一种用于实施数据集的新型预处理技术,其考虑了相互密度的类分布。已经根据基准数据集执行的计算机实验评估所提出的方法。与最先进的方法相比,它们的结果似乎确认了所提出的概念的有用性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号