首页> 外文会议>2012 IEEE 3rd International Conference on Software Engineering and Service Science >A Synthesized Sampling Approach for Improving the Prediction of Imbalanced Classification
【24h】

A Synthesized Sampling Approach for Improving the Prediction of Imbalanced Classification

机译:一种用于改进不平衡分类预测的综合抽样方法

获取原文
获取原文并翻译 | 示例

摘要

Imbalanced dataset is an important factor influencing the effect of learning algorithms. Its influence on the classification learner is even more universal. To deal with imbalanced classification problem, sampling strategy is always an efficient method, however some other aspects of this strategy need to be solved. What distribution should be regulated among the classes and within the class? Which sampling strategy, over-sampling or under-sampling, is more acceptable in specific issues? What metric should be used to measure the classification results? In this paper we propose a general rule to select sampling strategy and design a novel metric -measure, putting more attention to the minority. As for the distribution between the classes our choice of them is based on the standard whether the selected distribution will lead to significant improvement of the evaluation criteria.
机译:数据集不平衡是影响学习算法效果的重要因素。它对分类学习器的影响更为普遍。为了解决分类不平衡的问题,采样策略一直是一种有效的方法,但是该策略的其他一些方面也需要解决。在班级之间和班级内部应规定什么分布?在特定问题上,哪种采样策略(过度采样或欠采样)更可接受?应该使用什么度量标准来度量分类结果?在本文中,我们提出了选择抽样策略并设计新的度量标准的一般规则,将更多的注意力放在少数群体上。至于班级之间的分布,我们根据标准进行选择是基于所选的分布是否会导致评估标准的显着改善。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号