首页> 外文会议>Proceedings of the 2007 International Conference on Machine Learning and Cybernetics >A WEIGHTED ROUGH SET METHOD TO ADDRESS THE CLASS IMBALANCE PROBLEM
【24h】

A WEIGHTED ROUGH SET METHOD TO ADDRESS THE CLASS IMBALANCE PROBLEM

机译:解决类不平衡问题的加权粗糙集方法

获取原文
获取外文期刊封面目录资料

摘要

The class imbalance problem has been said recently to hinder the performance of learning systems.Most of traditional learning algorithms are designed with the assumption of well-balanced datasets, and are biased towards the majority class and thus may predict poorly the minority class examples.In this paper, we develop weighted rough sets (WRS) to deal with this problem.In weighted rough sets, weighted entropy is introduced and extended to compute the information content introduced by attributes.A forward greedy weighted attribute reduction algorithm based on the weighted entropy and a weighted rule extraction algorithm are provided.The factors of weighted strength, weighted certainty and weighted cover are employed to evaluate the extracted rules.Finally, a decision algorithm based on the weighted strength factor is constructed.Based on weighted rough sets, a series of experiments on class imbalance learning are conducted on 20 LCI data sets.In the meaning of AUC and minority class accuracy, WRS achieves the better results than classical rough set in class imbalance learning.Moreover, the evaluation of extracted rules has greater influence than the selection of attributes on weighted rough set learning.
机译:最近有人说班级不平衡问题阻碍了学习系统的性能。大多数传统的学习算法都是在假设数据集均衡的前提下设计的,并且偏向多数班级,因此可能无法预测少数班级的例子。针对这一问题,本文开发了加权粗糙集(WRS)。在加权粗糙集中,引入并扩展了加权熵,以计算出属性引入的信息量。给出了加权规则提取算法。采用加权强度,加权确定性和加权覆盖率等因子对提取的规则进行评估。最后,构造了基于加权强度因子的决策算法。在20个LCI数据集上进行了课堂失衡学习的实验。与传统的粗糙集相比,WRS在班级不平衡学习中具有更好的效果。此外,对加权粗糙集学习而言,提取规则的评估比属性选择具有更大的影响力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号