首页> 外文会议>Iberoamerican congress on pattern recognition >Restricted Decontamination for the Imbalanced Training Sample Problem
【24h】

Restricted Decontamination for the Imbalanced Training Sample Problem

机译:限制净化培训样本问题的净化

获取原文

摘要

The problem of imbalanced training data in supervised methods is currently receiving growing attention. Imbalanced data means that one class is much more represented than the others in the training sample. It has been observed that this situation, which arises in several practical domains, may produce an important deterioration of the classification accuracy, in particular with patterns belonging to the less represented classes. In the present paper, we report experimental results that point at the convenience of correctly downsizing the majority class while simultaneously increasing the size of the minority one in order to balance both classes. This is obtained by applying a modification of the previously proposed Decontamination methodology. Combination of this proposal with the employment of a weighted distance function is also explored.
机译:监督方法中不平衡培训数据的问题目前正在接受不断的关注。不平衡数据意味着一个类比训练样本中的其他类更具代表。已经观察到这种情况下出现在几个实际域中,可能产生分类精度的重要恶化,特别是属于代表的类的模式。在本文中,我们报告了在正确缩小多数阶级的方便的实验结果,同时增加少数群体的大小以平衡两个类。这是通过应用先前提出的去污方法的修改来获得的。还探讨了该提案的组合,并探讨了加权距离功能的建议。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号