首页> 外文会议>International Conference on Computing and Convergence Technology >A modified error function for imbalanced dataset classification problem
【24h】

A modified error function for imbalanced dataset classification problem

机译:一个修改的错误数据集分类问题

获取原文

摘要

The objective of learning is to achieve the least error rate. In this paper we proposed a modified cost function as a means to properly measure error rate for imbalanced dataset. Most cost functions apply the same weights to all classes. However, it has been known that for imbalanced problem, the number of instances in the majority class is larger than the minority class. Therefore, the application of equal weight to all classes will significantly lead to improper classification boundary. That is, for most learning model, the minority class would be dominated by majority class which then causes a misclassification on the minority class. The objective of this paper is to find the appropriate parameters to improve MSE cost function based on overlap ratio and class distribution ratio. Back-propagation algorithm with the proposed modified cost function is used to solve two-class classification problem. UCI datasets are used for the experimentation. The results show that the modified MSE cost function provides a better result than the standard one, based on True-positive rate, G-Mean, and F-measurement.
机译:学习的目标是达到最小的错误率。在本文中,我们提出了一种修改的成本函数,作为正确测量不平衡数据集的错误率的手段。大多数成本函数对所有类应用相同的权重。然而,已经知道,对于不平衡的问题,大多数类中的实例数大于少数类。因此,对所有类别的平等权重的应用将显着导致分类边界不当。也就是说,对于大多数学习模式,少数阶层将由多数阶级主导,然后少数阶级造成少数阶级的错误分类。本文的目的是找到适当的参数,以改善基于重叠比和类分配比率的MSE成本函数。使用所提出的修改成本函数的反向传播算法用于解决两类分类问题。 UCI数据集用于实验。结果表明,基于真正阳性速率,G平均值和F测量,修改的MSE成本函数提供比标准的成本更好的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号