【24h】

Learning with Skewed Class Distributions

机译:学习偏斜的班级分布

获取原文
获取原文并翻译 | 示例

摘要

Several aspects may influence the performance achieved by a classifier created by a Machine Learning system. One of these aspects is related to the difference between the numbers of examples belonging to each class. When this difference is large, the learning system may have difficulties to learn the concept related to the minority class. In this work, we discuss several issues related to learning with skewed class distributions, such as the relationship between cost-sensitive learning and class distributions, and the limitations of accuracy and error rate to measure the performance of classifiers. Also, we survey some methods proposed by the Machine Learning community to solve the problem of learning with unbalanced data sets, and discuss some limitations of these methods.
机译:多个方面可能会影响由机器学习系统创建的分类器所实现的性能。这些方面之一与属于每个类别的示例数量之间的差异有关。当差异很大时,学习系统可能难以学习与少数群体有关的概念。在这项工作中,我们讨论了与偏态班级分布的学习有关的几个问题,例如成本敏感型学习与班级分布之间的关系,以及衡量分类器性能的准确性和错误率的局限性。此外,我们调查了机器学习社区提出的一些解决不平衡数据集学习问题的方法,并讨论了这些方法的一些局限性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号