首页> 外文会议>IEEE International Conference on Fuzzy Systems >Tackling label noise with multi-class decomposition using fuzzy one-class support vector machines
【24h】

Tackling label noise with multi-class decomposition using fuzzy one-class support vector machines

机译:使用模糊一类支持向量机进行多类分解处理标签噪声

获取原文

摘要

Class label noise is a data-level difficulty associated with training objects with incorrectly assigned labels. This problem may originate from poorly documented historic data, errors during data generation process or mistakes made by human experts. Inclusion of such examples during the training process will mislead the classifier by presenting a falsified class distribution and consequently lead to degradation of models' generalization abilities. This phenomenon becomes even more troublesome in multi-class scenarios that may be affected by highly complex intra-class noise. Decomposition strategies with binary classifiers were proven to alleviate this difficulty by using simplified binary subtasks that are less affected by the noise. In this paper we propose to extend this approach by using the one-class classification decomposition. In this scenario each class has assigned individual one-class classifier that aims at capturing its distinguishing characteristics. This allows to create a robust data description and then apply a dedicated classifier combination in order to reconstruct the original multi-class task. We further extend this concept by using fuzzy one-class classifiers that allow to associate membership values with each training objects. This allows us to reduce the influence of uncertain and potentially noisy samples on the shape of learned decision boundary. Experimental study backed-up with statistical analysis shows that fuzzy one-class classifier decomposition offers an excellent robustness to noise in multi-class classification.
机译:类标签噪声是与标签分配不正确的训练对象相关联的数据级别的困难。造成此问题的原因可能是历史文献记录少,数据生成过程中的错误或人类专家的错误。在训练过程中包含此类示例会通过提供伪造的类分布来误导分类器,从而导致模型的泛化能力下降。在可能受到高度复杂的类内噪声影响的多类场景中,此现象变得更加麻烦。事实证明,采用二进制分类器的分解策略可通过使用受噪声影响较小的简化二进制子任务来减轻此困难。在本文中,我们建议通过使用一类分类分解来扩展此方法。在这种情况下,每个类都分配了一个单独的一类分类器,旨在捕获其区别特征。这允许创建健壮的数据描述,然后应用专用的分类器组合以重构原始的多类任务。我们通过使用模糊的一类分类器进一步扩展了这一概念,该分类器允许将成员资格值与每个训练对象相关联。这使我们能够减少不确定样本和潜在噪声样本对学习的决策边界形状的影响。统计分析支持的实验研究表明,模糊一类分类器分解为多类分类中的噪声提供了出色的鲁棒性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号