首页> 外文会议>ICPR 2012;International Conference on Pattern Recognition >Label-noise reduction with support vector machines
【24h】

Label-noise reduction with support vector machines

机译:使用支持向量机减少标签噪声

获取原文

摘要

The problem of detection of label-noise in large datasets is investigated. We consider applications where data are susceptible to label error and a human expert is available to verify a limited number of such labels in order to cleanse the data. We show the support vectors of a Support Vector Machine (SVM) contain almost all of these noisy labels. Therefore, the verification of support vectors allows efficient cleansing of the data. Empirical results are presented for two experiments. In the first experiment, two datasets from the character recognition domain are used and artificial random noise is applied in their labeling. In the second experiment, a large dataset of plankton images, that contains inadvertent human label error, is considered. It is shown that up to 99% of all label-noise from such datasets can be detected by verifying just the support vectors of the SVM classifier.
机译:研究了大型数据集中标签噪声的检测问题。我们考虑在数据容易出现标签错误的情况下使用的应用程序,并且专业人员可以验证有限数量的此类标签以清理数据。我们展示了支持向量机(SVM)的支持向量几乎包含所有这些嘈杂的标签。因此,支持向量的验证允许有效清除数据。给出了两个实验的经验结果。在第一个实验中,使用了来自字符识别域的两个数据集,并将人工随机噪声应用于它们的标记。在第二个实验中,考虑了一个大型浮游生物图像数据集,其中包含无意的人类标签错误。结果表明,仅验证SVM分类器的支持向量,就可以检测出此类数据集中多达99%的标签噪声。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号