...
首页> 外文期刊>Progress in Artificial Intelligence >Automated cleaning of identity label noise in a large face dataset with quality control
【24h】

Automated cleaning of identity label noise in a large face dataset with quality control

机译:在具有质量控制的大面对数据集中自动清洁身份标签噪声

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

For face recognition, some very large-scale datasets are publicly available in recent years, which are usually collected from the Internet using search engines, and thus have many faces with wrong identity (ID) labels (outliers). Additionally, the face images in these datasets have different qualities because of uncontrolled situations. The authors propose a novel approach for cleaning the ID label error, handling face images in different qualities. The face ID labels cleaned by their method can train better models for low-quality face recognition since more low-quality images are correctly labelled for training a deep model. In their low-to-high-quality face verification experiments, the deep model trained on their cleaning results of MS-Celeb-1M.v1 face dataset outperforms the same model trained on the same dataset cleaned by the semantic bootstrapping method. They also apply their ID label cleaning method on a subset of the cross-age celebrity dataset (CACD) face dataset, in which their quality-based cleaning can deliver higher precision and recall than a previous method on detecting the ID label errors.
机译:对于面部识别,近年来一些非常大规模的数据集通常可以使用搜索引擎从互联网收集,因此有许多具有错误身份(ID)标签(异常值)的面孔。另外,由于不受控制的情况,这些数据集中的面部图像具有不同的质量。作者提出了一种清洁ID标签错误的新方法,处理不同品质的面部图像。通过其方法清洁的面部ID标签可以培训更好的模型以获得低质量的面部识别,因为更多的低质量图像被正确标记用于训练深层模型。在其低于高质量的面部验证实验中,深入模型培训的MS-CeleB-1M.v1面部数据集的清洁结果胜过同一模型在由语义引导方法清洁的同一数据集上培训的相同模型。它们还在串行名人数据集(CACD)面部数据集的子集上应用其ID标签清洁方法,其中基于质量的清洁可以提供比以前的方法检测ID标签错误的方法更高的精度和召回。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号