首页> 外文期刊>Biometrics, IET >Automated cleaning of identity label noise in a large face dataset with quality control
【24h】

Automated cleaning of identity label noise in a large face dataset with quality control

机译:通过质量控制自动清除大型面部数据集中的身份标签噪声

获取原文
获取原文并翻译 | 示例
           

摘要

For face recognition, some very large-scale datasets are publicly available in recent years, which are usually collected from the Internet using search engines, and thus have many faces with wrong identity (ID) labels (outliers). Additionally, the face images in these datasets have different qualities because of uncontrolled situations. The authors propose a novel approach for cleaning the ID label error, handling face images in different qualities. The face ID labels cleaned by their method can train better models for low-quality face recognition since more low-quality images are correctly labelled for training a deep model. In their low-to-high-quality face verification experiments, the deep model trained on their cleaning results of MS-Celeb-1M.v1 face dataset outperforms the same model trained on the same dataset cleaned by the semantic bootstrapping method. They also apply their ID label cleaning method on a subset of the cross-age celebrity dataset (CACD) face dataset, in which their quality-based cleaning can deliver higher precision and recall than a previous method on detecting the ID label errors.
机译:对于人脸识别,近年来公开了一些非常大规模的数据集,这些数据集通常是使用搜索引擎从Internet收集的,因此有许多带有错误标识(ID)标签的人脸(异常值)。此外,由于不受控制的情况,这些数据集中的面部图像具有不同的质量。作者提出了一种新颖的方法来清除ID标签错误,处理不同质量的面部图像。用他们的方法清洗的人脸ID标签可以训练出更好的模型来进行低质量的人脸识别,因为正确地标记了更多低质量的图像来训练深度模型。在他们的低到高质量面部验证实验中,针对其清洁结果MS-Celeb-1M.v1面部数据集训练的深度模型优于对通过语义引导方法清除的同一数据集训练的相同模型。他们还将ID标签清洁方法应用于跨年龄名人数据集(CACD)面部数据集的子集,其中基于质量的清洁方法可以比以前的检测ID标签错误的方法提供更高的精度和召回率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号