首页> 外文会议>Pacific-Asia Conference on Knowledge Discovery and Data Mining >CrowdTeacher: Robust Co-teaching with Noisy Answers and Sample-Specific Perturbations for Tabular Data
【24h】

CrowdTeacher: Robust Co-teaching with Noisy Answers and Sample-Specific Perturbations for Tabular Data

机译:Crowdteacher:具有嘈杂的答案和表格数据的特定样本扰动的强大共同教学

获取原文

摘要

Samples with ground truth labels may not always be available in numerous domains. While learning from crowdsourcing labels has been explored, existing models can still fail in the presence of sparse, unreliable, or differing annotations. Co-teaching methods have shown promising improvements for computer vision problems with noisy labels by employing two classifiers trained on each others' confident samples in each batch. Inspired by the idea of separating confident and uncertain samples during the training process, we extend it for the crowdsourcing problem. Our model, CrowdTeacher, uses the idea that perturbation in the input space model can improve the robustness of the classifier for noisy labels. Treating crowdsourcing annotations as a source of noisy labeling, we perturb samples based on the certainty from the aggregated annotations. The perturbed samples are fed to a Co-teaching algorithm tuned to also accommodate smaller tabular data. We showcase the boost in predictive power attained using CrowdTeacher for both synthetic and real datasets across various label density settings. Our experiments reveal that our proposed approach beats baselines modeling individual annotations and then combining them, methods simultaneously learning a classifier and inferring truth labels, and the Co-teaching algorithm with aggregated labels through common truth inference methods.
机译:具有地面真理标签的样本可能并不总是在许多域中可用。在探索了众群标签中学习的同时,现有模型仍然可能在稀疏,不可靠或不同的注释存在下失败。共同教学方法通过使用在每批中的每个分类机上采用训练的两个分类器,有希望改善嘈杂的标签。灵感来自在培训过程中分离自信和不确定样本的想法,我们将其扩展为众群问题。我们的模型,众人使用的想法,即扰动输入空间模型中的扰动可以提高嘈杂标签的分类器的稳健性。将众包注释视为嘈杂标签的源泉,我们根据聚合注释的确定性涉及样品。扰动的样本被馈送到调谐的共同教学算法,以适应更小的表格数据。我们在各种标签密度设置上展示使用Crowdteacher进行的预测电力达到预测电力的提升。我们的实验表明,我们所提出的方法击败基线建模个别注释,然后将它们组合,同时学习分类器和推断真理标签,以及通过常规真理推断方法与聚合标签的共同教学算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号