...
首页> 外文期刊>IEEE Transactions on Knowledge and Data Engineering >Multi-Class Ground Truth Inference in Crowdsourcing with Clustering
【24h】

Multi-Class Ground Truth Inference in Crowdsourcing with Clustering

机译:聚类的众包中的多类地面真理推论

获取原文
获取原文并翻译 | 示例
           

摘要

Due to low quality of crowdsourced labelers, the integrated label of each example is usually inferred from its multiple noisy labels provided by different labelers. This paper proposes a novel algorithm, Ground Truth Inference using Clustering (GTIC), to improve the quality of integrated labels for multi-class labeling. For a labeling case, GTIC utilizes the multiple noisy label sets of examples to generate features. Then, it uses a K-Means algorithm to cluster all examples into different groups, each of which is mapped to a specific class. Examples in the same cluster are assigned a corresponding class label. We compare GTIC with four existing multi-class ground truth inference algorithms, majority voting (MV), Dawid & Skene's (DS), ZenCrowd (ZC) and Spectral DS (SDS), on one synthetic and eight real-world datasets. Experimental results show that the performance of GTIC is significantly superior to the others in terms of both accuracy and M-AUC. Besides, the running time of GTIC is about twenty times faster than EM-based complicated inference algorithms.
机译:由于众包标签的质量低下,每个示例的集成标签通常由不同标签提供的多个嘈杂标签来推断。本文提出了一种新的算法,即基于聚类的地面真理推论(GTIC),以提高用于多类标签的集成标签的质量。对于加标签的情况,GTIC利用示例的多个嘈杂标签集生成特征。然后,它使用K-Means算法将所有示例分为不同的组,每个组都映射到特定的类。在同一集群中的示例被分配了相应的类标签。我们将GTIC与四个现有的多类地面事实推理算法(多数投票(MV),Dawid&Skene's(DS),ZenCrowd(ZC)和Spectral DS(SDS))在一个合成数据集和八个真实数据集上进行了比较。实验结果表明,在准确性和M-AUC方面,GTIC的性能明显优于其他产品。此外,GTIC的运行时间比基于EM的复杂推理算法快约二十倍。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号