Multi-Class Ground Truth Inference in Crowdsourcing with Clustering

Jing Zhang; Victor S. Sheng; Jian Wu; Xindong Wu

首页> 外文期刊>IEEE Transactions on Knowledge and Data Engineering >Multi-Class Ground Truth Inference in Crowdsourcing with Clustering

【24h】

Multi-Class Ground Truth Inference in Crowdsourcing with Clustering

机译：聚类的众包中的多类地面真理推论

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Due to low quality of crowdsourced labelers, the integrated label of each example is usually inferred from its multiple noisy labels provided by different labelers. This paper proposes a novel algorithm, Ground Truth Inference using Clustering (GTIC), to improve the quality of integrated labels for multi-class labeling. For a labeling case, GTIC utilizes the multiple noisy label sets of examples to generate features. Then, it uses a K-Means algorithm to cluster all examples into different groups, each of which is mapped to a specific class. Examples in the same cluster are assigned a corresponding class label. We compare GTIC with four existing multi-class ground truth inference algorithms, majority voting (MV), Dawid & Skene's (DS), ZenCrowd (ZC) and Spectral DS (SDS), on one synthetic and eight real-world datasets. Experimental results show that the performance of GTIC is significantly superior to the others in terms of both accuracy and M-AUC. Besides, the running time of GTIC is about twenty times faster than EM-based complicated inference algorithms.

机译：由于众包标签的质量低下，每个示例的集成标签通常由不同标签提供的多个嘈杂标签来推断。本文提出了一种新的算法，即基于聚类的地面真理推论（GTIC），以提高用于多类标签的集成标签的质量。对于加标签的情况，GTIC利用示例的多个嘈杂标签集生成特征。然后，它使用K-Means算法将所有示例分为不同的组，每个组都映射到特定的类。在同一集群中的示例被分配了相应的类标签。我们将GTIC与四个现有的多类地面事实推理算法（多数投票（MV），Dawid＆Skene's（DS），ZenCrowd（ZC）和Spectral DS（SDS））在一个合成数据集和八个真实数据集上进行了比较。实验结果表明，在准确性和M-AUC方面，GTIC的性能明显优于其他产品。此外，GTIC的运行时间比基于EM的复杂推理算法快约二十倍。

著录项

来源
《IEEE Transactions on Knowledge and Data Engineering》 |2016年第4期|1080-1085|共6页
作者
Jing Zhang; Victor S. Sheng; Jian Wu; Xindong Wu;
展开▼
作者单位

Jing Zhang is with the School of Computer Science and Engineering, Nanjing University of Science and Technology, 200 Xiaolingwei, Nanjing 210094, China. (email: jingzhang.cs@gmail.com);

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Clustering; EM algorithm; clustering,; crowdsourcing; ground truth inference; multi-class labeling;

机译：聚类;EM算法;聚类;众包;地面事实推论;多类标记;

相似文献

外文文献
中文文献
专利

1. Multi-Label Truth Inference for Crowdsourcing Using Mixture Models [J] . Zhang Jing, Wu Xindong IEEE Transactions on Knowledge and Data Engineering . 2021,第5期

机译：使用混合模型众包的多标签真理推断
2. Achieving Approximate Global Optimization of Truth Inference for Crowdsourcing Microtasks [J] . Lizhen Cui, Jing Chen, Wei He, Data Science and Engineering . 2021,第3期

机译：为众包微量化实现近似全球优化真理推断
3. Crowdsourcing Ground Truth for Medical Relation Extraction [J] . ANCA DUMITRACHE, LORA AROYO, CHRIS WELTY ACM Transactions on Interactive Intelligent Systems . 2018,第2期

机译：众包医疗关系提取的真相
4. Federated Truth Inference over Distributed Crowdsourcing Platforms [C] . Ming-Hsun Yang, Gin-Hao Liu, Y.-W. Peter Hong IEEE International Conference on Acoustics, Speech and Signal Processing . 2020

机译：分布式众包平台的联合真理推论
5. Machine Learning, Evolutionary Algorithms, and the Inference of Mathematical Truths. [D] . Hensley, Asher. 2013

机译：机器学习，进化算法和数学真理的推论。
6. Crowdsourcing image analysis for plant phenomics to generate ground truth data for machine learning [O] . Naihui Zhou, Zachary D. Siegel, Scott Zarecor, 2018

机译：对植物表象学进行众包图像分析以生成用于机器学习的地面真相数据
7. Achieving Approximate Global Optimization of Truth Inference for Crowdsourcing Microtasks [O] . Lizhen Cui, Jing Chen, Wei He, 2021

机译：为众群微量量量实现近似全球优化真理推断

Multi-Class Ground Truth Inference in Crowdsourcing with Clustering

摘要

著录项

相似文献

相关主题

期刊订阅