Improving the Quality of Crowdsourcing Labels by Combination of Golden Data and Incentive

Abstract

The rapid rise of deep learning and AI is inseparable from the support of massive amounts of labeled data. Crowdsourcing has become a cheap and efficient paradigm for labeling large-scale unlabeled data. However, owing to the various uncertainties of crowdsourcing workers (also called labelers), it yields many low-quality and incorrect labels. To address this fundamental challenge, many redundancy-based ground truth inference algorithms have been proposed in the past few years. These algorithms assign each labeling task to multiple workers and infer the true label of each instance in the task from its set of multiple labels. In this paper, we devise a novel scheme to improve the quality of labeled data and infer the true label. The scheme uses a small proportion of golden data, that is, data already labeled correctly, to estimate workers' ability and reliability, and uses an incentive mechanism to motivate workers to do their best. Through experiments, we demonstrate that our method is effective and robust to low-quality workers: it outperforms Majority Voting (MV) and some other commonly used algorithms.
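The abstract does not give the scheme's details, so the following is only a minimal sketch of the general idea it names: estimating each worker's reliability from golden tasks and using that estimate when aggregating redundant labels, compared against plain majority voting. The function names, the Laplace smoothing, the log-odds weighting (which assumes binary labels), and the toy data are all assumptions for illustration, not the authors' method. The incentive mechanism is not modeled.

```python
import math
from collections import Counter, defaultdict


def majority_vote(labels):
    """Return the most frequent label among one task's worker labels."""
    return Counter(labels.values()).most_common(1)[0][0]


def estimate_accuracy(worker_labels, gold, smoothing=1.0):
    """Laplace-smoothed accuracy of each worker on the golden tasks (assumed estimator)."""
    correct = defaultdict(float)
    seen = defaultdict(float)
    for task, truth in gold.items():
        for worker, label in worker_labels.get(task, {}).items():
            seen[worker] += 1
            correct[worker] += label == truth
    return {w: (correct[w] + smoothing) / (seen[w] + 2 * smoothing) for w in seen}


def weighted_vote(labels, accuracy, default=0.5):
    """Aggregate labels with log-odds weights; assumes binary labels."""
    scores = defaultdict(float)
    for worker, label in labels.items():
        p = accuracy.get(worker, default)  # workers never seen on gold get weight 0
        scores[label] += math.log(p / (1 - p))
    return max(scores, key=scores.get)


# Hypothetical toy data: g1..g3 are golden tasks, t1 is an ordinary task.
gold = {"g1": 1, "g2": 0, "g3": 1}
worker_labels = {
    "g1": {"a": 1, "b": 0, "c": 0},
    "g2": {"a": 0, "b": 0, "c": 1},
    "g3": {"a": 1, "b": 0, "c": 1},
    "t1": {"a": 1, "b": 0, "c": 0},
}

acc = estimate_accuracy(worker_labels, gold)
mv_label = majority_vote(worker_labels["t1"])
weighted_label = weighted_vote(worker_labels["t1"], acc)
print(acc, mv_label, weighted_label)
```

On this toy data, majority voting follows the two unreliable workers and picks label 0 for `t1`, while the gold-weighted vote follows the one worker who answered every golden task correctly and picks label 1.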

Bibliographic details

Source
IEEE International Conference on Anti-counterfeiting, Security, and Identification | 2018 | pp. 10-15 | 6 pages
Authors
Peijun Yang; Haibin Cai; Zhiming Zheng
Original format: PDF
Keywords
data handling; inference mechanisms; learning (artificial intelligence)
