...
首页> 外文期刊>Artificial intelligence >A technical survey on statistical modelling and design methods for crowdsourcing quality control
【24h】

A technical survey on statistical modelling and design methods for crowdsourcing quality control

机译:众包质量控制统计建模与设计方法的技术调查

获取原文
获取原文并翻译 | 示例
           

摘要

Online crowdsourcing provides a scalable and inexpensive means to collect knowledge (e.g. labels) about various types of data items (e.g. text, audio, video). However, it is also known to result in large variance in the quality of recorded responses which often cannot be directly used for training machine learning systems. To resolve this issue, a lot of work has been conducted to control the response quality such that low-quality responses cannot adversely affect the performance of the machine learning systems. Such work is referred to as the quality control for crowdsourcing. Past quality control research can be divided into two major branches: quality control mechanism design and statistical models. The first branch focuses on designing measures, thresholds, interfaces and workflows for payment, gamification, question assignment and other mechanisms that influence workers' behaviour. The second branch focuses on developing statistical models to perform effective aggregation of responses to infer correct responses. The two branches are connected as statistical models (ⅰ) provide parameter estimates to support the measure and threshold calculation, and (ⅱ) encode modelling assumptions used to derive (theoretical) performance guarantees for the mechanisms. There are surveys regarding each branch but they lack technical details about the other branch. Our survey is the first to bridge the two branches by providing technical details on how they work together under frameworks that systematically unify crowdsourcing aspects modelled by both of them to determine the response quality. We are also the first to provide taxonomies of quality control papers based on the proposed frameworks. Finally, we specify the current limitations and the corresponding future directions for the quality control research.
机译:在线众包提供可扩展且廉价的手段,用于收集关于各种类型的数据项(例如文本,音频,视频)的知识(例如标签)。然而,也已知在记录的响应质量方差方差差异,这通常不能直接用于训练机器学习系统。为了解决这个问题,已经进行了大量的工作来控制响应质量,使得低质量的响应不能对机器学习系统的性能产生不利影响。这些工作被称为众包的质量控制。过去的质量控制研究可分为两个主要分支机构:质量控制机制设计和统计模型。第一家分支专注于设计措施,门槛,接口和工作流程,用于支付,游戏,质疑和影响工人行为的其他机制。第二个分支侧重于开发统计模型,以执行对响应的有效聚集,以推断正确的响应。两个分支连接为统计模型(Ⅰ)提供参数估计,以支持测量和阈值计算,(Ⅱ)编码建模假设用于导出机制的(理论)性能保证。有关每个分支的调查,但它们缺乏有关其他分支的技术细节。我们的调查是第一个通过提供关于它们如何在系统统一它们建模的众包以确定响应质量的框架方面的框架上努力实现两个分支机构来弥合两个分支机构。我们也是第一个基于所提出的框架提供质量控制文件分类的国家。最后,我们指定了质量控制研究的当前限制和相应的未来方向。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号