首页> 外文会议>International conference on database systems for advanced applications >A General Fine-Grained Truth Discovery Approach for Crowdsourced Data Aggregation
【24h】

A General Fine-Grained Truth Discovery Approach for Crowdsourced Data Aggregation

机译:用于众包数据聚合的通用细粒度真相发现方法

获取原文

摘要

Crowdsourcing has been proven to be an efficient tool to collect large-scale datasets. Answers provided by the crowds are often noisy and conflicted, which makes aggregating them to infer ground truth a critical challenge. Existing fine-grained truth discovery methods solve this problem by exploring the correlation between source reliability and task topics or answers. However, they can only work on limited tasks, which results in the incompatibility with Writing tasks and Transcription tasks, along with the insufficient utilization of the global dataset. To maintain compatibility, we consider the existence of clusters in both tasks and sources, then propose a general fine-grained method. The proposed approach contains two integral components: kl-means and Pattern-based Truth Discovery (PTD). With the aid of ground truth data, kl-means directly employs a co-clustering reliability model on the correctness matrix to learn the patterns. Then PTD conducts the answer aggregation by incorporating captured patterns, producing a more accurate estimation. Therefore, our approach is compatible with all tasks and can better demonstrate the correlation among tasks and sources. Experimental results show that our method can produce a more precise estimation than other general truth discovery methods due to its ability to learn and utilize the patterns of both tasks and sources.
机译:众包已被证明是收集大规模数据集的有效工具。人群提供的答案通常是嘈杂的和冲突的,这使得将它们汇总以推断地面真相成为一项关键挑战。现有的细粒度真相发现方法通过探索源可靠性与任务主题或答案之间的相关性来解决此问题。但是,它们只能处理有限的任务,这导致与“编写”任务和“转录”任务不兼容,以及对全局数据集的利用不足。为了保持兼容性,我们考虑了任务和源中都存在集群,然后提出了一种通用的细粒度方法。所提出的方法包含两个组成部分:kl-means和基于模式的真相发现(PTD)。借助地面真实数据,kl-means直接在正确性矩阵上采用共聚可靠性模型来学习模式。然后,PTD通过合并捕获的模式进行答案汇总,以产生更准确的估计。因此,我们的方法与所有任务兼容,并且可以更好地证明任务与源之间的相关性。实验结果表明,由于该方法具有学习和利用任务和源代码的模式的能力,因此可以比其他一般真相发现方法产生更精确的估计。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号