首页> 美国卫生研究院文献>other >Modeling Truth Existence in Truth Discovery
【2h】

Modeling Truth Existence in Truth Discovery

机译:在真相发现中模拟真相存在

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

When integrating information from multiple sources, it is common to encounter conflicting answers to the same question. Truth discovery is to infer the most accurate and complete integrated answers from conflicting sources. In some cases, there exist questions for which the true answers are excluded from the candidate answers provided by all sources. Without any prior knowledge, these questions, named no-truth questions, are difficult to be distinguished from the questions that have true answers, named has-truth questions. In particular, these no-truth questions degrade the precision of the answer integration system. We address such a challenge by introducing source quality, which is made up of three fine-grained measures: silent rate, false spoken rate and true spoken rate. By incorporating these three measures, we propose a probabilistic graphical model, which simultaneously infers truth as well as source quality without any a priori training involving ground truth answers. Moreover, since inferring this graphical model requires parameter tuning of the prior of truth, we propose an initialization scheme based upon a quantity named truth existence score, which synthesizes two indicators, namely, participation rate and consistency rate. Compared with existing methods, our method can effectively filter out no-truth questions, which results in more accurate source quality estimation. Consequently, our method provides more accurate and complete answers to both has-truth and no-truth questions. Experiments on three real-world datasets illustrate the notable advantage of our method over existing state-of-the-art truth discovery methods.
机译:在集成来自多个来源的信息时,通常会遇到对同一问题的矛盾答案。真相发现是从冲突的来源中推断出最准确和完整的综合答案。在某些情况下,存在所有源提供的候选答案中都排除了真实答案的问题。在没有任何先验知识的情况下,很难将这些名为无真相的问题与具有真实答案的有真相的问题区分开。尤其是,这些不真实的问题会降低答案集成系统的准确性。我们通过引入源质量来应对这一挑战,源质量由三个细粒度的指标组成:静默率,错误口语率和真实口语率。通过合并这三个度量,我们提出了一个概率图形模型,该模型同时推断真相和源质量,而无需任何涉及地面真相答案的先验训练。此外,由于推断该图形模型需要对真值先验进行参数调整,因此我们提出了一种基于名为真值存在得分的数量的初始化方案,该方案综合了参与率和一致性率两个指标。与现有方法相比,我们的方法可以有效地过滤掉不真实的问题,从而可以更准确地估算源质量。因此,我们的方法可以为“真”和“非”问题提供更准确和完整的答案。在三个真实世界的数据集上进行的实验表明,与现有的最新真相发现方法相比,我们的方法具有明显的优势。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号