首页> 外文期刊>ACM journal of data and information quality >Discovering Patterns for Fact Checking in Knowledge Graphs
【24h】

Discovering Patterns for Fact Checking in Knowledge Graphs

机译:发现模式以检查知识图表

获取原文
获取原文并翻译 | 示例
       

摘要

This article presents a new framework that incorporates graph patterns to support fact checking in knowledge graphs. Our method discovers discriminant graph patterns to construct classifiers for fact prediction. First, we propose a class of graph fact checking rules (GFCs). A GFC incorporates graph patterns that best distinguish true and false facts of generalized fact statements. We provide statistical measures to characterize useful patterns that are both discriminant and diversified. Second, we show that it is feasible to discover GFCs in large graphs with optimality guarantees. We develop an algorithm that performs localized search to generate a stream of graph patterns, and dynamically assemble the best GFCs from multiple GFC sets, where each set ensures quality scores within certain ranges. The algorithm guarantees a (1/2 - ε) approximation when it (early) terminates. We also develop a space-efficient alternative that dynamically spawns prioritized patterns with best marginal gains to the verified GFCs. It guarantees a (1 - 1/e) approximation. Both strategies guarantee a bounded time cost independent of the size of the underlying graph. Third, to support fact checking, we develop two classifiers, which make use of top-ranked GFCs as predictive rules or instance-level features of the pattern matches induced by GFCs, respectively. Using real-world data, we experimentally verify the efficiency and the effectiveness of GFC-based techniques for fact checking in knowledge graphs and verify its application in knowledge exploration and news prediction.
机译:本文介绍了一个新的框架,该框架包含图形模式来支持事实检查知识图表。我们的方法发现判别图案模式来构建事实预测的分类器。首先,我们提出了一类图形事实检查规则(GFC)。 GFC包含最佳区分广义事实陈述的真实和虚假事实的图形模式。我们提供统计措施,以表征判别和多样化的有用模式。其次,我们表明,在具有最优性保证的大图中发现GFC是可行的。我们开发一种执行本地化搜索的算法以生成图形图案流,并动态地组装来自多个GFC集的最佳GFC,其中每个集合确保某些范围内的质量得分。算法在(早期)终止时保证(1/2 - ε)近似。我们还开发了一种节省高效的替代方案,可动态产生具有最佳边缘收益的优先模式,以便验证的GFC。它保证了(1 - 1 / e)近似。这两个策略都保证了与底层图表的大小无关的有界时间成本。第三,为了支持事实检查,我们开发了两个分类器,它可以分别利用顶级GFC作为GFC引起的模式匹配的预测规则或实例级功能。使用现实世界数据,我们通过实验验证基于GFC的技术的效率和有效性,以便检查知识图表并验证其在知识探索和新闻预测中的应用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号