首页> 外文会议>IEEE International Conference on Knowledge Graph >Resolving Seemingly Conflicting Fact Statements Caused by Missing Terms
【24h】

Resolving Seemingly Conflicting Fact Statements Caused by Missing Terms

机译:解决因术语缺失而引起的看似矛盾的事实陈述

获取原文

摘要

The Web has become the most useful resource for people to acquire information. However, it still contains much incorrect and inaccurate information. While the problem of resolving conflicting structured Web data has been studied extensively, the corresponding problem involving unstructured text data has rarely been ad-dressed. In this paper, we consider how to resolve seemingly conflicting fact statements (SCFS). A statement is a fact statement if it attempts to state a fact, although the stated fact may not necessarily be correct. In this paper, multiple fact statements are SCFSs if they are identical except that they have different terms at a certain (same) position. In general, SCFSs are not necessarily truly in conflict. Some conflicts may result from imprecise statements and may be resolved by improving the precision of the statement. In this work, we focus on a special type of SCFSs whose corresponding statements are imprecise because they have missing terms. More specifically, given a set of SCFSs, we propose a method to mine the possible missing terms. This method effectively transforms an imprecise fact statement to precise ones. At the same time, our method also resolves the seemingly conflicts among the transformed fact statements. The experimental results show that our best solution for missing term mining on different data sets can achieve close to 90% F-score, which improves a baseline method by about 40%. We also propose a method to place the mined missing term for each SCFS at the right place in the statement and our experiments show that this method has an accuracy of about 73%.
机译:Web已成为人们获取信息的最有用资源。但是,它仍然包含许多不正确和不准确的信息。尽管解决冲突的结构化Web数据的问题已得到广泛研究,但涉及非结构化文本数据的相应问题却很少得到解决。在本文中,我们考虑如何解决看似矛盾的事实陈述(SCFS)。如果陈述试图陈述事实,则该陈述为事实陈述,尽管陈述的事实不一定是正确的。在本文中,如果多个事实陈述相同,除了它们在某个(相同)位置具有不同的用语外,它们就是SCFS。通常,SCFS不一定真正存在冲突。某些冲突可能是由于语句不正确导致的,可以通过提高语句的精度来解决。在这项工作中,我们集中于一种特殊类型的SCFS,由于其缺少术语,因此其对应的语句不精确。更具体地说,给定一组SCFS,我们提出了一种方法来挖掘可能的缺失术语。这种方法有效地将不精确的事实陈述转换为精确的事实陈述。同时,我们的方法还解决了转换后的事实陈述之间的看似冲突。实验结果表明,我们针对不同数据集的缺失项挖掘的最佳解决方案可以实现接近90%的F分数,这将基线方法提高了约40%。我们还提出了一种将每个SCFS的挖掘缺失项放置在语句中正确位置的方法,我们的实验表明,该方法的准确性约为73%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号