首页> 外文会议>Conference on empirical methods in natural language processing >DeFactoNLP: Fact Verification using Entity Recognition, TFIDF Vector Comparison and Decomposable Attention
【24h】

DeFactoNLP: Fact Verification using Entity Recognition, TFIDF Vector Comparison and Decomposable Attention

机译:defactonlp:使用实体识别,TFIDF矢量比较和可分解注意力的事实验证

获取原文

摘要

In this paper, we describe DeFactoNLP, the system we designed for the FEVER 2018 Shared Task. The aim of this task was to conceive a system that can not only automatically assess the veracity of a claim but also retrieve evidence supporting this assessment from Wikipedia. In our approach, the Wikipedia documents whose Term Frequency-Inverse Document Frequency (TFIDF) vectors are most similar to the vector of the claim and those documents whose names are similar to those of the named entities (NEs) mentioned in the claim are identified as the documents which might contain evidence. The sentences in these documents are then supplied to a textual entailment recognition module. This module calculates the probability of each sentence supporting the claim, contradicting the claim or not providing any relevant information to assess the veracity of the claim. Various features computed using these probabilities are finally used by a Random Forest classifier to determine the overall truthfulness of the claim. The sentences which support this classification are returned as evidence. Our approach achieved a 0.4277 evidence F1-score, a 0.5136 label accuracy and a 0.3833 FEVER score.
机译:在本文中,我们描述了Defactonlp,我们为发烧2018年共享任务设计的系统。这项任务的目的是设想一个不仅可以自动评估索赔的真实性的系统,而且还检索支持维基百科的评估的证据。在我们的方法中,术语频率 - 逆文档频率(TFIDF)向量的维基百科文档与权利要求的索赔和那些名称类似于所述权利要求中提到的名称实体(NE)的文件的传染媒介被识别为可能包含证据的文件。然后将这些文档中的句子提供给文本引入识别模块。该模块计算支持这些索赔的每个句子的可能性,抵触声明或不提供任何相关信息以评估索赔的真实性。随机林分类器最终使用使用这些概率计算的各种特征,以确定所述权利要求的整体真实性。支持此分类的句子作为证据返回。我们的方法取得了0.4277个证据F1分数,0.5136的标签准确性和0.3833的发热得分。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号