首页> 外文会议>Conference on empirical methods in natural language processing >SIRIUS-LTG: An Entity Linking Approach to Fact Extraction and Verification
【24h】

SIRIUS-LTG: An Entity Linking Approach to Fact Extraction and Verification

机译:Sirius-LTG:实体链接到事实提取和验证的方法

获取原文

摘要

This article presents the SIRIUS-LTG system for the Fact Extraction and VERification (FEVER) Shared Task. It consists of three components: 1) Wikipedia Page Retrieval: First we extract the entities in the claim, then we find potential Wikipedia URI candidates for each of the entities using a SPARQL query over DBpedia 2) Sentence selection: We investigate various techniques i.e. Smooth Inverse Frequency (SIF), Word Mover's Distance (WMD), Soft-Cosine Similarity, Cosine similarity with unigram Term Frequency Inverse Document Frequency (TF-IDF) to rank sentences by their similarity to the claim. 3) Textual Entailment: We compare three models for the task of claim classification. We apply a Decomposable Attention (DA) model (Parikh et al., 2016), a Decomposed Graph Entailment (DGE) model (Khot et al., 2018) and a Gradient-Boosted Decision Trees (TalosTree) model (Sean et al., 2017) for this task. The experiments show that the pipeline with simple Cosine Similarity using TFIDF in sentence selection along with DA model as labelling model achieves the best results on the development set (F1 evidence: 32.17, label accuracy: 59.61 and FEVER score: 0.3778). Furthermore, it obtains 30.19, 48.87 and 36.55 in terms of F1 evidence, label accuracy and FEVER score, respectively, on the test set. Our system ranks 15th among 23 participants in the shared task prior to any human-evaluation of the evidence.
机译:本文介绍了Sirius-LTG系统,用于事实提取和验证(发烧)共享任务。它由三个组成部分组成:1)维基百科页面检索:首先,我们将各个实体提取各个实体,然后使用SPARQL查询在DBPedia 2)句子选择:我们调查各种技术,即平滑逆频率(SIF),Word Mover的距离(WMD),软余弦相似性,与Unigram术语频率逆文档频率(TF-IDF)的余弦相似度,通过与索赔的相似性来排序句子。 3)文本意外:我们比较三种模型用于索赔分类的任务。我们应用一种可分解​​的注意力(DA)模型(Parikh等,2016),一个分解的图表征兆(DGE)模型(Khot等,2018)和梯度提升决策树(Talostree)模型(Sean等人。 ,2017)为此任务。实验表明,随着句子选择中使用TFIDF的简单余弦相似性的管道与DA模型一起实现了开发集的最佳结果(F1证据:32.17,标签准确度:59.61和发热得分:0.3778)。此外,它在测试集上分别获得F1证据,标记精度和发热分数的30.19,48.87和36.55。在对证据的任何人类评估之前,我们的系统在23名参与者中排名第15位。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号