SIRIUS-LTG: An Entity Linking Approach to Fact Extraction and Verification

机译：Sirius-LTG：实体链接到事实提取和验证的方法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This article presents the SIRIUS-LTG system for the Fact Extraction and VERification (FEVER) Shared Task. It consists of three components: 1) Wikipedia Page Retrieval: First we extract the entities in the claim, then we find potential Wikipedia URI candidates for each of the entities using a SPARQL query over DBpedia 2) Sentence selection: We investigate various techniques i.e. Smooth Inverse Frequency (SIF), Word Mover's Distance (WMD), Soft-Cosine Similarity, Cosine similarity with unigram Term Frequency Inverse Document Frequency (TF-IDF) to rank sentences by their similarity to the claim. 3) Textual Entailment: We compare three models for the task of claim classification. We apply a Decomposable Attention (DA) model (Parikh et al., 2016), a Decomposed Graph Entailment (DGE) model (Khot et al., 2018) and a Gradient-Boosted Decision Trees (TalosTree) model (Sean et al., 2017) for this task. The experiments show that the pipeline with simple Cosine Similarity using TFIDF in sentence selection along with DA model as labelling model achieves the best results on the development set (F1 evidence: 32.17, label accuracy: 59.61 and FEVER score: 0.3778). Furthermore, it obtains 30.19, 48.87 and 36.55 in terms of F1 evidence, label accuracy and FEVER score, respectively, on the test set. Our system ranks 15th among 23 participants in the shared task prior to any human-evaluation of the evidence.

机译：本文介绍了Sirius-LTG系统，用于事实提取和验证（发烧）共享任务。它由三个组成部分组成：1）维基百科页面检索：首先，我们将各个实体提取各个实体，然后使用SPARQL查询在DBPedia 2）句子选择：我们调查各种技术，即平滑逆频率（SIF），Word Mover的距离（WMD），软余弦相似性，与Unigram术语频率逆文档频率（TF-IDF）的余弦相似度，通过与索赔的相似性来排序句子。 3）文本意外：我们比较三种模型用于索赔分类的任务。我们应用一种可分解的注意力（DA）模型（Parikh等，2016），一个分解的图表征兆（DGE）模型（Khot等，2018）和梯度提升决策树（Talostree）模型（Sean等人。，2017）为此任务。实验表明，随着句子选择中使用TFIDF的简单余弦相似性的管道与DA模型一起实现了开发集的最佳结果（F1证据：32.17，标签准确度：59.61和发热得分：0.3778）。此外，它在测试集上分别获得F1证据，标记精度和发热分数的30.19,48.87和36.55。在对证据的任何人类评估之前，我们的系统在23名参与者中排名第15位。

著录项

来源
《Conference on empirical methods in natural language processing》|2018年|xii 171 p.|共5页
会议地点
作者
Farhad Nooralahzadeh; Lilja Ovrelid;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类程序设计、软件工程;
关键词
入库时间 2022-08-20 23:26:45

相似文献

外文文献
中文文献
专利

1. A Novel Approach for Analyzing Entity Linking Between Words and Entities for a Knowledge Base Using an Attention-Based Bilinear Joint Learning and Weighted Summation Model [J] . Luo Shuanghu, Wang Penglong, Cao Min Quality Control, Transactions . 2020,第期

机译：一种新的方法，用于分析使用基于注意的双线性联合学习和加权求和模型的知识库词和实体之间的实体链接的方法
2. FEEL: Framework for the integration of Entity Extraction and Linking systems [J] . Julio Hernandez, Jose L. Martinez-Rodriguez, Ivan Lopez-Arevalo, Journal of web semantics: . 2020,第Mara期

机译：感觉：实体提取和链接系统集成的框架
3. LiTeWi: A Combined Term Extraction and Entity Linking Method for Eliciting Educational Ontologies From Textbooks [J] . Angel Conde, Mikel Larranaga, Ana Arruarte, Journal of the American Society for Information Science and Technology . 2016,第2期

机译：LiTeWi：组合术语提取和实体链接方法，用于从教科书中剔除教育本体
4. SIRIUS-LTG: An Entity Linking Approach to Fact Extraction and Verification [C] . Farhad Nooralahzadeh, Lilja Ovrelid First workshop on fact extraction and VERification 2018 . 2018

机译：SIRIUS-LTG：事实提取和验证的实体链接方法
5. Entity Analysis with Weak Supervision: Typing, Linking, and Attribute Extraction. [D] . Ling, Xiao. 2015

机译：具有弱监督的实体分析：键入，链接和属性提取。
6. Correction: A linear classifier based on entity recognition tools and a statistical approach to method extraction in the protein-protein interaction literature [O] . Anália Lourenço, Michael Conover, Andrew Wong, 2012

机译：校正：一种基于实体识别工具的线性分类器以及一种蛋白质-蛋白质相互作用文献中的统计方法提取方法
7. SIRIUS-LTG: An Entity Linking Approach to Fact Extraction and Verification [O] . Farhad Nooralahzadeh, Lilja Øvrelid 2018

机译：Sirius-LTG：实体链接到事实提取和验证的方法

SIRIUS-LTG: An Entity Linking Approach to Fact Extraction and Verification

摘要

著录项

相似文献

相关主题

期刊订阅