Conference: International Semantic Web Conference

QA4IE: A Question Answering Based Framework for Information Extraction

Abstract

Information Extraction (IE) refers to automatically extracting structured relation tuples from unstructured text. Common IE solutions, including Relation Extraction (RE) and open IE systems, can hardly handle cross-sentence tuples and are severely restricted by limited relation types as well as informal relation specifications (e.g., free-text-based relation tuples). To overcome these weaknesses, we propose a novel IE framework named QA4IE, which leverages flexible question answering (QA) approaches to produce high-quality relation triples across sentences. Based on this framework, we develop a large IE benchmark with high-quality human evaluation. The benchmark contains 293K documents, 2M golden relation triples, and 636 relation types. We compare our system with several IE baselines on this benchmark, and the results show that our system achieves substantial improvements.
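
The general idea of casting relation extraction as question answering can be illustrated with a minimal sketch. This is not the QA4IE implementation: the question templates, the relation names, and the `toy_answerer` function are all hypothetical stand-ins, and a real system would use a trained QA model in place of the cue-phrase lookup. The sketch only shows how one question per relation type, asked over a whole document, can yield a triple whose subject and object sit in different sentences.

```python
from typing import Callable, Dict, List, Optional, Tuple

# Hypothetical question templates, one per relation type (not from the paper).
TEMPLATES: Dict[str, str] = {
    "birth_place": "Where was {subject} born?",
    "occupation": "What is the occupation of {subject}?",
}

Triple = Tuple[str, str, str]
Answerer = Callable[[str, str], Optional[str]]


def extract_triples(document: str, subject: str, answer: Answerer) -> List[Triple]:
    """Ask one question per relation type and keep every answered one as a triple."""
    triples: List[Triple] = []
    for relation, template in TEMPLATES.items():
        question = template.format(subject=subject)
        span = answer(question, document)  # the QA component reads the whole document
        if span is not None:
            triples.append((subject, relation, span))
    return triples


def toy_answerer(question: str, document: str) -> Optional[str]:
    """Stand-in for a trained QA model: finds a canned cue phrase in the document."""
    cues = {"born": "was born in ", "occupation": "worked as a "}
    for key, cue in cues.items():
        if key in question and cue in document:
            start = document.index(cue) + len(cue)
            end = document.find(".", start)
            # Return the text between the cue phrase and the next full stop.
            return document[start:end if end != -1 else None].strip()
    return None


if __name__ == "__main__":
    doc = "Ada Lovelace was born in London. She later worked as a mathematician."
    for triple in extract_triples(doc, "Ada Lovelace", toy_answerer):
        print(triple)
```

In the usage example, the occupation is stated in a sentence that refers to the subject only as "She", so the second triple is a cross-sentence one. Because relation types live in a plain template table, adding a relation in this sketch means adding one question rather than changing the extraction code.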