首页> 外文期刊>Expert Systems with Application >OpenIE-based approach for Knowledge Graph construction from text
【24h】

OpenIE-based approach for Knowledge Graph construction from text

机译:基于OpenIE的文本知识图构建方法

获取原文
获取原文并翻译 | 示例
           

摘要

Transforming unstructured text into a formal representation is an important goal of the Semantic Web in order to facilitate the integration and retrieval of information. The construction of Knowledge Graphs (KGs) pursues such an idea, where named entities (real world things) and their relations are extracted from text. In recent years, many approaches for the construction of KGs have been proposed by exploiting Discourse Analysis, Semantic Frames, or Machine Learning algorithms with existing Semantic Web data. Although such approaches are useful for processing taxonomies and connecting beliefs, they provide several linguistic descriptions, which lead to semantic data heterogeneity and thus, complicating data consumption. Moreover, Open Information Extraction (OpenIE) approaches have been slightly explored for the construction of KGs, which provide binary relations representing atomic units of information that could simplify the querying and representation of data. In this paper, we propose an approach to generate KGs using binary relations produced by an OpenIE approach. For such purpose, we present strategies for favoring the extraction and linking of named entities with KG individuals, and additionally, their association with grammatical units that lead to producing more coherent facts. We also provide decisions for selecting the extracted information elements for creating potentially useful RDF triples for the KG. Our results demonstrate that the integration of information extraction units with grammatical structures provides a better understanding of proposition-based representations provided by OpenIE for supporting the construction of KGs. (C) 2018 Elsevier Ltd. All rights reserved.
机译:为了促进信息的集成和检索,将非结构化文本转换为形式表示形式是语义网的重要目标。知识图(KGs)的构建遵循这样的思想,即从文本中提取命名实体(现实世界中的事物)及其关系。近年来,通过利用现有语义Web数据利用话语分析,语义框架或机器学习算法,提出了许多构建KG的方法。尽管此类方法可用于处理分类法和联系信念,但它们提供了几种语言描述,这导致语义数据异质性,从而使数据使用复杂化。此外,对于KG的构造,已经稍微探索了开放信息提取(OpenIE)方法,该KG提供了表示信息原子单位的二进制关系,可以简化数据的查询和表示。在本文中,我们提出了一种使用OpenIE方法生成的二进制关系生成KG的方法。为此,我们提出了有利于命名实体与K​​G个体的提取和链接,以及它们与导致产生更多连贯事实的语法单元的关联的策略。我们还提供选择提取信息元素的决策,以为KG创建潜在有用的RDF三元组。我们的结果表明,信息提取单元与语法结构的集成为OpenIE提供的基于命题的表示形式提供了更好的理解,以支持KG的构建。 (C)2018 Elsevier Ltd.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号