首页> 外国专利> Extracting complex entities and relationships from unstructured data

Extracting complex entities and relationships from unstructured data

机译:从非结构化数据中提取复杂的实体和关系

摘要

To extract relationships between complex entities from unstructured data, a parser parses, using an existing language model, the unstructured data to generate a parse tree. From the parse tree, a set of tokens is created. A token in the set of tokens includes a set of words found in the unstructured data. The set of tokens is inserted in the existing language model to form an enhanced language model. The unstructured data is re-parsed using the enhanced language model to create a knowledge graph. From the knowledge graph, a relationship between a subset of the set of tokens is extracted.
机译:为了从非结构化数据中提取复杂实体之间的关系,解析器使用现有的语言模型解析非结构化数据以生成解析树。从解析树中,创建一组令牌。令牌集合中的令牌包括在非结构化数据中找到的一组单词。令牌集被插入到现有的语言模型中以形成增强的语言模型。使用增强的语言模型重新解析非结构化数据,以创建知识图。从知识图提取令牌集合的子集之间的关系。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号