
Identifying conceptual gaps in a knowledge base


Abstract

A method and system for augmenting a corpus with documents on concepts not sufficiently covered within the corpus is provided. The augmentation system generates a corpus concept graph from the documents of a corpus. A corpus concept graph represents concepts of the documents as nodes and related concepts as links between nodes. To generate a corpus concept graph, the augmentation system identifies the concepts that are related within each document of the corpus and adds nodes and links to the corpus concept graph for related concepts. The augmentation system analyzes the corpus concept graph to determine whether the relatedness of concepts of the documents of the corpus is sufficient. If the relatedness of a pair of concepts is not sufficient, then the augmentation system attempts to identify documents not already in the corpus that are related to the concepts that are not sufficiently related.
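The steps in the abstract can be illustrated with a minimal sketch. The abstract does not say how relatedness is measured or what counts as "sufficient", so everything specific here is an assumption made for illustration: each document is taken as an already-extracted list of concepts, the relatedness of a pair is the number of documents in which both concepts appear, and a pair is insufficiently related when that count falls below a hypothetical `min_docs` threshold. The function names `build_concept_graph`, `find_gaps`, and `select_augmenting_docs` are likewise hypothetical and are not taken from the patent.

```python
from collections import Counter
from itertools import combinations


def build_concept_graph(docs):
    """Build a corpus concept graph from per-document concept lists.

    Returns (nodes, links): nodes is the set of concepts, and links maps a
    sorted concept pair to the number of documents containing both concepts.
    Assumption: co-occurrence within a document means "related".
    """
    nodes = set()
    links = Counter()
    for concepts in docs:
        unique = sorted(set(concepts))
        nodes.update(unique)
        for pair in combinations(unique, 2):
            links[pair] += 1
    return nodes, links


def find_gaps(nodes, links, min_docs=2):
    """Return concept pairs whose co-occurrence count is below min_docs.

    Assumption: min_docs is an illustrative sufficiency threshold.
    """
    return [p for p in combinations(sorted(nodes), 2) if links[p] < min_docs]


def select_augmenting_docs(gaps, candidates):
    """Pick candidate documents (not yet in the corpus) that mention both
    concepts of at least one insufficiently related pair."""
    selected = []
    for name, concepts in candidates.items():
        present = set(concepts)
        if any(a in present and b in present for a, b in gaps):
            selected.append(name)
    return selected


corpus = [
    ["graph", "node", "link"],
    ["graph", "node"],
    ["corpus", "document"],
    ["corpus", "graph"],
]
nodes, links = build_concept_graph(corpus)
gaps = find_gaps(nodes, links, min_docs=2)

# Hypothetical external documents, keyed by an illustrative name.
candidates = {
    "ext-1": ["corpus", "link", "node"],
    "ext-2": ["graph", "node"],
}
augmenting = select_augmenting_docs(gaps, candidates)
```

In this toy corpus only the pair ("graph", "node") co-occurs in two documents, so every other pair is reported as a gap, and only the candidate that covers one of those gaps is selected for augmentation. A real system would need a concept extractor and a search step over external sources, neither of which is described in the abstract.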

Bibliographic details

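  • Publication number: US7555472B2

  • Publication date: 2009-06-30

  • Original format: PDF

  • Applicant/assignee: ALAN CRAIG; KALEV LEETARU

  • Application number: US20050218667

  • Inventors: ALAN CRAIG; KALEV LEETARU

  • Filing date: 2005-09-02

  • Classification: G06F17/00; G06N5/02

  • Country: US

  • Added to database: 2022-08-21 19:30:25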
