
METHOD FOR THE EXTRACTION OF RELATION PATTERNS FROM ARTICLES

Abstract

A method for building a knowledge base containing entailment relations, including:

  • providing at least one input pattern (p) with N pattern slots (N > 1), the input pattern (p) expressing a specific semantic relation between N entities that fill the N pattern slots of the input pattern (p) as slot fillers;
  • providing at least one cluster (c) of articles, the articles of the cluster (c) relating to a common main topic;
  • processing the articles with respect to the input pattern (p) and identifying the identities which match the semantic type of the N pattern slots;
  • if the at least one input pattern matches a portion of an article (a) of the at least one cluster (c): storing the N slot fillers (s1, s2, ..., sN), which match the slots of the pattern (p), and a cluster identifier Ic of the cluster (c) into a first table S, wherein the N-tuple (s1, s2, ..., sN) and the cluster identifier Ic of the associated cluster (c) form one element of the table S;
  • for each element of table S, identifying appearances of the slot fillers (s1, s2, ..., sN) in a plurality of articles of cluster (c) and, for each appearance so identified, storing the slot fillers (s1, s2, ..., sN) together with the sentence in which they occur into a second table C0;
  • from the sentences stored in table C0, extracting patterns which span over the corresponding N slot fillers (s1, s2, ..., sN), the extracted pattern expressing a semantic relation between the N slot fillers; and
  • storing the extracted pattern together with the input pattern as an entailment relation in the knowledge base.
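
The steps of the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: it assumes N = 2, uses a hypothetical input pattern "X criticized Y", substitutes a capitalized-word regular expression for real semantic typing of the slots, and extracts surface-string patterns rather than whatever linguistic representation the patent actually uses. All function names, slot names, and sample articles are made up for illustration.

```python
import re

# Assumption: a single capitalized word stands in for an entity of the
# right semantic type; a real system would use proper entity recognition.
ENTITY = r"[A-Z][a-z]+"
SLOTS = ("<X>", "<Y>")                      # N = 2 pattern slots
INPUT_PATTERN = "<X> criticized <Y>"        # hypothetical input pattern p
INPUT_REGEX = re.compile(rf"(?P<X>{ENTITY}) criticized (?P<Y>{ENTITY})")


def split_sentences(article):
    """Naive splitter; assumes sentences end with '.', '!' or '?'."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", article) if s.strip()]


def build_table_s(clusters):
    """Table S: one (slot fillers, cluster identifier) element per match of p."""
    table_s = set()
    for cluster_id, articles in clusters.items():
        for article in articles:
            for match in INPUT_REGEX.finditer(article):
                table_s.add(((match.group("X"), match.group("Y")), cluster_id))
    return table_s


def build_table_c0(table_s, clusters):
    """Table C0: slot fillers plus every sentence of the same cluster
    in which all of the fillers appear."""
    table_c0 = []
    for fillers, cluster_id in sorted(table_s):
        for article in clusters[cluster_id]:
            for sentence in split_sentences(article):
                if all(filler in sentence for filler in fillers):
                    table_c0.append((fillers, sentence))
    return table_c0


def extract_pattern(fillers, sentence):
    """Replace the fillers by slot markers and keep the span covering all slots."""
    text = sentence
    for filler, slot in zip(fillers, SLOTS):
        text = text.replace(filler, slot)
    start = min(text.find(slot) for slot in SLOTS)
    end = max(text.find(slot) + len(slot) for slot in SLOTS)
    return text[start:end]


def build_knowledge_base(clusters):
    """Store (extracted pattern, input pattern) pairs as entailment relations."""
    knowledge_base = set()
    table_s = build_table_s(clusters)
    for fillers, sentence in build_table_c0(table_s, clusters):
        extracted = extract_pattern(fillers, sentence)
        if extracted != INPUT_PATTERN:      # skip the trivial self-entailment
            knowledge_base.add((extracted, INPUT_PATTERN))
    return knowledge_base


if __name__ == "__main__":
    # Made-up topic clusters: cluster identifier -> list of article texts.
    sample_clusters = {
        "c1": [
            "Smith criticized Jones over the budget.",
            "Smith attacked Jones in parliament. The vote is due on Friday.",
            "Jones was blamed by Smith for the delay.",
        ],
        "c2": ["Brown praised Green."],
    }
    for extracted, entailed in sorted(build_knowledge_base(sample_clusters)):
        print(f"{extracted}  =>  {entailed}")
```

In this sketch only cluster "c1" contributes to table S, because the input pattern matches none of the articles in "c2". Restricting the search for slot fillers to the cluster in which the match occurred is what lets differently worded sentences about the same topic, such as "Smith attacked Jones", be paired with the input pattern.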

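Bibliographic details

  • Publication number: US2010138216A1

    Patent type: (not given)

  • Publication date: 2010-06-03

    Original format: PDF

  • Applicant/assignee: HRISTO TANEV TANEV

    Application number: US20080595936

  • Inventor: HRISTO TANEV TANEV

    Filing date: 2008-04-15

  • Classification: G06F17/27

  • Country: US

  • Indexed: 2022-08-21 18:54:29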
