System and method for mining large, diverse, distributed, and heterogeneous datasets

Abstract

A method for directed mining of a heterogeneous dataset with a computer comprising: populating a rule base with known rules, wherein each rule has a context and a situation; populating a case base with known cases, wherein each case has a context and a situation, and wherein the case base is partitioned from the rule base; ascribing a natural language semantics to predicates of the known cases and rules; randomly transforming the known rules and the known cases to form new rules by extracting a maximum number of common predicates; segmenting the rules and the cases on the basis of shared predicates without making distinction between context and situation predicates; abducing new knowledge from the dataset by fuzzily matching the context of a new rule to a situation the new rule does not cover; and issuing a query to a user to supply missing predicates of the fuzzy match.
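
The steps named in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the `Entry` layout, every function name, the pairwise-intersection transform, and the overlap score with its 0.5 threshold are all assumptions made for illustration, since the abstract does not specify how predicates are represented or how the fuzzy match is scored.

```python
from dataclasses import dataclass
import random


@dataclass(frozen=True)
class Entry:
    """A rule or a case as two sets of predicate names (hypothetical layout)."""
    context: frozenset
    situation: frozenset

    def predicates(self):
        # All predicates of the entry, with no context/situation distinction.
        return self.context | self.situation


def entry(context, situation):
    """Convenience constructor that freezes both predicate sets."""
    return Entry(frozenset(context), frozenset(situation))


def generalize(a, b):
    """Build a candidate rule from every predicate two entries share."""
    ctx = a.context & b.context
    sit = a.situation & b.situation
    return Entry(ctx, sit) if ctx and sit else None


def random_transform(rules, cases, trials, seed=0):
    """Randomly pair known entries and keep generalizations that are new."""
    rng = random.Random(seed)
    pool = list(rules) + list(cases)  # needs at least two entries
    new_rules = set()
    for _ in range(trials):
        a, b = rng.sample(pool, 2)
        candidate = generalize(a, b)
        if candidate is not None and candidate not in pool:
            new_rules.add(candidate)
    return new_rules


def segment(entries):
    """Index entries by each predicate they contain, from either set."""
    index = {}
    for e in entries:
        for p in e.predicates():
            index.setdefault(p, set()).add(e)
    return index


def fuzzy_match(rule, observed, threshold=0.5):
    """Return (score, missing predicates) when enough of the context is observed.

    The score is the fraction of the rule's context found in `observed`;
    this measure and the threshold are assumptions, not taken from the patent.
    """
    if not rule.context:
        return None
    observed = frozenset(observed)
    score = len(rule.context & observed) / len(rule.context)
    if score < threshold:
        return None
    return score, rule.context - observed


def query_missing(missing, ask):
    """Ask the user, through the `ask` callback, to confirm missing predicates."""
    return {p for p in sorted(missing) if ask(p)}


if __name__ == "__main__":
    # Invented example data; the rule base and case base are kept separate.
    rules = [entry({"overcast", "humid", "cold"}, {"rain"})]
    cases = [entry({"overcast", "humid", "windy"}, {"rain", "storm"})]
    for rule in random_transform(rules, cases, trials=3):
        match = fuzzy_match(rule, {"overcast"})
        if match is not None:
            score, missing = match
            print(score, query_missing(missing, lambda p: True))
```

Passing the user query in as a callback keeps the sketch testable; in an interactive setting `ask` could wrap `input`.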

Bibliographic details

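  • Publication number: US9449280B2
  • Patent type:
  • Publication date: 2016-09-20
  • Original format: PDF
  • Applicant/patentee: STUART HARVEY RUBIN
  • Application number: US201414316439
  • Inventor: STUART HARVEY RUBIN
  • Filing date: 2014-06-26
  • Classification: G06N7/02; G06N5/02; G06N5/04
  • Country: US
  • Date added to database: 2022-08-21 14:31:39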
