Method and apparatus for data mining to discover associations and covariances associated with data

Abstract

Data mining techniques are provided that are effective and efficient for discovering useful information from an amorphous collection or data set of records. For example, the present invention provides for the mining of data, e.g., of several or many records, to discover interesting associations between entries of qualitative text, and covariances between data of quantitative numerical types, in records. Although not limited thereto, the invention has particular application and advantage when the data is of a type such as clinical, pharmacogenomic, forensic, police, and financial records, which are characterized by many varied entries; the problem is then said to be one of “high dimensionality,” which has posed mathematical and technical difficulties for researchers. This is especially true when considering strong negative associations and negative covariance, i.e., between items of data that so rarely come together that their concurrence is never seen in any record, yet the fact that this is not expected is of potentially great interest.
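The two quantities named in the abstract can be illustrated with a minimal sketch. It is not the patent's own measure: it assumes a simple smoothed log-ratio of observed to expected co-occurrence for qualitative items and the ordinary sample covariance for numeric columns. The function names, the `smoothing` parameter, and the example records are all hypothetical. The smoothing term is what lets a pair that never co-occurs still receive a finite, strongly negative score.

```python
import math
from collections import Counter
from itertools import combinations


def association_scores(records, smoothing=0.5):
    """Smoothed log(observed / expected) co-occurrence for every item pair.

    Assumption: expected counts come from treating items as independent.
    Negative scores mark pairs seen together less often than expected,
    including pairs that never appear in the same record.
    """
    n = len(records)
    item_counts = Counter(item for rec in records for item in set(rec))
    pair_counts = Counter(
        pair for rec in records for pair in combinations(sorted(set(rec)), 2)
    )
    scores = {}
    for a, b in combinations(sorted(item_counts), 2):
        observed = pair_counts.get((a, b), 0)
        expected = item_counts[a] * item_counts[b] / n
        scores[(a, b)] = math.log((observed + smoothing) / (expected + smoothing))
    return scores


def sample_covariance(xs, ys):
    """Sample covariance of two equal-length numeric columns."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    return sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / (n - 1)


# Hypothetical records: each is a set of qualitative entries.
example_records = [{"smoker", "cough"}] * 4 + [{"nonsmoker", "healthy"}] * 4
example_scores = association_scores(example_records)
```

Here `example_scores[("cough", "smoker")]` is positive, while `example_scores[("nonsmoker", "smoker")]` is negative even though that pair is never observed together, which is the case the abstract singles out as difficult.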

Bibliographic details

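  • Publication number: US7043476B2

  • Publication date: 2006-05-09

  • Original format: PDF

  • Applicant/assignee: BARRY ROBSON

  • Application number: US20020269375

  • Inventor: BARRY ROBSON

  • Filing date: 2002-10-11

  • Classification: G06F17/30

  • Country: US

  • Database entry time: 2022-08-21 21:40:47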
