【24h】

MINING ASSOCIATION RULES

机译:采矿协会规则

获取原文

摘要

Discovering probabilistic association rules between items in large databases is becoming an important process as the size and availability of the data increase. Today, the success of mining association rules technology is usually attributed to the a-priori optimization idea. A drawback of a-priori is the implicit assumption that all items have similar prior probabilities, which makes it sensitive to the support threshold and, for a specific threshold, it may miss some rules of higher confidence than the output ones. To catch these rules, a lower support level must be used which in turn makes the a-priori optimization less useful. To avoid these problems, we propose a new technique based on the relative support between items and the notion of association graph that captures the association relationships between the items. We present a necessary and sufficient condition for the existence of a rule with specific confidence. The necessary part of this condition is used to generate candidate itemsets without counting them from the database. Since the degree of any node in the association graph is bounded, generating the maximal itemsets is efficient. We provide a recursive algorithm to generate all maximal itemsets. The new method is suitable for interactive mining.
机译:发现大型数据库中的项目之间的概率关联规则正在成为数据增加的大小和可用性的重要过程。今天,采矿协会规则技术的成功通常归因于a-priori优化的想法。 a-priori的缺点是所有项目具有类似的现有概率的隐含假设,这使得对支持阈值敏感,并且对于特定阈值,它可能会错过比输出比输出更高的置信度规则。为了捕获这些规则,必须使用较低的支持级别,这反过来使a-priori优化不太有用。为避免这些问题,我们提出了一种基于项目之间的相对支持的新技术和关联图之间的概念,其捕获项目之间的关联关系。我们为特定信心存在一个规则存在的必要和充分条件。此条件的必要部分用于生成候选项集,而无需将它们与数据库计数。由于关联图中的任何节点的程度被界定,因此生成最大项集是有效的。我们提供递归算法来生成所有最大项集。新方法适用于互动挖掘。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号