首页> 外文会议>Machine learning and data mining in pattern recognition >Mining Multiple Level Non-redundant Association Rules through Two-Fold Pruning of Redundancies
【24h】

Mining Multiple Level Non-redundant Association Rules through Two-Fold Pruning of Redundancies

机译:通过两次修剪冗余来挖掘多级非冗余关联规则

获取原文
获取原文并翻译 | 示例

摘要

Association rules (AR) are a class of patterns which describe regularities in a set of transactions. When items of transactions are organized in a taxonomy, AR can be associated with a level of the taxonomy since they contain only items at that level. A drawback of multiple level AR mining is represented by the generation of redundant rules which do not add further information to that expressed by other rules. In this paper, a method for the discovery of non-redundant multiple level AR is proposed. It follows the usual two-stepped procedure for AR mining and it prunes redundancies in each step. In the first step, redundancies are removed by resorting to the notion of multiple level closed frequent itemsets, while in the second step, pruning is based on an extension of the notion of minimal rules. The proposed technique has been applied to a real case of analysis of textual data. An empirical comparison with the Apriori algorithm proves the advantages of the proposed method in terms of both time-performance and redundancy reduction.
机译:关联规则(AR)是一类模式,用于描述一组交易中的规则性。当交易项目按分类法进行组织时,AR可以与该分类法级别相关联,因为它们仅包含该级别的项目。多级AR挖掘的缺点是生成冗余规则,这些规则不会向其他规则表示的信息添加更多信息。本文提出了一种发现非冗余多级AR的方法。它遵循用于AR挖掘的常规两步过程,并且在每个步骤中都会修剪冗余。第一步,通过使用多级封闭频繁项集的概念来消除冗余,而在第二步中,修剪基于最小规则的概念的扩展。所提出的技术已应用于文本数据分析的真实案例。与Apriori算法的经验比较证明了该方法在时间性能和冗余减少方面的优势。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号