首页> 外文会议>International conference on database and expert systems applications >Efficient Discovery of Correlated Patterns in Transactional Databases Using Items' Support Intervals
【24h】

Efficient Discovery of Correlated Patterns in Transactional Databases Using Items' Support Intervals

机译:使用项目支持间隔在事务数据库中有效发现相关模式

获取原文

摘要

Correlated patterns are an important class of regularities that exist in a transactional database. CoMine uses pattern-growth technique to discover the complete set of correlated patterns that satisfy the user-defined minimum support and minimum all-confidence constraints. The technique involves compacting the database into FP-tree, and mining it recursively by building conditional pattern bases (CPB) for each item (or suffix pattern) in FP-tree. The CPB of the suffix pattern in CoMine represents the set of complete prefix paths in FP-tree co-occurring with itself. Thus, CoMine implicitly assumes that the suffix pattern can concatenate with all items in its prefix paths to generate correlated patterns of higher-order. It has been observed that such an assumption can cause performance problems in CoMine. This paper makes an effort to improve the performance of CoMine by introducing a novel concept known as items' support intervals. The concept says that an item in FP-tree can generate correlated patterns of higher-order by concatenating with only those items in its prefix-paths that have supports within a specific interval. We call the proposed algorithm as CoMine++. Experimental results on various datasets show that CoMine++ can discover high correlated patterns effectively.
机译:关联模式是事务数据库中存在的一类重要的规则。 CoMine使用模式增长技术来发现满足用户定义的最小支持和最小所有置信度约束的相关模式的完整集合。该技术涉及将数据库压缩为FP树,并通过为FP树中的每个项目(或后缀模式)建立条件模式库(CPB)来递归地挖掘数据库。 CoMine中后缀模式的CPB表示与它自己同时出现的FP-tree中完整前缀路径的集合。因此,CoMine隐式地假定后缀模式可以与其前缀路径中的所有项连接在一起,以生成相关的高阶模式。已经观察到,这样的假设会导致CoMine中的性能问题。本文通过引入称为项目支持间隔的新概念来努力提高CoMine的性能。该概念表示,FP树中的项目可以通过仅将其前缀路径中在特定间隔内具有支持的那些项目进行级联来生成更高阶的相关模式。我们将提出的算法称为CoMine ++。在各种数据集上的实验结果表明,CoMine ++可以有效地发现高度相关的模式。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号