首页> 外文会议>Computers and Their Applications >DPC-I: An Efficient Algorithm to Find the Large Itemset of a Specific Size
【24h】

DPC-I: An Efficient Algorithm to Find the Large Itemset of a Specific Size

机译:DPC-I:查找特定大小的大型商品集的高效算法

获取原文

摘要

Discovery of association rules is an important problem in the area of data mining. For this problem, how to efficiently count large itemsets is the major work, where a large itemset is a set of items appearing in a sufficient number of transactions. Most of previous study generated candidates and tested them to gain the large itemsets. It is very time-exhausted. Therefore, in this paper, we propose the DPC-I algorithm, which Directly Prunes non-large itemsets and Counts the others, to find a large itemset of a certain Interesting size. In the DPC-I algorithm, given a k, we efficiently construct L_k based on L_2, instead of step by step, where L_k denotes the set of large k-itemsets with minimum support. We conduct several experiments using different synthetic transaction databases. The simulation results show that the DPC-I algorithm outperforms the Apriori algorithm and FP-Growth algorithm in the execution time for all transaction database settings.
机译:关联规则的发现是数据挖掘领域中的重要问题。对于此问题,如何有效地计算大型项目集是一项主要工作,其中大型项目集是出现在足够数量的交易中的一组项目。以前的大多数研究都生成了候选对象,并对其进行了测试以获取大项目集。这非常耗时。因此,在本文中,我们提出了DPC-I算法,该算法直接修剪非大型项目集并计算其他项目集,以找到具有一定大小的大型项目集。在DPC-I算法中,给定一个k,我们基于L_2高效地构造L_k,而不是逐步地构造L_k,其中L_k表示具有最小支持的大k个项集的集合。我们使用不同的综合交易数据库进行了几次实验。仿真结果表明,在所有事务数据库设置的执行时间上,DPC-I算法均优于Apriori算法和FP-Growth算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号