首页> 外文期刊>Expert systems with applications >DBV-Miner: A Dynamic Bit-Vector approach for fast mining frequent closed itemsets
【24h】

DBV-Miner: A Dynamic Bit-Vector approach for fast mining frequent closed itemsets

机译:DBV-Miner:一种动态位向量方法,用于快速挖掘频繁关闭的项目集

获取原文
获取原文并翻译 | 示例
           

摘要

Frequent closed itemsets (FCI) play an important role in pruning redundant rules fast. Therefore, a lot of algorithms for mining FCI have been developed. Algorithms based on vertical data formats have some advantages in that they require scan databases once and compute the support of itemsets fast. Recent years, BitTable (Dong & Han, 2007) and IndexBitTable (Song, Vang, & Xu, 2008) approaches have been applied for mining frequent itemsets and results are significant. However, they always use a fixed size of Bit-Vector for each item (equal to number of transactions in a database). It leads to consume more memory for storage Bit-Vectors and the time for computing the intersection among Bit-Vectors. Besides, they only apply for mining frequent itemsets, algorithm for mining FCI based on BitTable is not proposed. This paper introduces a new method for mining FCI from transaction databases. Firstly, Dynamic Bit-Vector (DBV) approach will be presented and algorithms for fast computing the intersection between two DBVs are also proposed. Lookup table is used for fast computing the support (number of bits 1 in a DBV) of itemsets. Next, subsumption concept for memory and computing time saving will be discussed. Finally, an algorithm based on DBV and subsumption concept for mining frequent closed itemsets fast is proposed. We compare our method with CHARM, and recognize that the proposed algorithm is more efficient than CHARM in both the mining time and the memory usage.
机译:频繁关闭项目集(FCI)在快速修剪冗余规则中起着重要作用。因此,已经开发了许多用于挖掘FCI的算法。基于垂直数据格式的算法具有一些优势,因为它们需要一次扫描数据库并快速计算项目集的支持。近年来,BitTable(Dong&Han,2007)和IndexBitTable(Song,Vang,&Xu,2008)方法已被用于挖掘频繁项集,并且结果非常重要。但是,对于每个项目,它们始终使用固定大小的位向量(等于数据库中的事务数)。这将导致消耗更多的内存来存储位向量,以及计算位向量之间的交集所需的时间。此外,它们仅适用于频繁项集的挖掘,没有提出基于BitTable的FCI挖掘算法。本文介绍了一种从交易数据库中挖掘FCI的新方法。首先,将提出动态位向量(DBV)方法,并提出了用于快速计算两个DBV之间的交点的算法。查找表用于快速计算项目集的支持(DBV中的位数1)。接下来,将讨论用于存储和节省计算时间的包含概念。最后,提出了一种基于DBV和包含概念的频繁封闭项目集快速挖掘算法。我们将我们的方法与CHARM进行了比较,并认识到所提算法在挖掘时间和内存使用上均比CHARM更有效。

著录项

  • 来源
    《Expert systems with applications》 |2012年第8期|p.7196-7206|共11页
  • 作者

    Bay Vo; Tzung-Pei Hong; Bac Le;

  • 作者单位

    Department of Computer Science. Ho Chi Minh City University of Technology, Ho Chi Minh, Viet Nam;

    Department of Computer Science and Information Engineering, National University of Kaohsiung, Kaohsiung, Taiwan, ROC,Department of Computer Science and Engineering, National Sun Yat-sen University, Kaohsiung, Taiwan, ROC;

    Department of Computer Science, University of Science, Ho Chi Minh, Viet Nam;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    BitTable; data mining; dynamic bit-vector; frequent closed itemsets; vertical data format;

    机译:位表;数据挖掘;动态位向量;经常关闭的项目集;垂直数据格式;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号