首页> 外文会议> >H-mine: hyper-structure mining of frequent patterns in large databases
【24h】

H-mine: hyper-structure mining of frequent patterns in large databases

机译:H-mine:大型数据库中频繁模式的超结构挖掘

获取原文

摘要

Methods for efficient mining of frequent patterns have been studied extensively by many researchers. However, the previously proposed methods still encounter some performance bottlenecks when mining databases with different data characteristics, such as dense vs. sparse, long vs. short patterns, memory-based vs. disk-based, etc. In this study, we propose a simple and novel hyper-linked data structure, H-struct and a new mining algorithm, H-mine, which takes advantage of this data structure and dynamically adjusts links in the mining process. A distinct feature of this method is that it has very limited and precisely predictable space overhead and runs really fast in memory-based setting. Moreover it can be scaled up to very large databases by database partitioning, and when the data set becomes dense, (conditional) FP-trees can be constructed dynamically as part of the mining process. Our study shows that H-mine has high performance in various kinds of data, outperforms the previously developed algorithms in different settings, and is highly scalable in mining large databases. This study also proposes a new data mining methodology, space-preserving mining, which may have strong impact in the future development of efficient and scalable data mining methods.
机译:许多研究人员已经广泛研究了有效挖掘频繁模式的方法。但是,先前的方法在挖掘具有不同数据特征的数据库时仍然会遇到一些性能瓶颈,例如密集与稀疏,长与短模式,基于内存与基于磁盘等。在这项研究中,我们提出了一个简单新颖的超链接数据结构H-struct和新的挖掘算法H-mine,该算法利用了这种数据结构并在挖掘过程中动态调整了链接。此方法的独特之处在于它具有非常有限且可精确预测的空间开销,并且在基于内存的设置中运行速度非常快。此外,可以通过数据库分区将其扩展到非常大的数据库,并且当数据集变得密集时,(有条件的)FP树可以作为挖掘过程的一部分动态构建。我们的研究表明,H-mine在各种数据中均具有高性能,在不同的环境下性能优于先前开发的算法,并且在挖掘大型数据库时具有高度可扩展性。这项研究还提出了一种新的数据挖掘方法,即空间保留挖掘,它可能对有效和可扩展数据挖掘方法的未来发展产生重大影响。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号