首页> 外文会议>Pacific Asia Conference on Language, Information and Computation >A Compact FP-tree for Fast Frequent Pattern Retrieval
【24h】

A Compact FP-tree for Fast Frequent Pattern Retrieval

机译:用于快速频繁模式检索的紧凑型FP树

获取原文

摘要

Frequent patterns are useful in many data mining problems including query suggestion. Frequent patterns can be mined through frequent pattern tree (FP-tree) data structure which is used to store the compact (or compressed) representation of a transaction database (Han, et al, 2000). In this paper, we propose an algorithm to o press frequent pattern set into a smaller one, and store the set in a modified version of FP-tree (called compact FP-tree) as an inverted indexing of patterns for later quick retrieval (for query suggestion). With the compact FP-tree, we can also restore the original frequent pattern set. Our experiment results show that our compact FP-tree has a very good compression ratio, especially on sparse dataset which is the nature of query log.
机译:频繁模式在许多数据挖掘问题(包括查询建议)中很有用。可以通过频繁模式树(FP-tree)数据结构来挖掘频繁模式,该数据结构用于存储事务数据库的紧凑(或压缩)表示形式(Han等,2000)。在本文中,我们提出了一种算法,可以将频繁模式集压缩为较小的模式,并将其存储在FP-tree的修改版本(称为紧凑FP-tree)中,作为模式的反向索引,以供以后快速检索(用于查询建议)。使用紧凑的FP树,我们还可以恢复原始的频繁模式集。我们的实验结果表明,我们的紧凑型FP树具有很好的压缩率,尤其是在稀疏数据集上,这是查询日志的本质。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号