首页> 外文会议>Granular Computing, 2005 IEEE International Conference on >Compact transaction database for efficient frequent pattern mining
【24h】

Compact transaction database for efficient frequent pattern mining

机译:紧凑的交易数据库,用于高效的频繁模式挖掘

获取原文
获取外文期刊封面目录资料

摘要

Mining frequent patterns is one of the fundamental and essential operations in many data mining applications, such as discovering association rules. In this paper, we propose an innovative approach to generating compact transaction databases for efficient frequent pattern mining. It uses a compact tree structure, called CT-tree, to compress the original transactional data. This allows the CT-a priori algorithm, which is revised from the classical a priori algorithm, to generate frequent patterns quickly by skipping the initial database scan and reducing a great amount of I/O time per database scan. Empirical evaluations show that our approach is effective, efficient and promising, while the storage space requirement as well as the mining time can be decreased dramatically on both synthetic and real-world databases.
机译:频繁模式的挖掘是许多数据挖掘应用程序(例如发现关联规则)中的基本操作和必不可少的操作之一。在本文中,我们提出了一种创新的方法来生成紧凑的交易数据库,以进行有效的频繁模式挖掘。它使用称为CT树的紧凑树结构来压缩原始事务数据。这允许从经典先验算法修订而来的CT-a先验算法通过跳过初始数据库扫描并减少每次数据库扫描的大量I / O时间来快速生成频繁模式。实证评估表明,我们的方法是有效,高效和有前途的,而在合成数据库和实际数据库中,存储空间需求以及挖掘时间都可以大大减少。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号