首页> 外文期刊>Data & Knowledge Engineering >Discovering frequent itemsets by support approximation and itemset clustering
【24h】

Discovering frequent itemsets by support approximation and itemset clustering

机译:通过支持近似和项目集聚类发现频繁的项目集

获取原文
获取原文并翻译 | 示例
       

摘要

To speed up the task of association rule mining, a novel concept based on support approximation has been previously proposed for generating frequent itemsets. However, the mining technique utilized by this concept may incur unstable accuracy due to approximation error. To overcome this drawback, in this paper we combine a new clustering method with support approximation, and propose a mining method, namely CAC, to discover frequent itemsets based on the Principle of Inclusion and Exclusion. The clustering technique groups highly similar members to improve the accuracy of support approximation. The hit ratio analysis and experimental results presented in this paper verify that CAC improves accuracy. Without repeatedly scanning a database and storing vast information in memory, the CAC method is able mine frequent itemsets with relative stability. The advantages that the CAC method enjoys in both accuracy and performance make it an effective and useful technique for discovering frequent itemsets in a database.
机译:为了加快关联规则挖掘的任务,以前已经提出了一种基于支持近似的新颖概念来生成频繁项集。但是,由于近似误差,该概念所采用的挖掘技术可能会导致精度不稳定。为了克服这个缺点,本文将一种新的聚类方法与支持近似相结合,并提出了一种基于包含与排除原理的挖掘方法,即CAC,以发现频繁项集。聚类技术对高度相似的成员进行分组,以提高支持近似的准确性。本文提出的命中率分析和实验结果证明,CAC提高了准确性。无需重复扫描数据库并将大量信息存储在内存中,CAC方法就可以相对稳定地挖掘频繁的项目集。 CAC方法在准确性和性能上均具有优势,这使其成为一种发现数据库中频繁项集的有效且有用的技术。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号