首页> 外文会议>International Conference on Fuzzy Systems and Knowledge Discovery >An Efficient Algorithm for Mining Large Item Sets
【24h】

An Efficient Algorithm for Mining Large Item Sets

机译:挖掘大项目集的高效算法

获取原文

摘要

It propose Online Mining  Algorithm ( OMA) which online discover large item sets. Without pre-setting a default threshold, the OMA algorithm achieves its efficiency and threshold-flexibility by calculating item-sets’ counts. It is unnecessary and  independent of the default threshold and can flexibly adapt to any user’s input threshold. In addition, we propose Cluster-Based Association Rule Algorithm (CARA) creates cluster tables to aid discovery of large item sets. It only requires a single scan of the database, followed by contrasts with the partial cluster tables. It  not only prunes considerable amounts of data reducing the time needed to perform data scans and requiring less contrast, but also ensures the correctness of the mined results. By using the CARA algorithm to create cluster tables in advance, each CPU can be utilized to process a cluster table; thus large item sets can be immediately mined even when the database is very large.
机译:它提出在线挖掘算法(OMA)在线发现大项目集。如果没有预先设置默认阈值,OMA算法通过计算项目集的计数来实现其效率和阈值灵活性。它是不必要的,独立于默认阈值,可以灵活地适应任何用户的输入阈值。此外,我们提出基于群集的关联规则算法(Cara)创建群集表以帮助发现大项目集。它只需要单一扫描数据库,然后与部分群集表格形成对比。它不仅修剪了大量的数据,减少了执行数据扫描所需的时间并需要更少的对比度,而且还可以确保所开采的结果的正确性。通过使用Cara算法预先创建群集表,可以利用每个CPU来处理群集表;因此,即使数据库非常大,也可以立即开采大项目集。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号