首页> 外文学位 >Efficient mining and maintenance of association rules in large datasets.
【24h】

Efficient mining and maintenance of association rules in large datasets.

机译:在大型数据集中高效挖掘和维护关联规则。

获取原文
获取原文并翻译 | 示例

摘要

Data mining is the exploration and analysis of large quantities of data to discover meaningful patterns and rules. Mining frequent itemsets plays an essential role in many data mining tasks, which attempts to find interesting associations or correlations among a large set of data items. Efficient discovery of frequent large itemsets and its dual problem of mining association rules are well studied and efficient solution techniques have been developed and deployed in data analysis and mining tools. When new transactions are added to the dataset, it is important to maintain such discovered patterns and rules without requiring processing the whole dataset and re-computing from scratch.; In this research, we first focus on the maintenance problem and propose an in-memory technique to identify frequent large itemsets when the data set grows by addition of new transactions. The basic solution idea is to identify and use negative borders for maintenance. We then use this idea and develop a divide-and-conquer technique, based on partitioning , to compute frequent itemsets in large datasets, which do not fit into the main memory. Our experimental results show that the proposed techniques are efficient and scalable.
机译:数据挖掘是对大量数据的探索和分析,以发现有意义的模式和规则。频繁项集的挖掘在许多数据挖掘任务中起着至关重要的作用,这些任务试图在大量数据项之间找到有趣的关联或相关性。对频繁的大型项目集的有效发现及其挖掘关联规则的双重问题进行了深入研究,并且已经开发了有效的解决方案技术并将其部署在数据分析和挖掘工具中。当新的事务被添加到数据集时,重要的是保持这种发现的模式和规则,而无需处理整个数据集并从头开始重新计算。在这项研究中,我们首先关注维护问题,并提出一种内存中技术,以在通过添加新交易而增长数据集时识别频繁出现的大型项目集。基本的解决方案是识别并使用负边界进行维护。然后,我们使用此思想并开发基于分区的分治技术,以计算大型数据集中的频繁项集,这些数据集不适合主内存。我们的实验结果表明,提出的技术是有效且可扩展的。

著录项

  • 作者

    Song, Yu.;

  • 作者单位

    Concordia University (Canada).;

  • 授予单位 Concordia University (Canada).;
  • 学科 Computer Science.
  • 学位 M.Comp.Sc.
  • 年度 2005
  • 页码 88 p.
  • 总页数 88
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 自动化技术、计算机技术;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号