首页> 外文期刊>Procedia Computer Science >Frequent Item Set Mining Using INC_MINE in Massive Online Analysis Frame Work
【24h】

Frequent Item Set Mining Using INC_MINE in Massive Online Analysis Frame Work

机译:在大规模在线分析框架中使用INC_MINE频繁进行项目集挖掘

获取原文
       

摘要

Frequent Pattern Mining is one of the major data mining techniques, which is exhaustively studied in the past decade. The technological advancements have resulted in huge data generation, having increased rate of data distribution. The generated data is called as a ‘data stream’. Data streams can be mined only by using sophisticated techniques. The paper aims at carrying out frequent pattern mining on data streams. Stream mining has great challenges due to high memory usage and computational costs. Massive online analysis frame work is a software environment used to perform frequent pattern mining using INC_MINE algorithm. The algorithm uses the method of closed frequent mining. The data sets used in the analysis are Electricity data set and Airline data set. The authors also generated their own data set, OUR-GENERATOR for the purpose of analysis and the results are found interesting. In the experiments five samples of instance sizes (10000, 15000, 25000, 35000, 50000) are used with varying minimum support and window sizes for determining frequent closed itemsets and semi frequent closed itemsets respectively. The present work establishes that association rule mining could be performed even in the case of data stream mining by INC_MINE algorithm by generating closed frequent itemsets which is first of its kind in the literature.
机译:频繁模式挖掘是主要的数据挖掘技术之一,在过去十年中进行了详尽的研究。技术的进步导致了巨大的数据生成,并提高了数据分发率。生成的数据称为“数据流”。只能使用复杂的技术来挖掘数据流。本文旨在对数据流进行频繁的模式挖掘。由于高内存使用和计算成本,流挖掘面临巨大挑战。大规模的在线分析框架是一种软件环境,用于使用INC_MINE算法执行频繁的模式挖掘。该算法采用封闭频繁挖掘的方法。分析中使用的数据集是电力数据集和航空公司数据集。作者还生成了自己的数据集OUR-GENERATOR进行分析,结果令人感兴趣。在实验中,使用实例大小(10000、15000、25000、35000、50000)的五个样本分别具有不同的最小支持和窗口大小来分别确定频繁关闭项目集和半频繁关闭项目集。本工作建立了即使在通过INC_MINE算法进行数据流挖掘的情况下,也可以通过生成封闭的频繁项集来执行关联规则挖掘,这在文献中尚属首次。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号