首页> 外文期刊>International journal of business information systems >An adaptive algorithm for frequent pattern mining over data streams using diffset strategy
【24h】

An adaptive algorithm for frequent pattern mining over data streams using diffset strategy

机译:一种基于差异集策略的数据流频繁模式挖掘自适应算法

获取原文
获取原文并翻译 | 示例
           

摘要

Frequent pattern mining using sliding window over data streams is commonly used due to its wide applicability. Determining suitable window size and detection of concept change are the major issues and can be addressed by having flexible window based on amount of changes in frequent patterns. For mining frequent patterns over data streams, vertical mining algorithms can be used. However, in these algorithms, size of transaction identifiers (tidsets) and the time for computation of intersection between tidsets is large. Moreover, presence of null transactions does not contribute any useful frequent patterns. A new algorithm called recent frequent pattern mining based on diffset with elimination of null transactions (RFP-DIFF-ENT) over data streams using variable size window is proposed. It stores difference of tidsets and eliminates null transactions which minimise memory and mining time. Experimental results show that proposed algorithm saves computation time, memory usage and minimises the number of frequent patterns.
机译:由于其广泛的适用性,通常使用在数据流上使用滑动窗口的频繁模式挖掘。确定合适的窗口大小和检测概念变化是主要问题,可以通过基于频繁模式变化量的灵活窗口来解决。为了在数据流上挖掘频繁的模式,可以使用垂直挖掘算法。但是,在这些算法中,交易标识符(tidset)的大小和计算tidset之间的交点的时间很大。而且,空交易的存在不会带来任何有用的频繁模式。提出了一种新的算法,该算法基于基于差异集的diffset并使用可变大小窗口消除了数据流上的空事务(RFP-DIFF-ENT)。它存储了差异集,并消除了空事务,从而最大程度地减少了内存和挖掘时间。实验结果表明,该算法节省了计算时间,内存使用量,并减少了频繁模式的数量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号