首页> 外文会议>International conference on web-age information management >Two-Phase Mining for Frequent Closed Episodes
【24h】

Two-Phase Mining for Frequent Closed Episodes

机译:频繁闭幕的两阶段挖掘

获取原文

摘要

The concept of episodes was introduced for discovering the useful and interesting temporal patterns from the sequential data. Over the years, many episode mining strategies have been suggested, which can be roughly classified into two classes: Apriori-based breadth-first algorithms and projection-based depth-first algorithms. As we know, both kinds of algorithms are level-wise pattern growth methods, so that they have higher computational overhead due to level-wise growth iteration. In addition, their mining time will increase with the increase of sequence length. In the paper, we propose a novel two-phase strategy to discover frequent closed episodes. That is, in phase Ⅰ, we present a level-wise shrinking mechanism, based on maximal duration episodes, to find the candidate frequent closed episodes from the episodes with the same 2-neighboring episode prefix, and in phase Ⅱ, we compare the candidates with different prefixes to discover the final frequent closed episodes. The advantage of the suggested mining strategy is it can reduce mining time due to narrowing episode mapping range when doing closure judgment. Experiments on simulated and real datasets demonstrate that the suggested strategy is effective and efficient.
机译:引入情节的概念是为了从顺序数据中发现有用和有趣的时间模式。多年来,已经提出了许多情节挖掘策略,这些策略可以大致分为两类:基于Apriori的广度优先算法和基于投影的深度优先算法。众所周知,这两种算法都是逐级模式增长方法,因此由于逐级增长迭代,它们具有较高的计算开销。另外,它们的挖掘时间将随着序列长度的增加而增加。在本文中,我们提出了一种新颖的两阶段策略来发现频繁的封闭情节。也就是说,在第一阶段,我们基于最大持续时间事件,提出了一种逐级缩小的机制,从具有相同的两个相邻事件前缀的事件中找到候选者频繁关闭事件,而在第二阶段,我们对候选者进行比较使用不同的前缀来发现最终的频繁封闭情节。建议的挖掘策略的优势在于,由于在进行闭合判断时缩小了情节映射范围,因此可以减少挖掘时间。在模拟数据集和真实数据集上进行的实验表明,该策略是有效且高效的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号