首页> 外文期刊>International journal of computers, communications and control >Mining Temporal Sequential Patterns Based on Multi-granularities
【24h】

Mining Temporal Sequential Patterns Based on Multi-granularities

机译:基于多粒度的时间序列模式挖掘

获取原文
           

摘要

Sequential pattern mining is an important data mining problem that can extract frequent subsequences from sequences. However, the times between successive items in a sequence is typically used as user-specified constraints to pre-process the input data or to prune the pattern search space. In either cases, the times cannot be used to identify item intervals of sequential patterns. In this paper, we introduce a form of multi-granularity sequence patterns, which is a sequential pattern where each transition time is annotated with multi-granularity boundary interval and average time derived from the source data rather than the user-predetermined time interval or only a typical time. Then we present a novel algorithm, MG-PrefixSpan, of multiple granularity sequential patterns based on PrefixSpan[, which discovers all such patterns. Empirical evaluation shows that MG-PrefixSpan scales up linearly as the size of database, and has a good scalability with respect to the length of sequence and the size of transaction.
机译:顺序模式挖掘是一个重要的数据挖掘问题,可以从序列中提取频繁的子序列。但是,序列中连续项之间的时间通常用作用户指定的约束条件,以预处理输入数据或修剪模式搜索空间。在这两种情况下,时间都不能用于标识顺序模式的项目间隔。在本文中,我们介绍了一种多粒度序列模式,这是一种顺序模式,其中每个过渡时间都用多粒度边界间隔和从源数据得出的平均时间(而不是用户指定的时间间隔或仅典型的时间。然后,我们提出了一种基于PrefixSpan [的多重粒度顺序模式的新颖算法MG-PrefixSpan [],它发现了所有此类模式。实证评估表明,MG-PrefixSpan随数据库大小线性增长,并且在序列长度和事务大小方面具有良好的可伸缩性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号