首页> 外文期刊>Data mining and knowledge discovery >NegPSpan: efficient extraction of negative sequential patterns with embedding constraints
【24h】

NegPSpan: efficient extraction of negative sequential patterns with embedding constraints

机译:negpspan:高效提取负序列模式,嵌入约束

获取原文
获取原文并翻译 | 示例
           

摘要

Sequential pattern mining is concerned with the extraction of frequent or recurrent behaviors, modeled as subsequences, from a sequence dataset. Such patterns inform about which events are frequently observed in sequences, i.e. events that really happen. Sometimes, knowing that some specific event does not happen is more informative than extracting observed events. Negative sequential patterns (NSPs) capture recurrent behaviors by patterns having the form of sequences mentioning both observed events and absence of events. Few approaches have been proposed to mine such NSPs. In addition, the syntax and semantics of NSPs differ in the different methods which makes it difficult to compare them. This article provides a unified framework for the formulation of the syntax and the semantics of NSPs. Then, we introduce a new algorithm, NegPSpan, that extracts NSPs using a prefix-based depth-first scheme, enabling maxgap constraints that other approaches do not take into account. The formal framework highlights the differences between the proposed approach and methods from the literature, especially against the state of the art approach eNSP. Intensive experiments on synthetic and real datasets show that NegPSpan can extract meaningful NSPs and that it can process bigger datasets than eNSP thanks to significantly lower memory requirements and better computation times.
机译:顺序模式挖掘涉及从序列数据集中提取为常规或经常性行为的提取。这种模式通知在序列中经常观察到哪些事件,即真正发生的事件。有时,知道一些特定事件不会发生,而不是提取观察到的事件。负顺序图案(NSP)通过具有提及观察到的事件和不存在事件的序列形式的模式捕获复发行为。已经提出了很少的方法来挖掘此类NSP。此外,NSP的语法和语义在不同的方法中不同,这使得难以比较它们。本文为制定语法和NSP的语义提供了统一的框架。然后,我们介绍了一种新的算法,NegPSPAN,使用基于前缀的深度的第一个方案提取NSP,使得可以实现其他方法不考虑的MaxGap约束。正式框架突出了文献中提出的方法和方法之间的差异,尤其是违反现有技术的国家。合成和实时数据集的密集实验表明,NegPSPAN可以提取有意义的NSP,并且它可以根据ENSP处理更大的数据集,并且由于显着降低了内存要求和更好的计算时间。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号