首页> 外文期刊>Applied Intelligence: The International Journal of Artificial Intelligence, Neural Networks, and Complex Problem-Solving Technologies >Mining contrast sequential pattern based on subsequence time distribution variation with discreteness constraints
【24h】

Mining contrast sequential pattern based on subsequence time distribution variation with discreteness constraints

机译:基于随后与离散约束的子序列时间分布变化的挖掘序列模式

获取原文
获取原文并翻译 | 示例
       

摘要

Contrast sequential pattern is defined as a pattern that occurs frequently in one sequence dataset but not in the others. Contrast sequential pattern mining has been widely used in many fields, such as customer behavior analysis and medical diagnosis. Existing algorithms first require users to set a distinguishing location and then use this fixed location to identify distribution differences of different subsequences, i.e., the subsequence pattern that appears before the given distinguishing location in one sequence dataset and after the same location in another sequence dataset. However, it is difficult for users to set an appropriate location without sufficient prior knowledge. Since the distinguishing location is different for different subsequences, setting a fixed location may ignore many meaningful patterns. In addition, previous studies rarely considered the time distribution variation of subsequences and the discreteness of patterns. To solve the above problems, we propose a novel method of mining contrast sequential pattern based on subsequence time distribution variation with discreteness constraints in this paper. A suffix-tree based search algorithm, which transforms the dataset to be processed into a tree representation, is designed to mine contrast sequential pattern based on subsequence time distribution variation. Experiments are conducted on real-world time-series datasets, and the experimental results validate the superiority of our method in terms of effectiveness and efficiency when compared with other state-of-the-art methods.
机译:对比度顺序图案被定义为频繁发生在一个序列数据集中但不在其他序列中出现的模式。对比度顺序模式挖掘已广泛用于许多领域,例如客户行为分析和医学诊断。现有算法首先要求用户设置一个区分位置,然后使用该固定位置来识别不同子序列的不同子序列的分布差异,即在一个序列数据集中的给定区分位置之前出现的子序列模式,以及在另一个序列数据集中的相同位置之后。但是,用户难以在没有足够的先验知识的情况下设置适当的位置。由于区分位置对于不同的子序列不同,因此设置固定位置可以忽略许多有意义的模式。此外,之前的研究很少考虑随后的时间分布变化和模式的离散性。为了解决上述问题,我们提出了一种基于随后在本文的离散约束的基于子序列时间分布变化的挖掘对比序列模式的新方法。基于后缀树的搜索算法将要处理的数据集转换为树形表示,旨在基于子节点时间分布变化的对比度顺序模式。实验在现实世界的时间序列数据集进行,实验结果与其他最先进的方法相比,在有效性和效率方面验证了我们的方法的优势。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号