首页> 外文会议>International conference on database systems for advanced applications >Mining Itemset-Based Distinguishing Sequential Patterns with Gap Constraint
【24h】

Mining Itemset-Based Distinguishing Sequential Patterns with Gap Constraint

机译:基于项目集的区分与间隙约束的分辨序列模式

获取原文

摘要

Mining contrast sequential patterns, which are sequential patterns that characterize a given sequence class and distinguish that class from another given sequence class, has a wide range of applications including medical informatics, computational finance and consumer behavior analysis. In previous studies on contrast sequential pattern mining, each element in a sequence is a single item or symbol. This paper considers a more general case where each element in a sequence is a set of items. The associated contrast sequential patterns will be called itemset-based distinguishing sequential patterns (itemset-DSP). After discussing the challenges on mining itemset-DSP, we present iDSP-Miner, a mining method with various pruning techniques, for mining itemset-DSPs that satisfy given support and gap constraint. In this study, we also propose a concise border-like representation (with exclusive bounds) for sets of similar itemset-DSPs and use that representation to improve efficiency of our proposed algorithm. Our empirical study using both real data and synthetic data demonstrates that iDSP-Miner is effective and efficient.
机译:挖掘对比度序列模式,它们是表征给定序列类的顺序模式,并区分从另一个给定的序列类别中的类,具有广泛的应用,包括医疗信息,计算金融和消费者行为分析。在以前关于对比度顺序模式挖掘的研究中,序列中的每个元素是单个项目或符号。本文考虑了更常规的情况,其中序列中的每个元素是一组项目。相关的对比度顺序模式将被称为基于项目集的区分顺序模式(itemset-dsp)。在讨论挖掘项目集-DSP上的挑战之后,我们呈现IDSP-MINER,一种具有各种修剪技术的挖掘方法,用于挖掘给定支持和间隙约束的挖掘项目集-DSP。在本研究中,我们还提出了一个简明的边界状表示(具有独占限制),用于类似的项目集-DSP,并使用该表示来提高我们所提出的算法的效率。我们使用真实数据和合成数据的实证研究表明IDSP-Miner是有效和有效的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号