首页> 外文会议>International Conference on Enterprise Information Systems >MINING SEQUENTIAL PATTERNS WITH REGULAR EXPRESSION CONSTRAINTS USING SEQUENTIAL PATTERN TREE
【24h】

MINING SEQUENTIAL PATTERNS WITH REGULAR EXPRESSION CONSTRAINTS USING SEQUENTIAL PATTERN TREE

机译:使用顺序图案树的常规表达约束挖掘序列模式

获取原文

摘要

The significant growth of sequence database sizes in recent years increase the importance of developing new techniques for data organization and query processing. Discovering sequential patterns is an important problem in data mining with a host of application domains. For effectiveness and efficiency consideration, constraints are essential for many sequential applications. In this paper, we give a brief review of different sequential pattern mining algorithms, and then introduce a new algorithm (termed NewSPIRIT) for mining frequent sequential patterns that satisfy user specified regular expression constraints. The general idea of our algorithm is to use a finite state automata to represent the regular expression constraints and build a sequential pattern tree that represents all sequences of data which satisfy this constraints by scanning the database of sequences only once. Experimental results shows that our NewSPIRIT is much more efficient than existing algorithms.
机译:近年来序列数据库大小的显着增长增加了开发数据组织和查询处理新技术的重要性。发现顺序模式是数据挖掘的一个重要问题,具有一系列应用程序域。为了有效和效率考虑,约束对于许多顺序应用是必不可少的。在本文中,我们介绍了对不同顺序模式挖掘算法的简要介绍,然后介绍一种新的算法(称为NewsPirit),用于满足用户指定的正则表达式约束的频繁顺序模式。我们的算法的一般概念是使用有限状态自动机来表示正则表达式约束,并构建一个顺序模式树,该树是通过仅扫描一次序列数据库来满足该约束的所有数据序列。实验结果表明,我们的报告比现有算法更有效。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号