首页> 外文期刊>Information Systems >Incremental update on sequential patterns in large databases by implicit merging and efficient counting
【24h】

Incremental update on sequential patterns in large databases by implicit merging and efficient counting

机译:通过隐式合并和有效计数对大型数据库中的顺序模式进行增量更新

获取原文
获取原文并翻译 | 示例
       

摘要

Current approaches for sequential pattern mining usually assume that the mining is performed in a static sequence database. However, databases are not static due to update so that the discovered patterns might become invalid and new patterns could be created. In addition to higher complexity, the maintenance of sequential patterns is more challenging than that of association rules owing to sequence merging. Sequence merging, which is unique in sequence databases, requires the appended new sequences to be merged with the existing ones if their customer ids are the same. Re-mining of the whole database appears to be inevitable since the information collected in previous discovery will be corrupted by sequence merging. Instead of re-mining, the proposed IncSP (Incremental Sequential Pattern Update) algorithm solves the maintenance problem through effective implicit merging and efficient separate counting over appended sequences. Patterns found previously are incrementally updated rather than re-mined from scratch. Moreover, the technique of early candidate pruning further speeds up the discovery of new patterns. Empirical evaluation using comprehensive synthetic data shows that IncSP is fast and scalable.
机译:当前用于顺序模式挖掘的方法通常假定在静态序列数据库中执行挖掘。但是,由于更新,数据库不是静态的,因此发现的模式可能无效并且可以创建新的模式。除了更高的复杂性之外,由于序列合并,顺序模式的维护比关联规则的维护更具​​挑战性。序列合并在序列数据库中是唯一的,如果它们的客户ID相同,则要求将附加的新序列与现有序列合并。重新挖掘整个数据库似乎是不可避免的,因为先前发现中收集的信息将因序列合并而损坏。代替重新挖掘,提出的IncSP(增量顺序模式更新)算法通过有效的隐式合并和对附加序列的有效单独计数解决了维护问题。先前找到的模式会增量更新,而不是从头开始重新挖掘。此外,早期候选修剪技术进一步加快了新模式的发现。使用综合的综合数据进行的经验评估表明,IncSP快速且可扩展。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号