首页> 外文期刊>ETRI journal >A Novel Approach for Mining High-Utility Sequential Patterns in Sequence Databases
【24h】

A Novel Approach for Mining High-Utility Sequential Patterns in Sequence Databases

机译:在序列数据库中挖掘高可用性序列模式的新方法

获取原文
           

摘要

Mining sequential patterns is an important research issue in data mining and knowledge discovery with broad applications. However, the existing sequential pattern mining approaches consider only binary frequency values of items in sequences and equal importance/significance values of distinct items. Therefore, they are not applicable to actually represent many real-world scenarios. In this paper, we propose a novel framework for mining high- utility sequential patterns for more real-life applicable information extraction from sequence databases with non-binary frequency values of items in sequences and different importance/significance values for distinct items. Moreover, for mining high-utility sequential patterns, we propose two new algorithms: UtilityLevel is a high-utility sequential pattern mining with a level-wise candidate generation approach, and UtilitySpan is a high-utility sequential pattern mining with a pattern growth approach. Extensive performance analyses show that our algorithms are very efficient and scalable for mining high-utility sequential patterns.
机译:顺序模式的挖掘是数据挖掘和知识发现中具有广泛应用的重要研究问题。但是,现有的顺序模式挖掘方法仅考虑序列中项的二进制频率值,并且考虑不同项的重要性/重要性值相等。因此,它们不适用于实际代表许多现实情况。在本文中,我们提出了一个新颖的框架,用于挖掘具有高实用性的序列模式,以便从序列数据库中提取具有更实际应用信息的序列,其中序列中项目的非二进制频率值以及不同项目的不同重要性/重要性值。此外,对于挖掘高实用性顺序模式,我们提出了两种新算法:UtilityLevel是使用逐级候选生成方法的高实用性顺序模式挖掘,而UtilitySpan是使用模式增长方法的高实用性顺序模式挖掘。广泛的性能分析表明,我们的算法对于挖掘高实用性顺序模式非常有效且可扩展。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号