首页> 外文期刊>International journal of computational intelligence systems >Progressive CFM-Miner: An Algorithm to Mine CFM - Sequential Patterns from a Progressive Database
【24h】

Progressive CFM-Miner: An Algorithm to Mine CFM - Sequential Patterns from a Progressive Database

机译:渐进式CFM-Miner:一种挖掘CFM的算法-渐进式数据库中的顺序模式

获取原文
获取原文并翻译 | 示例
       

摘要

Sequential pattern mining is a vital data mining task to discover the frequently occurring patterns in sequence databases. As databases develop, the problem of maintaining sequential patterns over an extensively long period of time turn into essential, since a large number of new records may be added to a database. To reflect the current state of the database where previous sequential patterns would become irrelevant and new sequential patterns might appear, there is a need for efficient algorithms to update, maintain and manage the information discovered. Several efficient algorithms for maintaining sequential patterns have been developed. Here, we have presented an efficient algorithm to handle the maintenance problem of CFM-sequential patterns (Compact, Frequent, Monetary-constraints based sequential patterns). In order to efficiently capture the dynamic nature of data addition and deletion into the mining problem, initially, we construct the updated CFM-tree using the CFM patterns obtained from the static database. Then, the database gets updated from the distributed sources that have data which may be static, inserted, or deleted. Whenever the database is updated from the multiple sources, CFM tree is also updated by including the updated sequence. Then, the updated CFM-tree is used to mine the progressive CFM-patterns using the proposed tree pattern mining algorithm. Finally, the experimentation is carried out using the synthetic and real life distributed databases that are given to the progressive CFM-miner. The experimental results and analysis provides better results in terms of the generated number of sequential patterns, execution time and the memory usage over the existing IncSpan algorithm.
机译:顺序模式挖掘是一项重要的数据挖掘任务,用于发现序列数据库中频繁发生的模式。随着数据库的发展,在很长一段时间内保持顺序模式的问题变得至关重要,因为可能会将大量新记录添加到数据库中。为了反映数据库的当前状态,以前的顺序模式将变得不相关,并且可能会出现新的顺序模式,因此需要一种有效的算法来更新,维护和管理发现的信息。已经开发了几种用于维持顺序模式的有效算法。在这里,我们提出了一种有效的算法来处理CFM顺序模式(基于紧凑,频繁,基于货币约束的顺序模式)的维护问题。为了有效地捕获数据添加和删除到挖掘问题中的动态性质,首先,我们使用从静态数据库获得的CFM模式构造更新的CFM树。然后,数据库从具有可能是静态,插入或删除的数据的分布式源中更新。每当从多个来源更新数据库时,CFM树也会通过包含更新的序列来更新。然后,使用提出的树模式挖掘算法,将更新后的CFM树用于挖掘渐进CFM模式。最后,使用合成的和现实生活中的分布式数据库进行实验,该数据库被提供给渐进式CFM矿工。在生成的顺序模式数量,执行时间和现有IncSpan算法的内存使用量方面,实验结果和分析提供了更好的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号