...
首页> 外文期刊>Expert Systems with Application >Incremental mining of weighted maximal frequent itemsets from dynamic databases
【24h】

Incremental mining of weighted maximal frequent itemsets from dynamic databases

机译:从动态数据库增量挖掘加权的最大频繁项目集

获取原文
获取原文并翻译 | 示例
           

摘要

Frequent itemset mining allows us to find hidden, important information from large databases. Moreover, processing incremental databases in the itemset mining area has become more essential because a huge amount of data has been accumulated continually in a variety of application fields and users want to obtain mining results from such incremental data in more efficient ways. One of the major problems in incremental itemset mining is that the corresponding mining results can be very large-scale according to threshold settings and data volumes. In addition, it is considerably hard to analyze all of them and find meaningful information. Furthermore, not all of the mining results become actually important information. In this paper, to solve these problems, we propose an algorithm for mining weighted maximal frequent itemsets from incremental databases. By scanning a given incremental database only once, the proposed algorithm can not only conduct its mining operations suitable for the incremental environment but also extract a smaller number of important itemsets compared to previous approaches. The proposed method also has an effect on expert and intelligent systems since it can automatically provide more meaningful pattern results reflecting characteristics of given incremental databases and threshold settings, which can help users analyze the given data more easily. Our comprehensive experimental results show that the proposed algorithm is more efficient and scalable than previous state-of-the-art algorithms. (C) 2016 Elsevier Ltd. All rights reserved.
机译:频繁的项集挖掘使我们能够从大型数据库中找到隐藏的重要信息。而且,在项集挖掘区域中处理增量数据库变得更加重要,因为在各种应用领域中不断积累了大量数据,并且用户希望以更有效的方式从此类增量数据中获取挖掘结果。增量项集挖掘的主要问题之一是,根据阈值设置和数据量,相应的挖掘结果可能非常大规模。此外,要分析所有这些信息并找到有意义的信息非常困难。此外,并非所有的采矿结果实际上都是重要的信息。为了解决这些问题,我们提出了一种从增量数据库中挖掘加权最大频繁项集的算法。通过仅扫描给定的增量数据库一次,与以前的方法相比,所提出的算法不仅可以进行适合增量环境的挖掘操作,而且可以提取较少数量的重要项集。所提出的方法还可以对专家和智能系统产生影响,因为它可以自动提供更有意义的模式结果,以反映给定增量数据库和阈值设置的特征,从而可以帮助用户更轻松地分析给定数据。我们全面的实验结果表明,与以前的最新技术相比,该算法具有更高的效率和可扩展性。 (C)2016 Elsevier Ltd.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号