首页> 外文会议>International conference on web-age information management >More Efficient Algorithm for Mining Frequent Patterns with Multiple Minimum Supports
【24h】

More Efficient Algorithm for Mining Frequent Patterns with Multiple Minimum Supports

机译:具有多个最小支持的频繁模式挖掘的更高效算法

获取原文

摘要

Frequent pattern mining (FPM) is an important data mining task, having numerous applications. However, an important limitation of traditional FPM algorithms, is that they rely on a single minimum support threshold to identify frequent patterns (FPs). As a solution, several algorithms have been proposed to mine FPs using multiple minimum supports. Nevertheless, a crucial problem is that these algorithms generally consume a large amount of memory and have long execution times. In this paper, we address this issue by introducing a novel algorithm named efficient discovery of Frequent Patterns with Multiple minimum supports from the Enumeration-tree (FP-ME). The proposed algorithm discovers FPs using a novel Set-Enumeration-tree structure with Multiple minimum supports (ME-tree), and employs a novel sorted downward closure (SDC) property of FPs with multiple minimum supports. The proposed algorithm directly discovers FPs from the ME-tree without generating candidates. Furthermore, an improved algorithms, named FP-ME_(DiffSet), is also proposed based on the DiffSet concept, to further increase mining performance. Substantial experiments on real-life datasets show that the proposed approaches not only avoid the "rare item problem", but also efficiently and effectively discover the complete set of FPs in transac-tional databases.
机译:频繁模式挖掘(FPM)是一项重要的数据挖掘任务,具有许多应用程序。但是,传统FPM算法的一个重要局限性在于它们依靠单个最小支持阈值来识别频繁模式(FP)。作为解决方案,已提出了几种使用多个最小支持量来挖掘FP的算法。然而,一个关键问题是这些算法通常消耗大量内存,并且执行时间长。在本文中,我们通过引入一种新颖的算法来解决此问题,该算法名为“有效模式”,它具有枚举树(FP-ME)的多个最小支持。所提出的算法使用具有多个最小支持的新颖的Set-Enumeration-tree结构(ME-tree)发现FP,并采用具有多个最小支持的FP的新颖排序向下闭合(SDC)属性。所提出的算法直接从ME树中发现FP,而不会产生候选。此外,还基于DiffSet概念提出了一种名为FP-ME_(DiffSet)的改进算法,以进一步提高挖掘性能。对现实数据集的大量实验表明,所提出的方法不仅避免了“稀有物品问题”,而且还有效地在交易数据库中发现了完整的FP集。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号