首页> 外文会议>Formal concept analysis. >Finding Top-iV Colossal Patterns Based on Clique Search with Dynamic Update of Graph
【24h】

Finding Top-iV Colossal Patterns Based on Clique Search with Dynamic Update of Graph

机译:基于具有动态更新图的小组搜索查找Top-iV巨大模式

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

In this paper, we discuss a method for finding top-N colossal frequent patterns. A colossal pattern we try to extract is a maximal pattern with top-N largest length. Since colossal patterns can be found in relatively lower areas of an itemset (concept) lattice, an efficient method with some effective pruning mechanisms is desired. We design a depth-first branch-and-bound algorithm for finding colossal patterns with top-N length, where a notion of pattern graph plays an important role. A pattern graph is a compact representation of the class of frequent patterns with a designated length. A colossal pattern can be found as a clique in a pattern graph satisfying a certain condition. From this observation, we design an algorithm for finding our target patterns by examining cliques in a graph defined from the pattern graph. The algorithm is based on a depth-first branch-and-bound method for finding a maximum clique. It should be noted that as our search progresses, the graph we are concerned with is dynamically updated into a sparser one which makes our task of finding cliques much easier and the branch-and-bound pruning more powerful. To the best of our knowledge, it is the first algorithm tailored for the problem which can exactly identify top-N colossal patterns. In our experimentation, we compare our algorithm with famous maximal frequent itemset miners from the viewpoint of computational efficiency for a synthetic and a benchmark dataset.
机译:在本文中,我们讨论了一种用于查找前N个巨大频繁模式的方法。我们尝试提取的一个巨大模式是最大长度为N的最大模式。由于可以在项目集(概念)晶格的相对较低的区域中找到巨大的图案,因此需要一种具有一些有效修剪机制的有效方法。我们设计了深度优先的分支定界算法,以查找具有前N个长度的巨大模式,其中模式图的概念起着重要的作用。模式图是具有指定长度的常见模式类别的紧凑表示。在满足一定条件的图案图中,可以找到巨大的图案作为集团。根据这一观察结果,我们设计了一种算法,该算法可通过检查从模式图定义的图中的团来找到目标模式。该算法基于深度优先分支定界方法来查找最大集团。应当注意的是,随着搜索的进行,我们关注的图形会动态更新为稀疏的图形,这使我们更容易找到团簇,而分支限界修剪功能更加强大。据我们所知,这是针对该问题量身定制的第一个算法,可以准确地识别出前N个巨大的模式。在我们的实验中,我们从综合和基准数据集的计算效率的观点出发,将我们的算法与著名的最大频繁项目集挖掘器进行了比较。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号