首页> 外文期刊>Knowledge and information systems >Frequent pattern mining in attributed trees: algorithms and applications
【24h】

Frequent pattern mining in attributed trees: algorithms and applications

机译:属性树中的频繁模式挖掘:算法和应用

获取原文
获取原文并翻译 | 示例
           

摘要

Frequent pattern mining is an important data mining task with a broad range of applications. Initially focused on the discovery of frequent itemsets, studies were extended to mine structural forms like sequences, trees or graphs. In this paper, we introduce a new domain of patterns, attributed trees (atrees), and a method to extract these patterns in a forest of atrees. Attributed trees are trees in which vertices are associated with itemsets. Mining this type of patterns (called asubtrees), which combines tree mining and itemset mining, requires the exploration of a huge search space. To make our approach scalable, we investigate the mining of condensed representations. For attributed trees, the classical concept of closure involves both itemset closure and structural closure. We present three algorithms for mining all patterns, closed patterns w.r.t. itemsets (content) and/or structure in attributed trees. We show that, for low support values, mining content-closed attributed trees is a good compromise between non-redundancy of solutions and execution time.
机译:频繁模式挖掘是一项具有广泛应用程序的重要数据挖掘任务。最初着眼于频繁项集的发现,研究扩展到挖掘序列,树或图等结构形式。在本文中,我们介绍了一种新的模式领域,即属性树(atrees),以及一种在森林中提取这些模式的方法。属性树是其中顶点与项目集相关联的树。挖掘结合了树挖掘和项集挖掘的这种模式(称为子树),需要探索巨大的搜索空间。为了使我们的方法具有可扩展性,我们研究了压缩表示的挖掘。对于属性树,经典的封闭概念涉及项目集封闭和结构封闭。我们提出了三种用于挖掘所有模式的算法,即闭合模式w.r.t.属性树中的项目集(内容)和/或结构。我们表明,对于低支持值,挖掘内容封闭的属性树是解决方案的非冗余性与执行时间之间的良好折衷。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号