首页> 外文期刊>Journal of Southeast University >Mining condensed frequent subtree base
【24h】

Mining condensed frequent subtree base

机译:挖掘压缩频繁子树库

获取原文
获取原文并翻译 | 示例
       

摘要

In frequent tree pattern mining, the number of frequent subtrees generated is often too large. To tackle this problem, the concept of condensed frequent subtree base is proposed. The base consists of the maximal frequent subtrees for a series of support thresholds. It is a subset of frequent subtrees, and is used to approximate the support of arbitrary frequent subtrees with guaranteed maximal error bound. In. addition, an algorithm is developed to mine such a condensed subtree base in a database of labeled rooted ordered trees. The algorithm adopts the way of right-most extension to generate systematically all frequent rooted ordered subtrees. Several techniques are proposed to prune the branches that do not correspond to the maximal frequent subtrees. Heuristic techniques are used to arrange the order of computation so that relatively expensive computation is avoided as much as possible. Experimental results show that the size of the base is less than 10 percent of that of the complete set, and the algorithm outperforms the previous algorithms.
机译:在频繁树模式挖掘中,生成的频繁子树的数量通常太大。为了解决这个问题,提出了压缩频繁子树库的概念。该基数由一系列支持阈值的最大频繁子树组成。它是频繁子树的子集,用于在保证最大误差范围的情况下近似支持任意频繁子树。在。另外,开发了一种算法来在标记的根有序树的数据库中挖掘这种压缩的子树库。该算法采用最右边扩展的方式来系统地生成所有频繁生根的有序子树。提出了几种技术来修剪不对应于最大频繁子树的分支。启发式技术用于安排计算顺序,以便尽可能避免使用相对昂贵的计算。实验结果表明,该库的大小小于完整库的10%,并且该算法优于以前的算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号