首页> 外文会议>WISE 2014 >BOSTER: An Efficient Algorithm for Mining Frequent Unordered Induced Subtrees
【24h】

BOSTER: An Efficient Algorithm for Mining Frequent Unordered Induced Subtrees

机译:趋势:频繁的未排序引起的子树的挖掘算法

获取原文

摘要

Extracting frequent subtrees from the tree structured data has important applications in Web mining. In this paper, we introduce a novel canonical form for rooted labelled unordered trees called the balanced-optimal-search canonical form (BOCF) that can handle the isomorphism problem efficiently. Using BOCF, we define a tree structure guided scheme based enumeration approach that systematically enumerates only the valid subtrees. Finally, we present the balanced optimal search tree miner (BOSTER) algorithm based on BOCF and the proposed enumeration approach, for finding frequent induced subtrees from a database of labelled rooted unordered trees. Experiments on the real datasets compare the efficiency of BOSTER over the two state-of-the-art algorithms for mining induced unordered subtrees, HybridTreeMiner and UNI3. The results are encouraging.
机译:从树结构数据中提取频繁的子树具有网站挖掘中的重要应用。在本文中,我们介绍了一种新颖的规范形式,用于植根标记的无序树木,称为平衡最佳搜索的规范形式(Bocf),可以有效地处理同构态。使用BOCF,我们定义了一种基于树结构的导向方案的枚举方法,系统地仅枚举有效的子树。最后,我们介绍了基于BOCF的平衡最佳搜索树矿工(倾向)算法和所提出的枚举方法,用于从标记的rooted无序树的数据库中查找频繁的诱导子树。真实数据集的实验比较了招收的效率,用于采矿诱导的无序子树,HybridTreeMENER和UNI3的两种最先进的算法。结果令人鼓舞。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号