首页> 外文会议>International conference on web information systems engineering >BOSTER: An Efficient Algorithm for Mining Frequent Unordered Induced Subtrees
【24h】

BOSTER: An Efficient Algorithm for Mining Frequent Unordered Induced Subtrees

机译:BOSTER:一种用于挖掘频繁的无序诱导子树的高效算法

获取原文

摘要

Extracting frequent subtrees from the tree structured data has important applications in Web mining. In this paper, we introduce a novel canonical form for rooted labelled unordered trees called the balanced-optimal-search canonical form (BOCF) that can handle the isomorphism problem efficiently. Using BOCF, we define a tree structure guided scheme based enumeration approach that systematically enumerates only the valid subtrees. Finally, we present the balanced optimal search tree miner (BOSTER) algorithm based on BOCF and the proposed enumeration approach, for finding frequent induced subtrees from a database of labelled rooted unordered trees. Experiments on the real datasets compare the efficiency of BOSTER over the two state-of-the-art algorithms for mining induced unordered subtrees, HybridTreeMiner and UNO. The results are encouraging.
机译:从树状结构数据中提取频繁的子树在Web挖掘中具有重要的应用。在本文中,我们为生根标记的无序树引入了一种新的规范形式,称为平衡最优搜索规范形式(BOCF),它可以有效处理同构问题。使用BOCF,我们定义了一种基于树结构的基于枚举方案的枚举方法,该枚举方法仅系统地枚举有效的子树。最后,我们提出了一种基于BOCF和提出的枚举方法的平衡最优搜索树挖掘器(BOSTER)算法,用于从带有标签的无根树的数据库中查找频繁的诱导子树。在真实数据集上进行的实验比较了BOSTER在挖掘感应无序子树的两种最新算法(HybridTreeMiner和UNO)上的效率。结果令人鼓舞。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号