...
首页> 外文期刊>IEEE Transactions on Knowledge and Data Engineering >Efficiently mining frequent trees in a forest: algorithms and applications
【24h】

Efficiently mining frequent trees in a forest: algorithms and applications

机译:在森林中有效地挖掘常用树木:算法和应用

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Mining frequent trees is very useful in domains like bioinformatics, Web mining, mining semistructured data, etc. We formulate the problem of mining (embedded) subtrees in a forest of rooted, labeled, and ordered trees. We present TREEMINER, a novel algorithm to discover all frequent subtrees in a forest, using a new data structure called scope-list. We contrast TREEMINER with a pattern matching tree mining algorithm (PATTERNMATCHER), and we also compare it with TREEMINERD, which counts only distinct occurrences of a pattern. We conduct detailed experiments to test the performance and scalability of these methods. We also use tree mining to analyze RNA structure and phylogenetics data sets from bioinformatics domain.
机译:在诸如生物信息学,Web挖掘,挖掘半结构化数据等领域中,频繁树的挖掘非常有用。我们提出了在有根,有标签和有序树的森林中挖掘(嵌入)子树的问题。我们提出了TREEMINER,这是一种新颖的算法,它使用称为范围列表的新数据结构来发现森林中所有常见的子树。我们将TREEMINER与模式匹配树挖掘算法(PATTERNMATCHER)进行对比,并将其与TREEMINERD(仅计算模式的不同出现次数)进行比较。我们进行了详细的实验,以测试这些方法的性能和可伸缩性。我们还使用树挖掘技术来分析生物信息学领域的RNA结构和系统发育数据集。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号