Efficiently mining frequent trees in a forest: algorithms and applications

Zaki M.J.

首页> 外文期刊>IEEE Transactions on Knowledge and Data Engineering >Efficiently mining frequent trees in a forest: algorithms and applications

【24h】

Efficiently mining frequent trees in a forest: algorithms and applications

机译：在森林中有效地挖掘常用树木：算法和应用

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Mining frequent trees is very useful in domains like bioinformatics, Web mining, mining semistructured data, etc. We formulate the problem of mining (embedded) subtrees in a forest of rooted, labeled, and ordered trees. We present TREEMINER, a novel algorithm to discover all frequent subtrees in a forest, using a new data structure called scope-list. We contrast TREEMINER with a pattern matching tree mining algorithm (PATTERNMATCHER), and we also compare it with TREEMINERD, which counts only distinct occurrences of a pattern. We conduct detailed experiments to test the performance and scalability of these methods. We also use tree mining to analyze RNA structure and phylogenetics data sets from bioinformatics domain.

机译：在诸如生物信息学，Web挖掘，挖掘半结构化数据等领域中，频繁树的挖掘非常有用。我们提出了在有根，有标签和有序树的森林中挖掘（嵌入）子树的问题。我们提出了TREEMINER，这是一种新颖的算法，它使用称为范围列表的新数据结构来发现森林中所有常见的子树。我们将TREEMINER与模式匹配树挖掘算法（PATTERNMATCHER）进行对比，并将其与TREEMINERD（仅计算模式的不同出现次数）进行比较。我们进行了详细的实验，以测试这些方法的性能和可伸缩性。我们还使用树挖掘技术来分析生物信息学领域的RNA结构和系统发育数据集。

著录项

来源
《IEEE Transactions on Knowledge and Data Engineering》 |2005年第8期|p.1021-1035|共15页
作者
Zaki M.J.;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类计算技术、计算机技术;
关键词
data mining; macromolecules; pattern matching; tree data structures; PATTERNMATCHER; RNA structure; TREEMINER; bioinformatics domain; data mining; data structure; frequent tree mining; labeled trees; pattern matching; phylogenetics data sets; scope-list; Index Terms- F;

机译：数据挖掘;大分子;模式匹配;树数据结构;PATTERNMATCHER;RNA结构;树;

相似文献

外文文献
中文文献
专利

1. Frequent pattern mining in attributed trees: algorithms and applications [J] . Pasquier Claude, Sanhes Jeremy, Flouvat Frederic, Knowledge and information systems . 2016,第3期

机译：属性树中的频繁模式挖掘：算法和应用
2. An Improvised Frequent Pattern Tree Based Association Rule Mining Technique with Mining Frequent Item Sets Algorithm and a Modified Header Table [J] . Vandit Agarwal, Mandhani Kushal, Preetham Kumar International Journal of Data Mining & Knowledge Management Process . 2015,第2期

机译：一种改进的基于频繁模式树的关联规则挖掘技术，具有挖掘频繁项集算法和改进的表头表的能力
3. An efficient frequent pattern mining algorithm using a highly compressed prefix tree [J] . Zhu Xiaolin, Liu Yongguo Intelligent data analysis . 2019,第SUPPLa期

机译：使用高度压缩的前缀树的高效频繁模式挖掘算法
4. Efficiently mining frequent trees in a forest [C] . Mohammed J. Zaki Proceedings of the Eighth ACM SIGKDD international conference on knowledge discovery and data mining(KDD-2000) . 2002

机译：在森林中有效地开采常见树木
5. Frequent Itemset Hiding Algorithm Using Frequent Pattern Tree Approach. [D] . Alnatsheh, Rami. 2012

机译：使用频繁模式树方法的频繁项集隐藏算法。
6. Empirical study of seven data mining algorithms on different characteristics of datasets for biomedical classification applications [O] . Yiyan Zhang, Yi Xin, Qin Li, 2017

机译：七种数据挖掘算法在生物医学分类应用中不同数据集特征的实证研究
7. Efficiently Mining Frequent Trees in a Forest: Algorithms and Applications [O] . Mohammed J. Zaki 2005

机译：有效挖掘森林中频繁的树木：算法和应用
8. Parallel Formulations of Tree-Projection Based Sequence Mining Algorithms. [R] . Guralnik, V., Karypis, G. 2003

机译：基于树投影的序列挖掘算法的并行公式。

Efficiently mining frequent trees in a forest: algorithms and applications

摘要

著录项

相似文献

相关主题

期刊订阅