Extraction of Frequent Tree Patterns without Subtrees Maintenance

机译：无需子树维护的频繁树模式提取

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The inherent flexibility in both structure and semantics let tree capture most kinds of data, model a wide variety of data sources, and produce an enormous number of information. The ability to extract valuable knowledge from them becomes increasingly important and desirable, however, existing tree mining algorithms suffer from several serious pitfalls in finding frequent patterns from massive tree datasets, because most of them have used a priori property for candidate generation and frequency counting. Some of the major problems are due to (1) modeling data as hierarchical tree structure, (2) computationally high cost of the candidate maintenance, (3) repetitious input dataset scans, and (4) the high memory dependency. Therefore, a more efficient and practical approach for tree data is required. In this paper, we systematically develop the pattern growth method instead of the a priori method, for mining maximal frequent tree patterns which are special frequent patterns of a set of trees. The proposed method not only gets rid of the process for infrequent subtrees pruning, but also totally eliminates the problem of generating candidate subtrees. Hence, it significantly improves the whole mining process.

机译：结构和语义上固有的灵活性使树可以捕获大多数数据，为各种数据源建模，并产生大量信息。从它们中提取有价值的知识的能力变得越来越重要和令人期望，但是，现有的树挖掘算法在从大量树数据集中查找频繁模式时会遇到一些严重的陷阱，因为它们中的大多数已将先验属性用于候选者生成和频率计数。一些主要问题归因于（1）将数据建模为分层树结构，（2）候选维护的计算成本高，（3）重复的输入数据集扫描和（4）高度的内存依赖性。因此，需要用于树数据的更有效和实用的方法。在本文中，我们系统地开发了模式增长方法而不是先验方法，以挖掘最大频繁树模式，该模式是一组树的特殊频繁模式。所提出的方法不仅摆脱了子树不频繁修剪的过程，而且完全消除了生成候选子树的问题。因此，它极大地改善了整个采矿过程。

著录项

来源
《Future Generation Communication and Networking Symposia, FGCNS, 2008 Second International Conference on》|2008年|P.54-59|共6页
会议地点
作者
Paik; Juryon; Choi; Wongil; Lee; Eunjoo; Kim; Ung Mo;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类工业技术;
关键词
Frequent subtrees; Pattern-growth; Tree Mining;

机译：频繁的子树;模式增长;树挖掘;

相似文献

外文文献
中文文献
专利

1. Mining Frequent Patterns without Candidate Generation: A Frequent-Pattern Tree Approach [J] . Jiawei Han, Jian Pei, Yiwen Yin, Data mining and knowledge discovery . 2004,第1期

机译：没有候选生成的频繁模式：频繁模式树方法
2. Efficient Incremental Maintenance of Frequent Patterns with FP-Tree [J] . Xiu-Li Ma, Yun-Hai Tong, Shi-Wei Tang, 计算机科学技术学报（英文版） . 2004,第006期

机译：使用FP-Tree高效地维护频繁模式
3. EFFICIENT MINING OF CLOSED TREE PATTERNS FROM LARGE TREE DATABASES WITH SUBTREE CONSTRAINT [J] . VIET ANH NGUYEN, KOICHIRO DOI, AKIHIRO YAMAMOTO International Journal of Artificial Intelligence Tools: Architectures, Languages, Algorithms . 2012,第6期

机译：具有次约束的大型树数据库中闭合树模式的高效挖掘
4. Extraction of Frequent Tree Patterns without Subtrees Maintenance [C] . International Conference on Future Generation Communication and Networking Symposia . 2008

机译：提取频繁的树木模式，没有子树维护
5. Frequent Itemset Hiding Algorithm Using Frequent Pattern Tree Approach. [D] . Alnatsheh, Rami. 2012

机译：使用频繁模式树方法的频繁项集隐藏算法。
6. Enumerating all maximal frequent subtrees in collections of phylogenetic trees [O] . Akshay Deepak, David Fernández-Baca 2014

机译：枚举系统发育树集合中的所有最大频繁子树
7. Mining Frequent Patterns without Candidate Generation: A Frequent-Pattern Tree Approach [O] . Jiawei Han, Jian Pei, Yiwen Yin, 2004

机译：挖掘没有候选者生成的频繁模式：频繁模式树方法

Extraction of Frequent Tree Patterns without Subtrees Maintenance

摘要

著录项

相似文献

相关主题

期刊订阅