首页> 外文会议>International Conference on Future Generation Communication and Networking Symposia >Extraction of Frequent Tree Patterns without Subtrees Maintenance
【24h】

Extraction of Frequent Tree Patterns without Subtrees Maintenance

机译:提取频繁的树木模式,没有子树维护

获取原文

摘要

The inherent flexibility in both structure and semantics let tree capture most kinds of data, model a wide variety of data sources, and produce an enormous number of information. The ability to extract valuable knowledge from them becomes increasingly important and desirable, however, existing tree mining algorithms suffer from several serious pitfalls in finding frequent patterns from massive tree datasets, because most of them have used a priori property for candidate generation and frequency counting. Some of the major problems are due to (1) modeling data as hierarchical tree structure, (2) computationally high cost of the candidate maintenance, (3) repetitious input dataset scans, and (4) the high memory dependency. Therefore, a more efficient and practical approach for tree data is required. In this paper, we systematically develop the pattern growth method instead of the a priori method, for mining maximal frequent tree patterns which are special frequent patterns of a set of trees. The proposed method not only gets rid of the process for infrequent subtrees pruning, but also totally eliminates the problem of generating candidate subtrees. Hence, it significantly improves the whole mining process.
机译:结构和语义的固有灵活性让树捕获大多数数据,旨在造型各种数据源,并产生巨大数量的信息。从它们中提取有价值的知识的能力变得越来越重要,并且可以越来越重要,然而,现有的树挖掘算法遭受了几个严重的缺陷,在来自大规模树数据集中发现频繁的模式,因为它们中的大多数已经使用了候选生成和频率计数的先验属性。一些主要问题是由于(1)建模数据作为分层树结构,(2)计算候选维护的高成本,(3)重复输入数据集扫描,(4)高内存依赖性。因此,需要一种更有效和实用的树数据方法。在本文中,我们系统地开发了模式增长方法而不是先验方法,用于采矿最大频繁的树形图案,这是一组树的特殊频繁模式。所提出的方法不仅摆脱了不常见的子树修剪的过程,而且还完全消除了产生候选子树的问题。因此,它显着提高了整个采矿过程。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号