首页> 外文会议>Asia-Pacific Web Conference(APWeb 2004); 20040414-20040417; Hangzhou; CN >Efficient Incremental Pattern Mining from Semi-structured Dataset
【24h】

Efficient Incremental Pattern Mining from Semi-structured Dataset

机译:半结构化数据集的高效增量模式挖掘

获取原文
获取原文并翻译 | 示例

摘要

Tree-structured frequent pattern mining is an important issue in semi-structured data mining. In this paper, we study the problem of incremental pattern mining from labeled ordered trees by using the knowledge discovered in the previous mining operations, and present an efficient algorithm IncFreqt for discovering frequent substructures from a large collection of semi-structured dataset. Because the inserted position of the increased data tree is not limited to the root of the original data tree, this paper introduces a new expansion algorithm called Bridging expansion, which is used to find the patterns that can not be discovered by normal rightmost expansion in the case that the new data tree is not inserted to be a subtree of the root of the original one. The experimental results show a great improvement in the efficiency of our algorithm compared with that of non-incremental pattern mining algorithm.
机译:树型频繁模式挖掘是半结构化数据挖掘中的重要问题。在本文中,我们使用先前的挖掘操作中发现的知识研究了从带标签的有序树中进行增量模式挖掘的问题,并提出了一种有效的算法IncFreqt,用于从大量半结构化数据集中发现频繁的子结构。由于增加后的数据树的插入位置不限于原始数据树的根,因此本文介绍了一种称为“桥接扩展”的新扩展算法,该算法用于查找无法通过常规最右边扩展发现的模式。新数据树未插入为原始数据树根的子树的情况。实验结果表明,与非增量模式挖掘算法相比,我们的算法效率有了很大的提高。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号