首页> 外文期刊>Intelligent data analysis >Mining of closed frequent subtrees from frequently updated databases
【24h】

Mining of closed frequent subtrees from frequently updated databases

机译:从频繁更新的数据库中挖掘封闭的频繁子树

获取原文
获取原文并翻译 | 示例
           

摘要

We study the problem of mining closed frequent subtrees from tree databases that are updated regularly over time. Closed frequent subtrees provide condensed and complete information for all frequent subtrees in the database. Although mining closed frequent subtrees is in general faster than mining all frequent subtrees, this is still a very time consuming process, and thus it is undesirable to mine from scratch when the change to the database is small. The set of previous mined closed subtrees should be reused as much as possible to compute new emerging subtrees. We propose, in this paper, a novel and efficient incremental mining algorithm for closed frequent labeled ordered trees. We adopt a divide-and-conquer strategy and apply different mining techniques in different parts of the mining process. The proposed algorithm requires no additional scan of the whole database while its memory usage is reasonable. Our experimental study on both synthetic and real-life datasets demonstrates the efficiency and scalability of our algorithm.
机译:我们研究了从定期更新的树数据库中挖掘封闭的频繁子树的问题。封闭的频繁子树为数据库中的所有频繁子树提供压缩的完整信息。尽管挖掘封闭的频繁子树通常比挖掘所有频繁子树更快,但是这仍然是一个非常耗时的过程,因此,当对数据库的更改很小时,不希望从头开始挖掘。应该尽可能重用先前挖掘的封闭子树的集合,以计算新出现的子树。在本文中,我们提出了一种新颖且高效的增量式挖掘算法,用于封闭的频繁标记树。我们采用分而治之的策略,并在采矿过程的不同部分应用不同的采矿技术。所提出的算法不需要额外扫描整个数据库,而其内存使用却是合理的。我们对合成和现实数据集的实验研究证明了我们算法的效率和可扩展性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号