首页> 外文会议>ACMKDD International Conference on Knowledge Discovery and Data Mining;KDD 2008 >Mining Adaptively Frequent Closed Unlabeled Rooted Trees in Data Streams
【24h】

Mining Adaptively Frequent Closed Unlabeled Rooted Trees in Data Streams

机译:数据流中的自适应频繁关闭未标记的有根树的挖掘

获取原文

摘要

Closed patterns are powerful representatives of frequent patterns, since they eliminate redundant information. We propose a new approach for mining closed unlabeled rooted trees adaptively from data streams that change over time. Our approach is based on an efficient representation of trees and a low complexity notion of relaxed closed trees, and leads to an on-line strategy and an adaptive sliding window technique for dealing with changes over time. More precisely, we first present a general methodology to identify closed patterns in a data stream, using Galois Lattice Theory. Using this methodology, we then develop three closed tree mining algorithms: an incremental one IncTreeNat, a sliding-window based one, WinTreeNat, and finally one that mines closed trees adaptively from data streams, Ada-TreeNat. To the best of our knowledge this is the first work on mining frequent closed trees in streaming data varying with time. We give a first experimental evaluation of the proposed algorithms.
机译:封闭模式是频繁模式的有力代表,因为它们消除了多余的信息。我们提出了一种新的方法,该方法可从随时间变化的数据流中自适应地挖掘未标记的闭有根的树。我们的方法基于树木的有效表示和宽松的封闭树木的低复杂度概念,并导致了一种在线策略和自适应滑动窗口技术来处理随时间变化的问题。更准确地说,我们首先介绍一种使用伽罗瓦格子理论识别数据流中闭合模式的通用方法。然后,使用这种方法,我们开发了三种封闭树挖掘算法:一种增量式IncTreeNat算法,一种基于滑动窗口的WinTreeNat算法,最后一种是从数据流中自适应地挖掘封闭树的算法Ada-TreeNat。据我们所知,这是挖掘随时间变化的流式数据中频繁关闭的树的第一项工作。我们对提出的算法进行了首次实验评估。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号