首页> 外文会议>International conference on database systems for advanced applications >Partition-Based Clustering with Sliding Windows for Data Streams
【24h】

Partition-Based Clustering with Sliding Windows for Data Streams

机译:带有滑动式Windows的基于分区的群集,用于数据流

获取原文

摘要

Data stream clustering with sliding windows generates clusters for every window movement. Because repeated clustering on all changed windows is highly inefficient in terms of memory and computation time, a clustering algorithm should be designed with considering only inserted and deleted tuples of windows. In this paper, we address this problem by sliding window aggregation technique and cluster modification strategy. We propose a novel data structure for construction and maintenance of 2-level synopses. This data structure enables to update synopses efficiently and support precise sliding window operations. We also suggest a modification strategy to decide whether to append new synopses to pre-existing clusters or perform clustering on whole synopses according to the difference between probability distributions of the original and updated clusters. Experimental results show that proposed method outperforms state-of-the-art methods.
机译:带有滑动窗口的数据流聚类会为每个窗口移动生成聚类。由于在所有更改的窗口上重复进行聚类在内存和计算时间方面的效率都非常低,因此在设计聚类算法时应仅考虑窗口的插入和删除的元组。在本文中,我们通过滑动窗口聚合技术和聚类修改策略解决了这个问题。我们提出了一种新颖的数据结构,用于2级概要的构建和维护。该数据结构能够有效地更新概要并支持精确的滑动窗口操作。我们还建议一种修改策略,根据原始和更新聚类的概率分布之间的差异,决定是将新的概要添加到现有聚类中,还是对整个概要进行聚类。实验结果表明,提出的方法优于最新方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号