首页> 外文会议>ICCSE2008;International Conference on Computer Science Education >A Clustering Algorithm for Multiple Data Streams Based on Spectral Component Similarity
【24h】

A Clustering Algorithm for Multiple Data Streams Based on Spectral Component Similarity

机译:基于谱分量相似度的多数据流聚类算法

获取原文

摘要

A new clustering algorithm for multiple data streams using a novel similarity metric is proposed. Based on spectral component similarity analysis, the algorithm can effectively cluster streams which show similar behavior with some unknown time delay. Auto-regressive modeling technique is exploited to measure lag correlation between data streams, which is used as the distance metrics for clustering. Based on a sliding window model, the algorithm can continuously report the most recent clustering results, and dynamically adjust the number of clusters. Our experimental results on real and synthetic datasets show that our algorithm has high clustering quality, efficiency, and scalability than other similar methods.
机译:提出了一种使用新型相似性度量的多数据流聚类新算法。基于频谱分量相似性分析,该算法可以有效地聚类表现出相似行为但具有未知时间延迟的流。利用自回归建模技术来测量数据流之间的滞后相关性,将其用作聚类的距离度量。该算法基于滑动窗口模型,可以连续报告最新的聚类结果,并动态调整聚类数量。我们在真实和合成数据集上的实验结果表明,与其他类似方法相比,我们的算法具有较高的聚类质量,效率和可伸缩性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号