首页> 外文会议>International Computer Science and Engineering Conference >Efficient evolution-based clustering of high dimensional data streams with dimension projection
【24h】

Efficient evolution-based clustering of high dimensional data streams with dimension projection

机译:基于维投影的高效演化基于聚类的高维数据流

获取原文

摘要

SE-Stream is an evolution-based stream clustering method that supports high dimensional data streams. SE-Stream is able to monitor and detect change in the clustering structure during the progression of data streams. In this paper, we improve performance of SE-Stream by reducing its execution time and increasing its cluster quality. SE-Stream reduces complexity of stream processing by determining appropriated subset of dimensions of each active cluster to express cluster specific characteristics during the progression of data streams. With elimination of redundant operations, SE-Stream is improved both in terms of cluster quality and execution time. Experimental results on two real-world datasets show that SE-Stream outperforms its previous version in terms execution time. Further, the cluster quality in terms of both purity and f-measure has been considerably improved. Compared with HPStream, a state of the art algorithm for projected clustering of high dimensional data streams, SE-Stream outperforms in terms of cluster quality and yields comparable execution time.
机译:SE-Stream是一种基于进化的流聚类方法,支持高维数据流。 SE-Stream能够在数据流进行过程中监视和检测群集结构的变化。在本文中,我们通过减少SE-Stream的执行时间和提高其集群质量来提高SE-Stream的性能。 SE-Stream通过确定每个活动群集的维度的适当子集来表达数据流进行过程中群集特定的特性,从而降低了流处理的复杂性。通过消除冗余操作,SE-Stream在群集质量和执行时间方面都得到了改善。在两个实际数据集上的实验结果表明,SE-Stream在执行时间方面优于其先前版本。此外,就纯度和f-度量而言,簇质量已得到显着改善。与HPStream(用于高维数据流的投影聚类的最新算法)相比,SE-Stream在聚类质量方面表现出色,并且产生了可比的执行时间。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号