首页> 外文会议>International conference on artificial neural networks >StreamLeader: A New Stream Clustering Algorithm not Based in Conventional Clustering
【24h】

StreamLeader: A New Stream Clustering Algorithm not Based in Conventional Clustering

机译:StreamLeader:一种不基于常规聚类的新流聚类算法

获取原文

摘要

Stream clustering algorithms normally require two phases: an online first step that statistically summarizes the stream while forming special structures - such as micro-clusters- and a second, offline phase, that uses a conventional clustering algorithm taking the micro-clusters as pseudo-points to deliver the final clustering. This procedure tends to produce oversized or overlapping clusters in medium-to-high dimensional spaces, and typically degrades seriously in noisy data environments. In this paper we introduce StreamLeader, a novel stream clustering algorithm suitable to massive data that does not resort to a conventional clustering phase, being based on the notion of Leader Cluster and on an aggressive noise reduction process. We report an extensive systematic testing in which the new algorithm is shown to consistently outperform its contenders both in terms of quality and scalability.
机译:流聚类算法通常需要两个阶段:在线第一步,统计形成特定结构(例如微簇)时对流进行汇总;第二步,离线阶段,该阶段使用常规聚类算法,将微簇作为伪点提供最终的集群。此过程往往会在中高维空间中生成超大或重叠的簇,并且通常在嘈杂的数据环境中会严重退化。在本文中,我们基于领导者群集的概念和积极的降噪过程,介绍了StreamLeader,这是一种新颖的流群集算法,适用于不求助于常规群集阶段的海量数据。我们报告了一项广泛的系统测试,在该测试中,新算法在质量和可扩展性方面均始终优于其竞争对手。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号