首页> 外文OA文献 >A new online clustering approach for data in arbitrary shaped clusters
【2h】

A new online clustering approach for data in arbitrary shaped clusters

机译:一种新的在线聚类方法,用于任意形状聚类中的数据

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

In this paper we demonstrate a new density based clustering technique, CODAS, for online clustering of streaming data into arbitrary shaped clusters. CODAS is a two stage process using a simple local density to initiate micro-clusters which are then combined into clusters. Memory efficiency is gained by not storing or re-using any data. Computational efficiency is gained by using hyper-spherical micro-clusters to achieve a micro-cluster joining technique that is dimensionally independent for speed. The micro-clusters divide the data space in to sub-spaces with a core region and a non-core region. Core regions which intersect define the clusters. A threshold value is used to identify outlier micro-clusters separately from small clusters of unusual data. The cluster information is fully maintained on-line. In this paper we compare CODAS with ELM, DEC, Chameleon, DBScan and Denstream and demonstrate that CODAS achieves comparable results but in a fully on-line and dimensionally scale-able manner.
机译:在本文中,我们演示了一种新的基于密度的聚类技术CODAS,用于将流数据在线聚类为任意形状的聚类。 CODAS是一个两阶段过程,使用简单的局部密度来引发微团簇,然后将其组合成簇。通过不存储或重用任何数据来提高内存效率。通过使用超球形微团簇来实现微团簇连接技术,从而获得计算效率,该技术在尺寸上与速度无关。微型集群将数据空间划分为具有核心区域和非核心区域的子空间。相交的核心区域定义了群集。阈值用于与异常数据的小型群集分开识别异常微簇。群集信息完全在线维护。在本文中,我们将CODAS与ELM,DEC,Chameleon,DBScan和Denstream进行了比较,并证明了CODAS达到了可比的结果,但是以完全在线且可扩展的方式进行。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号