首页> 外文会议>IEEE International Conference on Cybernetics >A new online clustering approach for data in arbitrary shaped clusters
【24h】

A new online clustering approach for data in arbitrary shaped clusters

机译:一种新的在线聚类方法,用于任意形状簇中的数据

获取原文

摘要

In this paper we demonstrate a new density based clustering technique, CODSAS, for online clustering of streaming data into arbitrary shaped clusters. CODAS is a two stage process using a simple local density to initiate micro-clusters which are then combined into clusters. Memory efficiency is gained by not storing or re-using any data. Computational efficiency is gained by using hyper-spherical micro-clusters to achieve a micro-cluster joining technique that is dimensionally independent for speed. The micro-clusters divide the data space in to sub-spaces with a core region and a non-core region. Core regions which intersect define the clusters. A threshold value is used to identify outlier micro-clusters separately from small clusters of unusual data. The cluster information is fully maintained on-line. In this paper we compare CODAS with ELM, DEC, Chameleon, DBScan and Denstream and demonstrate that CODAS achieves comparable results but in a fully on-line and dimensionally scale-able manner.
机译:在本文中,我们展示了一种新的基于密度的聚类技术,用于在线聚类将流数据流入任意形状的簇。 CoDas是一种使用简单的局部密度来引发微簇的两个阶段过程,然后将微簇组合成簇。通过不存储或重新使用任何数据,可以获得内存效率。通过使用超球面微簇来实现计算效率,以实现尺寸独立于速度的微簇连接技术。微集群将数据空间划分为具有核心区域和非核心区域的子空间。与群集相交的核心区域。阈值用于识别来自小型异常数据的小集群的异常微集群。群集信息完全维护在线。在本文中,我们将Codas与Elm,Dec,Chameleon,DBSCAN和DENSTROM进行比较,并证明CODAS实现了可比的结果,而是以完全在线和尺度扩展的方式。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号