...
首页> 外文期刊>International Journal of Engineering & Technology >MCDAStream: a real-time data stream clustering based on micro-cluster density and attraction
【24h】

MCDAStream: a real-time data stream clustering based on micro-cluster density and attraction

机译:MCDAStream:基于微集群密度和吸引力的实时数据流聚类

获取原文
   

获取外文期刊封面封底 >>

       

摘要

Real-time data stream clustering has been widely used in many fields, and it can extract useful information from massive sets of data. Most of the existing density-based algorithms cluster the data streams based on the density within the micro-clusters. These algorithms completely omit the data density in the area between the micro-clusters and recluster the micro-clusters based on erroneous assumptions about the distribution of the data within and between the micro-clusters that lead to poor clustering results. This paper describes a novel density-based clustering algorithm for evolving data streams called MCDAStream, which clusters the data stream based on micro-cluster density and attraction between the micro-clusters. The attraction of micro-clusters characterizes the positional information of the data points in each micro-cluster. We generate better clustering results by considering both micro-cluster density and attraction of micro-clusters. The quality of the proposed algorithm is evaluated on various synthetic and real-time datasets with distinct characteristics and quality metrics.
机译:实时数据流聚类已在许多领域中广泛使用,它可以从大量数据中提取有用的信息。现有的大多数基于密度的算法都基于微集群中的密度对数据流进行聚类。这些算法完全忽略了微团簇之间区域的数据密度,并基于关于微团簇内部和之间的数据分布的错误假设,对微团簇进行重新分组,从而导致了较差的聚类结果。本文介绍了一种用于演化数据流的基于密度的新颖聚类算法,称为MCDAStream,该算法基于微集群密度和微集群之间的吸引力对数据流进行聚类。微型集群的吸引力描述了每个微型集群中数据点的位置信息。通过同时考虑微集群密度和微集群的吸引力,我们可以产生更好的聚类结果。在具有不同特征和质量指标的各种合成和实时数据集上评估所提出算法的质量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号