【24h】

Clustering over uncertain data stream

机译:聚类在不确定的数据流中

获取原文

摘要

Most existing clustering algorithms on uncertain data stream cannot discover the arbitrary shapes since they are based on k-means. To address this issue, this paper proposes a density grid-based clustering algorithm (DG-UStream) over uncertain data stream. DG-UStream uses the grid structure to store the summary information of data tuples in the stream which could be easily updated periodically. Clusters are formed by merging the adjacent grids. Furthermore, an efficient technique is developed to detect and delete the isolated grids which could greatly reduce the time and space costs. The experimental results show that DG-UStream has superior clustering performance in terms of clustering quality and time efficiency.
机译:在不确定数据流上的大多数现有聚类算法都无法发现任意形状,因为它们基于K-means。 为了解决这个问题,本文提出了一种基于浓度网格的聚类算法(DG-USTREAM)在不确定的数据流中。 DG-UStream使用网格结构来存储流中的数据元组的摘要信息,可以定期更新。 通过合并相邻网格来形成簇。 此外,开发了一种有效的技术来检测和删除隔离网格,这可以大大减少时间和空间成本。 实验结果表明,在聚类质量和时间效率方面,DG-USTREAM具有优异的聚类性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号