【24h】

Weighted K-Means for Density-Biased Clustering

机译:密度偏置聚类的加权K均值

获取原文

摘要

Clustering is a task of grouping data based on similarity. A popular k-means algorithm groups data by firstly assigning all data points to the closest clusters, then determining the cluster means. The algorithm repeats these two steps until it has converged. We propose a variation called weighted k-means to improve the clustering scalability. To speed up the clustering process, we develop the reservoir-biased sampling as an efficient data reduction technique since it performs a single scan over a data set. Our algorithm has been designed to group data of mixture models. We present an experimental evaluation of the proposed method.
机译:群集是基于相似性分组数据的任务。流行的K-means算法通过首先将所有数据点分配给最接近的群集,然后确定集群方式。该算法在融合之前重复这两步。我们提出了一种称重K-Means的变型,以提高聚类可扩展性。为了加快聚类过程,我们将库偏置的抽样开发为有效的数据减少技术,因为它在数据集上执行单次扫描。我们的算法旨在为混合模型进行分组数据。我们提出了该方法的实验评价。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号