首页> 外文会议>Workshop on Algorithm Engineering and Experiments >An Algorithm for Online K-Means Clustering
【24h】

An Algorithm for Online K-Means Clustering

机译:在线k-means聚类的算法

获取原文

摘要

This paper shows that one can be competitive with the k-means objective while operating online. In this model, the algorithm receives vectors v_1, ..., v_n one by one in an arbitrary order. For each vector v_t the algorithm outputs a cluster identifier before receiving v_(t+1). Our online algorithm generates O(k log n log γn) clusters whose expected k-means cost is O(W~* log n). Here, W~* is the optimal k-means cost using k clusters and γ is the aspect ratio of the data. The dependence on γ is shown to be unavoidable and tight. We also show that, experimentally, it is not much worse than k-means++ while operating in a strictly more constrained computational model.
机译:本文表明,在线运营时,人们可以对K-Means目标具有竞争力。在该模型中,该算法以任意顺序接收v_1,...,v_n一个逐个向量。对于每个向量V_T,算法在接收V_(T + 1)之前输出群集标识符。我们的在线算法生成o(k log n logγn)群集,其预期的K-means成本为O(w〜* log n)。这里,W〜*是使用k簇的最佳k均值成本,γ是数据的纵横比。对γ的依赖显示是不可避免的和紧张的。我们还表明,通过实验,在严格更加受限的计算模型中运行时,它不会比K-Means ++更差。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号