An Algorithm for Online K-Means Clustering


Abstract

This paper shows that one can be competitive with the k-means objective while operating online. In this model, the algorithm receives vectors v_1, ..., v_n one by one in an arbitrary order. For each vector v_t the algorithm outputs a cluster identifier before receiving v_(t+1). Our online algorithm generates O(k log n log γn) clusters whose expected k-means cost is O(W~* log n). Here, W~* is the optimal k-means cost using k clusters and γ is the aspect ratio of the data. The dependence on γ is shown to be unavoidable and tight. We also show that, experimentally, it is not much worse than k-means++ while operating in a strictly more constrained computational model.
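
The online model described in the abstract can be illustrated with a minimal Python sketch. It is not the paper's exact algorithm, which the abstract does not spell out. It assumes a facility-location style rule: each arriving vector opens a new cluster with probability proportional to its squared distance from the nearest existing center, and the opening cost doubles once a per-phase budget of new centers is used up. The class name `OnlineClusterer`, the budget `3 * k * (1 + log n)`, and the initial cost are illustrative assumptions.

```python
import math
import random


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


class OnlineClusterer:
    """Hypothetical sketch: returns a cluster id for each vector on arrival."""

    def __init__(self, k, n, seed=0):
        self.k = k
        # Assumed per-phase limit on newly opened centers.
        self.budget = 3 * k * (1 + math.log(n))
        self.rng = random.Random(seed)
        self.centers = []
        self.open_cost = None  # cost of opening a new center (assumed rule)
        self.opened = 0        # centers opened in the current phase

    def assign(self, v):
        v = tuple(v)
        # The first k + 1 vectors each become their own center.
        if len(self.centers) <= self.k:
            self.centers.append(v)
            if len(self.centers) == self.k + 1:
                dmin = min(
                    sq_dist(a, b)
                    for i, a in enumerate(self.centers)
                    for b in self.centers[i + 1:]
                )
                # Guard against duplicate points giving a zero cost.
                self.open_cost = max(dmin / 2, 1e-12) / self.k
            return len(self.centers) - 1

        # Nearest existing center and its squared distance.
        idx, d = min(
            ((i, sq_dist(v, c)) for i, c in enumerate(self.centers)),
            key=lambda t: t[1],
        )
        # Far-away vectors are more likely to open a new cluster.
        if self.rng.random() < min(d / self.open_cost, 1.0):
            self.centers.append(v)
            idx = len(self.centers) - 1
            self.opened += 1
            if self.opened >= self.budget:
                # Start a new phase: opening a center becomes twice as costly.
                self.opened = 0
                self.open_cost *= 2
        # The id is fixed here, before the next vector is seen.
        return idx
```

Calling `assign` once per vector, in arrival order, mirrors the constraint that a cluster identifier for v_t is output before v_(t+1) is received. The sketch may create more than k clusters, which is consistent with the abstract's bound on the number of clusters exceeding k.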

Bibliographic details

Source
Workshop on Algorithm Engineering and Experiments | 2016 | 9 pages
Authors
Edo Liberty; Ram Sriharsha; Maxim Sviridenko
Original format: PDF
Chinese Library Classification: TP301.6-53

Similar literature

1. K-means clustering algorithms used in the evaluation of online learners' behaviour [J]. Xiaoming Chen, Wenge Li, Yubo Jiang. International Journal of Continuing Engineering Education and Life-long Learning, 2021, No. 3.
2. Evaluation Of Fuzzy K-Means And K-Means Clustering Algorithms In Intrusion Detection Systems [J]. Farhad Soleimanian Gharehchopogh, Neda Jabbari, Zeinab Ghaffari Azar. International Journal of Scientific & Technology Research, 2012, No. 11.
3. Does Determination of Initial Cluster Centroids Improve the Performance of K-Means Clustering Algorithm? Comparison of Three Hybrid Methods by Genetic Algorithm, Minimum Spanning Tree, and Hierarchical Clustering in an Applied Study [J]. Saeedeh Pourahmad, Atefeh Basirat, Amir Rahimi. Computational and Mathematical Methods in Medicine, 2020, No. 1.
4. Research on k-means Clustering Algorithm: An Improved k-means Clustering Algorithm [C]. Shi Na, Liu Xumin, Guan Yong. Intelligent Information Technology and Security Informatics (IITSI), 2010.
5. Hardware Implementation and Performance Evaluation of K-Means and K-Means++ Clustering Algorithms [D]. Manisha Singh. 2019.
6. Does Determination of Initial Cluster Centroids Improve the Performance of K-Means Clustering Algorithm? Comparison of Three Hybrid Methods by Genetic Algorithm, Minimum Spanning Tree, and Hierarchical Clustering in an Applied Study [O]. Saeedeh Pourahmad, Atefeh Basirat, Amir Rahimi. 2020.
7. An Algorithm for Online K-Means Clustering [O]. Edo Liberty, Ram Sriharsha, Maxim Sviridenko. 2015.
