首页> 外文会议>Annual ACM symposium on Theory of computing;ACM symposium on Theory of computing >On coresets for k-means and k-median clustering

【24h】

On coresets for k-means and k-median clustering

机译：关于k均值和k中值聚类的核心集

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper, we show the existence of small coresets for the problems of computing k-median and k-means clustering for points in low dimension. In other words, we show that given a point set P in R^d, one can compute a weighted set S ⊆ P, of size O(k ε^-d log n), such that one can compute the k-median/means clustering on S instead of on P, and get an (1+ε)-approximation. As a result, we improve the fastest known algorithms for (1+ε)-approximate k-means and k-median. Our algorithms have linear running time for a fixed k and ε. In addition, we can maintain the (1+ε)-approximate k-median or k-means clustering of a stream when points are being only inserted, using polylogarithmic space and update time.

机译：在本文中，我们显示了针对低维点计算k中值和k均值聚类问题的小型核集的存在。换句话说，我们证明给定R ^{d 中的点集P，可以计算大小为O（kε^{-d log n的加权集S⊆P ），这样就可以计算在S而不是P上的k中值/均值聚类，并获得（1 +ε）逼近。结果，我们改进了（1 +ε）近似k均值和k中值的最快已知算法。对于固定的k和ε，我们的算法具有 linear 的运行时间。此外，使用多对数空间和更新时间，当仅插入点时，我们可以维持流的（1 +ε）-近似k中值或k-均值聚类。}} 展开▼

著录项

来源
《Annual ACM symposium on Theory of computing;ACM symposium on Theory of computing 》|2004年|P.291-300|共10页

会议地点

作者
Sariel Har-Peled; Soham Mazumdar;
展开▼

作者单位

展开▼

会议组织

原文格式 PDF

正文语种

中图分类一般性问题 ;

关键词
streaming;

机译：流媒体;

相似文献

外文文献

中文文献

专利

1. ON CORESETS FOR k-MEDIAN AND k-MEANS CLUSTERING IN METRIC AND EUCLIDEAN SPACES AND THEIR APPLICATIONS [J] . Chen K SIAM Journal on Computing . 2010 ,第3期

机译：度和欧空间中k均值和k均值聚类的共预设及其应用

2. TURNING BIG DATA INTO TINY DATA: CONSTANT-SIZE CORESETS FOR k-MEANS, PCA, AND PROJECTIVE CLUSTERING [J] . Feldman Dan, Schmidt Melanie, Sohler Christian SIAM Journal on Computing . 2020 ,第3期

机译：将大数据转化为微小数据：K-Means，PCA和投影群集的常量尺寸导致

3. Scalable k-Means Clustering via Lightweight Coresets [J] . Olivier Bachem, Mario Lucic, Andreas Krause SIGKDD explorations . 2018 ,第Udisk期

机译：可扩展的K-meary通过轻量级冠状群体聚类

4. Smaller coresets for k-median and k-means clustering [C] . Sariel Har-Peled, Akash Kushal Annual symposium on Computational geometry;Symposium on Computational geometry . 2005

机译：用于k中值和k均值聚类的较小核心集

5. Automated Parsing of Flexible Molecular Systems Using Principal Component Analysis and K-Means Clustering Techniques [D] . Nwerem, Matthew Jonathan Chukwunenye. 2021

机译：使用主成分分析和K-Means聚类技术自动解析灵活分子系统

6. A comparison of latent class K-means and K-median methods for clustering dichotomous data [O] . Michael J. Brusco, Emilie Shireman, Douglas Steinley -1

机译：潜在类K均值和K中值方法对二分数据进行聚类的比较

7. Smaller Coresets for k-Median and k-Means Clustering [O] . Sariel Har-Peled, Akash Kushal 2006

机译：k中位数和k均值聚类的较小刻度

1. 基于K均值聚类的大数据频繁项集挖掘研究 [J] . 张娅 . 计算机仿真 . 2020 ,第008期

2. 基于粗糙集的K均值聚类算法在案例检索中的应用 [J] . 陈千 ,向阳 ,郭鑫 . 计算机科学 . 2010 ,第012期

3. 基于改进K均值聚类算法的星点聚类研究 [J] . 夏永泉 ,孙静茹 ,WUXin-wen . 图学学报 . 2019 ,第002期

4. 基于改进K均值聚类算法的星点聚类研究 [J] . 夏永泉1 ,孙静茹1 ,WU Xin-wen2 . 图学学报 . 2019 ,第002期

5. 动态分配聚类中心的改进K均值聚类算法 [J] . 程艳云 ,周鹏 . 计算机技术与发展 . 2017 ,第002期

6. 基于模拟退火的粗糙集K均值电力负荷聚类分析 [C] . 刘建华 ,孟颖 . 福建省科协第十三届学术年会分会场——福建省电机工程学会第十三届学术年会 . 2013

7. 基于截集模糊K均值聚类的模糊支持向量机 [A] . 马丽娟 . 2009

1. 基于层次聚类的改进K均值聚类算法 [P] . 中国专利： CN104102726A . 2014-10-15

2. 一种基于在线学习的边云协同k均值聚类的模型优化方法 [P] . 中国专利： CN110968426B . 2022.02.22

3. cluster analysis apparatus using the k-means method, cluster analysis method, cluster analysis program, and recording medium storing the program [P] . 外国专利： JP4292293B2 . 2009-07-08

机译：使用k均值法的聚类分析装置，聚类分析方法，聚类分析程序以及存储该程序的记录介质

4. Occlusion/disocclusion detection using K-means clustering near object boundary with comparison of average motion of clusters to object and background motions [P] . 外国专利： USRE42790E . 2011-10-04

机译：使用K-means聚类在对象边界附近进行遮挡/遮挡检测，将聚类的平均运动与对象和背景运动进行比较

5. SEEDING METHOD FOR K-MEANS CLUSTERING AND OTHER CLUSTERING ALGORITHMS [P] . 外国专利： WO2008022341A2 . 2008-02-21

机译：K均值聚类和其他聚类算法的搜索方法

相关主题

On coresets for k-means and k-median clustering

摘要

著录项

相似文献

相关主题

期刊订阅