首页> 外文会议>International conference on advanced data mining and applications >A Fast Spectral Clustering Method Based on Growing Vector Quantization for Large Data Sets
【24h】

A Fast Spectral Clustering Method Based on Growing Vector Quantization for Large Data Sets

机译:基于大数据集的越来越多矢量量化的快速谱聚类方法

获取原文
获取外文期刊封面目录资料

摘要

Spectral clustering is a flexible clustering algorithm that can produce high-quality clusters on small scale data sets, but it is limited applicable to large scale data sets because it needs O(n~3) computational operations to process a data set of n data points. Based on the minimization of the increment of distortion, we tackle this problem by developing a novel efficient growing vector quantization method to preprocess a large scale data set, which can compress the original data set into a small set of representative data points in one scan of the original data set. Then we apply spectral clustering algorithm to the small set. Experiments on real data sets show that our method provides fast and accurate clustering results.
机译:光谱簇是一种灵活的聚类算法,可以在小刻度数据集上产生高质量的群集,但它有限适用于大规模数据集,因为它需要O(n〜3)计算操作来处理N个数据点的数据集。基于最小化失真的增量,我们通过开发一种新的有效生长的向量量化方法来预处理大规模数据集来解决这个问题,这可以将原始数据集压缩到一扫描中的一小组代表数据点中原始数据集。然后我们将谱聚类算法应用于小组。真实数据集的实验表明,我们的方法提供了快速准确的聚类结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号