首页> 外文期刊>中南大学学报(英文版) >Clustering method based on data division and partition
【24h】

Clustering method based on data division and partition

机译:基于数据划分和分区的聚类方法

获取原文
获取原文并翻译 | 示例
       

摘要

Many classical clustering algorithms do good jobs on their prerequisite but do not scale well when being applied to deal with very large data sets (VLDS). In this work, a novel division and partition clustering method (DP) was proposed to solve the problem. DP cut the source data set into data blocks, and extracted the eigenvector for each data block to form the local feature set. The local feature set was used in the second round of the characteristics polymerization process for the source data to find the global eigenvector. Ultimately according to the global eigenvector, the data set was assigned by criterion of minimum distance. The experimental results show that it is more robust than the conventional clusterings. Characteristics of not sensitive to data dimensions, distribution and number of nature clustering make it have a wide range of applications in clustering VLDS.
机译:许多古典聚类算法在他们的先决条件下做好工作,但在应用于处理非常大的数据集(VLD)时不刻度。在这项工作中,提出了一种新的划分和分区聚类方法(DP)来解决问题。 DP将源数据剪切到数据块中,并提取每个数据块的特征向量以形成本地特征集。本地特征集在第二轮特征聚合过程中用于源数据以找到全局特征向量。最终根据全局特征向量,通过最小距离的标准分配数据集。实验结果表明,它比传统集群更鲁棒。对数据维度不敏感的特征,分布和自然群集的分布使得它在聚类VLD中具有广泛的应用。

著录项

获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号