首页> 外文会议>IEEE International Conference on Computational Science and Engineering Workshops >Combining Parallel Self-Organizing Maps and K-Means to Cluster Distributed Data
【24h】

Combining Parallel Self-Organizing Maps and K-Means to Cluster Distributed Data

机译:将并行自组织地图和K-means组合为群集分布式数据

获取原文

摘要

Clustering is the process of discovering groups within multidimensional data, based on similarities, with a minimal knowledge of their structure. In previous works, we presented an algorithm (partSOM) to cluster distributed datasets, based on self-organizing maps (SOM). This work extends this approach presenting a strategy for efficient cluster analysis in distributed databases using SOM and K-means. The proposed strategy applies SOM algorithm separately in each distributed dataset, relative to database vertical partitions, to obtain a representative subset of each local dataset. In the sequence, these representative subsets are sent to a central site, which performs a fusion of the partial results and applies SOM and K-means algorithms to obtain a final result. Experimental results are compared with traditional SOM and partSOM approaches for different datasets.
机译:聚类是基于相似性在多维数据中发现组的过程,其结构的最小知识。在以前的作品中,我们基于自组织地图(SOM)向群集分布式数据集提交了一种算法(零件)。这项工作扩展了这种方法,介绍了使用SOM和K均值的分布式数据库中有效的集群分析的策略。所提出的策略在每个分布式数据集中分别应用SOM算法,相对于数据库垂直分区,以获取每个本地数据集的代表性子集。在序列中,这些代表子集被发送到中央站点,该中心站点执行部分结果的融合,并应用SOM和K均值算法以获得最终结果。将实验结果与传统SOM和不同数据集的方法进行比较。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号