首页> 外文会议>International Conference on Advanced Data Mining and Applications >Slice_OP: Selecting Initial Cluster Centers Using Observation Points
【24h】

Slice_OP: Selecting Initial Cluster Centers Using Observation Points

机译:slice_op:使用观察点选择初始群集中心

获取原文

摘要

This paper proposes a new algorithm, Slice_OP, which selects the initial cluster centers on high-dimensional data. A set of observation points is allocated to transform the high-dimensional data into one-dimensional distance data. Multiple Gamma models are built on distance data, which are fitted with the expectation-maximization algorithm. The best-fitted model is selected with the second-order Akaike information criterion. We estimate the candidate initial centers from the objects in each component of the best-fitted model. A cluster tree is built based on the distance matrix of candidate initial centers and the cluster tree is divided into K branches. Objects in each branch are analyzed with k-nearest neighbor algorithm to select initial cluster centers. The experimental results show that the Slice_OP algorithm outperformed the state-of-the-art Kmeans++ algorithm and random center initialization in the k-means algorithm on synthetic and real-world datasets.
机译:本文提出了一种新的算法Slice_op,它在高维数据上选择初始群集中心。分配一组观察点以将高维数据转换为一维距离数据。多个伽马模型构建在距离数据上,距离数据配有期望最大化算法。使用二阶Akaike信息标准选择最佳型号。我们从最佳拟合模型的每个组件中的对象估算候选初始中心。基于候选初始中心的距离矩阵构建群集树,并且群集树被分成K分支。使用K-CORMATE邻算法分析每个分支中的对象以选择初始群集中心。实验结果表明,SLIZE_OP算法优于综合性和现实世界数据集的K-MEASE算法中的最先进的kmeans ++算法和随机中心初始化。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号