首页> 外文期刊>Pattern Analysis and Applications >A new random approach for initialization of the multiple restart EM algorithm for Gaussian model-based clustering
【24h】

A new random approach for initialization of the multiple restart EM algorithm for Gaussian model-based clustering

机译:基于高斯模型聚类的多重重启EM算法初始化的新随机方法

获取原文
获取原文并翻译 | 示例
           

摘要

The paper proposes a new method for initialization of the multiple restart EM algorithm for Gaussian mixture model-based clustering. The method initializes randomly both the mean vector and covariance matrix of a mixture component. In particular, the mean vector is initialized by a feature vector selected deterministically from a random subset of candidate feature vectors. The selection criterion is the maximum Mahalanobis distance from the already initialized mixture component centers. The covariance matrix of a component is initialized by randomly generating its eigenvalues and eigenvectors. In computational experiments, the used approach was compared with three other random EM initialization methods. The experiments were performed on synthetic datasets generated from the Gaussian mixtures with the different overlap characteristics, as well as on four real-life datasets. The results on synthetic data indicate that, for well separated clusters, for which the maximum pairwise overlap is not excessively high, the described method yields clusterings which correspond better to the original partitions of data, as indicated by the adjusted Rand index. The experiments on real data indicate that the performance of the method is comparable to other three methods for two smaller datasets and significantly better for two larger datasets.
机译:提出了一种基于高斯混合模型的多重启EM算法初始化的新方法。该方法随机初始化混合分量的均值向量和协方差矩阵。特别地,均值向量由从候选特征向量的随机子集中确定地选择的特征向量初始化。选择标准是距已初始化的混合组分中心的最大Mahalanobis距离。通过随机生成其特征值和特征向量来初始化组件的协方差矩阵。在计算实验中,将使用的方法与其他三种随机EM初始化方法进行了比较。实验是从具有不同重叠特性的高斯混合物生成的合成数据集以及四个实际数据集上进行的。合成数据的结果表明,对于间隔良好的群集,其最大成对重叠不太高,所描述的方法产生的聚类更好地对应于数据的原始分区,如调整后的兰德指数所示。对真实数据的实验表明,对于两个较小的数据集,该方法的性能可与其他三种方法媲美,而对于两个较大的数据集,其性能则明显更好。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号