首页> 外文期刊>Computational statistics & data analysis >Degrees of freedom and model selection for k-means clustering
【24h】

Degrees of freedom and model selection for k-means clustering

机译:K-Means聚类的自由度和模型选择

获取原文
获取原文并翻译 | 示例
           

摘要

A thorough investigation into the model degrees of freedom in k-means clustering is conducted. An extension of Stein's lemma is used to obtain an expression for the effective degrees of freedom in the k-means model. Approximating the degrees of freedom in practice requires simplifications of this expression, however empirical studies evince the appropriateness of the proposed approach. The practical relevance of this new degrees of freedom formulation for k-means is demonstrated through model selection using the Bayesian Information Criterion. The reliability of this method is then validated through experiments on simulated data as well as on a large collection of publicly available benchmark data sets from diverse application areas. Comparisons with popular existing techniques indicate that this approach is extremely competitive for selecting high quality clustering solutions. (C) 2020 Elsevier B.V. All rights reserved.
机译:进行了对K-Means聚类的模型自由度进行彻底调查。 Stein的引理的延伸用于获得K-Means模型中有效自由度的表达。 近似实践中的自由度需要简化这种表达,然而实证研究估计了所提出的方法的适当性。 通过使用贝叶斯信息标准的模型选择来证明这种新自由度的k-mex自由度的实际相关性。 然后通过对模拟数据的实验以及来自不同应用领域的大量公开的基准数据集进行验证该方法的可靠性。 具有流行现有技术的比较表明,这种方法对于选择高质量聚类解决方案非常竞争。 (c)2020 Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号