首页> 外文期刊>Computational statistics & data analysis >Model-based methods to identify multiple cluster structures in a data set
【24h】

Model-based methods to identify multiple cluster structures in a data set

机译:基于模型的方法来识别数据集中的多个群集结构

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

There is an interest in the problem of identifying different partitions of a given set of units obtained according to different subsets of the observed variables (multiple cluster structures). A model-based procedure has been previously developed for detecting multiple cluster structures from independent subsets of variables. The method relies on model-based clustering methods and on a comparison among mixture models using the Bayesian Information Criterion. A generalization of this method which allows the use of any model-selection criterion is considered. A new approach combining the generalized model-based procedure with variable-clustering methods is proposed. The usefulness of the new method is shown using simulated and real examples. Monte Carlo methods are employed to evaluate the performance of various approaches. Data matrices with two cluster structures are analyzed taking into account the separation of clusters, the heterogeneity within clusters and the dependence of cluster structures.
机译:人们感兴趣的问题是识别根据观察变量(多个聚类结构)的不同子集获得的给定单元集的不同分区。先前已经开发了基于模型的过程,用于从变量的独立子集中检测多个聚类结构。该方法依赖于基于模型的聚类方法以及使用贝叶斯信息准则的混合模型之间的比较。考虑了该方法的一般化,该方法允许使用任何模型选择标准。提出了一种将基于模型的广义过程与可变聚类方法相结合的新方法。通过模拟和真实示例展示了该新方法的有用性。蒙特卡罗方法用于评估各种方法的性能。考虑到群集的分离,群集内的异质性以及群集结构的依赖性,对具有两个群集结构的数据矩阵进行了分析。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号