Model-based methods to identify multiple cluster structures in a data set

Giuliano Galimberti; Gabriele Soffritti


Abstract

There is an interest in the problem of identifying different partitions of a given set of units obtained according to different subsets of the observed variables (multiple cluster structures). A model-based procedure has been previously developed for detecting multiple cluster structures from independent subsets of variables. The method relies on model-based clustering methods and on a comparison among mixture models using the Bayesian Information Criterion. A generalization of this method which allows the use of any model-selection criterion is considered. A new approach combining the generalized model-based procedure with variable-clustering methods is proposed. The usefulness of the new method is shown using simulated and real examples. Monte Carlo methods are employed to evaluate the performance of various approaches. Data matrices with two cluster structures are analyzed taking into account the separation of clusters, the heterogeneity within clusters and the dependence of cluster structures.
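The BIC comparison mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes scikit-learn's `GaussianMixture` as a stand-in for the model-based clustering step, uses hypothetical toy data, and takes the split of the variables into two subsets as known in advance, whereas the procedure described above has to search for it. The helper `best_bic` is likewise an illustrative name.

```python
import numpy as np
from sklearn.mixture import GaussianMixture


def best_bic(X, max_components=4, seed=0):
    """Smallest BIC over Gaussian mixtures with 1..max_components components.

    scikit-learn defines BIC so that lower values indicate a better model.
    """
    return min(
        GaussianMixture(n_components=k, covariance_type="full",
                        n_init=3, random_state=seed).fit(X).bic(X)
        for k in range(1, max_components + 1)
    )


# Hypothetical toy data: 300 units, 4 variables, two independent partitions.
rng = np.random.default_rng(0)
n = 300
labels_a = rng.integers(0, 2, size=n)  # partition carried by variables 0-1
labels_b = rng.integers(0, 2, size=n)  # independent partition carried by variables 2-3
block_a = rng.normal(size=(n, 2)) + 6.0 * labels_a[:, None]
block_b = rng.normal(size=(n, 2)) + 6.0 * labels_b[:, None]
X = np.hstack([block_a, block_b])

# Single cluster structure: one mixture model fitted to all four variables.
joint_bic = best_bic(X)

# Two cluster structures: separate mixtures on each variable subset.
# If the subsets are independent, log-likelihoods and parameter counts add,
# so the BIC of the combined model is the sum of the two BICs.
split_bic = best_bic(X[:, :2]) + best_bic(X[:, 2:])

print(f"joint BIC: {joint_bic:.1f}")
print(f"split BIC: {split_bic:.1f}")
print("preferred:", "two structures" if split_bic < joint_bic else "one structure")
```

Summing the BICs is only valid under the independence assumption between variable subsets stated in the abstract. Any other model-selection criterion with the same additive form could be substituted for `bic` in the helper.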


Bibliographic details

Source
Computational Statistics & Data Analysis | 2007, Issue 1 | 17 pages
Authors
Giuliano Galimberti; Gabriele Soffritti
Original format: PDF
Language: English (eng)
Chinese Library Classification: Computing technology, computer technology
Keywords
Cluster analysis; Cluster structure; Mixture model; Model-selection; Clustering variables

