首页> 外文期刊>Journal of Statistical Software >A Bayesian Approach for Model-Based Clustering of Several Binary Dissimilarity Matrices: The dmbc Package in R
【24h】

A Bayesian Approach for Model-Based Clustering of Several Binary Dissimilarity Matrices: The dmbc Package in R

机译:一种基于模型的多个二进制矩阵的贝叶斯方法:r中的DMBC包装

获取原文
获取外文期刊封面目录资料

摘要

We introduce the new package dmbc that implements a Bayesian algorithm for clustering a set of binary dissimilarity matrices within a model-based framework. Specifically, we consider the case when S matrices are available, each describing the dissimilarities among the same n objects, possibly expressed by S subjects (judges), or measured under different experimental conditions, or with reference to different characteristics of the objects themselves. In particular, we focus on binary dissimilarities, taking values 0 or 1 depending on whether or not two objects are deemed as dissimilar. We are interested in analyzing such data using multidimensional scaling (MDS). Differently from standard MDS algorithms, our goal is to cluster the dissimilarity matrices and, simultaneously, to extract an MDS configuration specific for each cluster. To this end, we develop a fully Bayesian three-way MDS approach, where the elements of each dissimilarity matrix are modeled as a mixture of Bernoulli random vectors. The parameter estimates and the MDS configurations are derived using a hybrid Metropolis-Gibbs Markov Chain Monte Carlo algorithm. We also propose a BIC-like criterion for jointly selecting the optimal number of clusters and latent space dimensions. We illustrate our approach referring both to synthetic data and to a publicly available data set taken from the literature. For the sake of efficiency, the core computations in the package are implemented in C/C++. The package also allows the simulation of multiple chains through the support of the parallel package.
机译:我们介绍了新的软件包DMBC,它实现了一种贝叶斯算法,用于在基于模型的框架内集聚一组二进制异化矩阵。具体地,我们考虑当S矩阵可用的情况下,每个情况下,每个矩阵描述相同的N对象之间的异化,可能由S受试者(判断),或在不同的实验条件下测量,或者参考物体本身的不同特征。特别是,我们专注于二元异化,取决于两个物体是否被认为是不同的。我们有兴趣使用多维缩放(MDS)分析这些数据。与标准MDS算法不同,我们的目标是群集不相似矩阵,同时提取针对每个群集特定的MDS配置。为此,我们开发了一种完全贝叶斯三通MDS方法,其中每个不相似矩阵的元素被建模为Bernoulli随机载体的混合物。使用混合Metropolis-Gibbs Markov链蒙特卡罗算法导出参数估计和MDS配置。我们还提出了一种类似的标准,用于共同选择最佳的簇和潜在空间尺寸。我们说明了我们对合成数据以及从文献中获取的公开数据集的方法。为效率起见,包中的核心计算在C / C ++中实现。该包装还允许通过并联包的支撑来模拟多个链条。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号