...
首页> 外文期刊>Statistics and computing >Model-based clustering of multivariate ordinal data relying on a stochastic binary search algorithm
【24h】

Model-based clustering of multivariate ordinal data relying on a stochastic binary search algorithm

机译:基于模型的随机序搜索算法对多元序数数据的聚类

获取原文
获取原文并翻译 | 示例
           

摘要

We design a probability distribution for ordinal data by modeling the process generating data, which is assumed to rely only on order comparisons between categories. Contrariwise, most competitors often either forget the order information or add a non-existent distance information. The data generating process is assumed, from optimality arguments, to be a stochastic binary search algorithm in a sorted table. The resulting distribution is natively governed by two meaningful parameters (position and precision) and has very appealing properties: decrease around the mode, shape tuning from uniformity to a Dirac, identifiability. Moreover, it is easily estimated by an EM algorithm since the path in the stochastic binary search algorithm can be considered as missing values. Using then the classical latent class assumption, the previous univariate ordinal model is straightforwardly extended to model-based clustering for multivariate ordinal data. Parameters of this mixture model are estimated by an AECM algorithm. Both simulated and real data sets illustrate the great potential of this model by its ability to parsimoniously identify particularly relevant clusters which were unsuspected by some traditional competitors.
机译:我们通过对生成数据的过程进行建模来设计序数数据的概率分布,假定该过程仅依赖于类别之间的顺序比较。相反,大多数竞争者通常会忘记订单信息或添加不存在的距离信息。根据最佳参数,假定数据生成过程是排序表中的随机二进制搜索算法。所得的分布本机由两个有意义的参数(位置和精度)控制,并具有非常吸引人的属性:围绕模式的减小,从均匀性到Dirac的形状调整,可识别性。此外,由于随机二分查找算法中的路径可以视为缺失值,因此可以通过EM算法轻松估算。使用经典的潜在类假设,先前的单变量序数模型可以直接扩展到多变量序数数据的基于模型的聚类。该混合模型的参数通过AECM算法估算。模拟数据集和真实数据集都通过其简约地识别出某些传统竞争对手未曾怀疑的特别相关的集群的能力,说明了该模型的巨大潜力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号