...
首页> 外文期刊>Pattern Recognition: The Journal of the Pattern Recognition Society >Clustering of multivariate binary data with dimension reduction via L-1-regularized likelihood maximization
【24h】

Clustering of multivariate binary data with dimension reduction via L-1-regularized likelihood maximization

机译:通过L-1正则化似然最大化进行降维的多元二进制数据聚类

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Clustering methods with dimension reduction have been receiving considerable wide interest in statistics lately and a lot of methods to simultaneously perform clustering and dimension reduction have been proposed. This work presents a novel procedure for simultaneously determining the optimal cluster structure for multivariate binary data and the subspace to represent that cluster structure. The method is based on a finite mixture model of multivariate Bernoulli distributions, and each component is assumed to have a low-dimensional representation of the cluster structure. This method can be considered as an extension of the traditional latent class analysis. Sparsity is introduced to the loading values, which produces the low-dimensional subspace, for enhanced interpretability and more stable extraction of the subspace. An EM-based algorithm is developed to efficiently solve the proposed optimization problem. We demonstrate the effectiveness of the proposed method by applying it to a simulation study and real datasets. (C) 2015 Elsevier Ltd. All rights reserved.
机译:最近,具有降维的聚类方法在统计学中引起了广泛的关注,并且提出了许多同时执行聚类和降维的方法。这项工作提出了一种新颖的过程,可以同时确定多元二进制数据的最佳聚类结构和代表该聚类结构的子空间。该方法基于多元伯努利分布的有限混合模型,并且假定每个组件都具有簇结构的低维表示。该方法可以被认为是传统潜在类分析的扩展。稀疏性被引入到加载值中,生成低维子空间,以增强子空间的可解释性和提取的稳定性。开发了一种基于EM的算法来有效解决所提出的优化问题。我们通过将其应用于仿真研究和实际数据集来证明所提出方法的有效性。 (C)2015 Elsevier Ltd.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号