...
首页> 外文期刊>Advances in data analysis and classification >Clustering and classification via cluster-weighted factor analyzers
【24h】

Clustering and classification via cluster-weighted factor analyzers

机译:通过聚类加权因子分析器进行聚类和分类

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

In model-based clustering and classification, the cluster-weighted model is a convenient approach when the random vector of interest is constituted by a response variable Y and by a vector X of p covariates. However, its applicability may be limited when p is high. To overcome this problem, this paper assumes a latent factor structure for x in each mixture component, under Gaussian assumptions. This leads to the cluster-weighted factor analyzers (CWFA) model. By imposing constraints on the variance of y and the covariance matrix of x, a novel family of sixteen CWFA models is introduced for model-based clustering and classification. The alternating expectation-conditional maximization algorithm, for maximum likelihood estimation of the parameters of all models in the family, is described; to initialize the algorithm, a 5-step hierarchical procedure is proposed, which uses the nested structures of the models within the family and thus guarantees the natural ranking among the sixteen likelihoods. Artificial and real data show that these models have very good clustering and classification performance and that the algorithm is able to recover the parameters very well.
机译:在基于模型的聚类和分类中,当感兴趣的随机向量由响应变量Y和p个协变量的向量X构成时,聚类加权模型是一种便捷的方法。然而,当p高时,其适用性可能受到限制。为了克服这个问题,本文假设在高斯假设下每个混合成分中x的潜在因子结构。这导致了群集加权因子分析器(CWFA)模型。通过对y的方差和x的协方差矩阵施加约束,引入了一个新的16个CWFA模型族,用于基于模型的聚类和分类。描述了交替期望条件最大化算法,用于最大似然估计族中所有模型的参数;为了初始化该算法,提出了一个五步分级程序,该程序使用了族内模型的嵌套结构,从而保证了十六种可能性之间的自然排名。人工和真实数据表明,这些模型具有很好的聚类和分类性能,并且该算法能够很好地恢复参数。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号