...
首页> 外文期刊>Neurocomputing >Model-based subspace clustering of non-Gaussian data
【24h】

Model-based subspace clustering of non-Gaussian data

机译:非高斯数据的基于模型的子空间聚类

获取原文
获取原文并翻译 | 示例
           

摘要

This paper presents a new generalized Dirichlet (GD) mixture model to address the challenging problem of clustering multidimensional data sets on different feature subsets. We approximate class-conditional distributions of mixture components to define binary relevance of features at the level of clusters. We consider a relevant feature as the one providing the knowledge to assign data points in the cluster. Then, we define a new message length objective to learn the model and select both feature subsets and the number of components. The proposed method is general comparatively with existing feature selection and subspace clustering models. In addition, it selects for each cluster only relevant and statistically independent features in a linear time of the number of observations and dimensions. Experiments on synthetic data and in unsupervised image categorization show the merits of our approach.
机译:本文提出了一种新的广义Dirichlet(GD)混合模型,以解决将多维数据集聚在不同特征子集上的难题。我们近似混合成分的类条件分布,以在聚类水平上定义特征的二进制相关性。我们认为相关功能是一种提供知识以在群集中分配数据点的功能。然后,我们定义一个新的消息长度目标,以学习模型并选择特征子集和组件数量。该方法与现有特征选择和子空间聚类模型比较通用。此外,它为每个聚类在线性观察次数和维数时间内仅选择相关且统计独立的特征。在合成数据和无监督图像分类中进行的实验表明了我们方法的优点。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号