首页> 外文期刊>Pattern Recognition: The Journal of the Pattern Recognition Society >Multi-manifold clustering: A graph-constrained deep nonparametric method
【24h】

Multi-manifold clustering: A graph-constrained deep nonparametric method

机译:多流形集群:图形约束深度非参数方法

获取原文
获取原文并翻译 | 示例
           

摘要

For multi-manifold clustering, it is still a challenging problem on how to learn the cluster number automatically from data. This paper presents a novel nonparametric Bayesian model to cluster the multi manifold data and estimate the number of submanifolds simultaneously. Our model firstly assumes that every submanifold is a probability distribution defined in the manifold space. Then, we approximate the manifold distribution with a deep neural network. To maintain the data similarity among data, we regularize the data generation process with a modified k-nearest neighbor graph. Though the posterior inference is hard, our model leads to a very efficient deterministic optimization algorithm, which incorporates the mean field variational inference with the Graph regularized Variational Auto-Encoder (Graph-VAE). By applying the Graph-VAE, our model exhibits another advantage of realistic image generation which overcomes the conventional clustering methods. Furthermore, we expand our proposed manifold algorithm with the Dirichlet Process Mixture (DPM) to model the real datasets, in which the manifold data and non-manifold data are coexisting. Experiments on synthetic data verify our theoretical analysis. Clustering results on motion segmentation, coil20 and 3D pedestrian show that our approach can significantly improve the clustering accuracy. The handwritten database experiment demonstrates the image generation capability. (C) 2019 Elsevier Ltd. All rights reserved.
机译:对于多流形群集,如何从数据中自动学习群集编号仍然是一个具有挑战性的问题。本文介绍了一种新颖的非参数贝叶斯模型,可以同时培养多流形数据并估计子植物数量。我们的模型首先假设每个子纤维是在歧管空间中定义的概率分布。然后,我们将歧管分布与深神经网络近似。为了保持数据之间的数据相似性,我们将数据生成过程规范为修改的k最近邻图。虽然后部推理很难,但我们的模型导致了一个非常有效的确定性优化算法,其结合了与图形正规化变分自动编码器(图VAE)的平均场变分推断。通过应用Graph-VAE,我们的模型呈现了克服传统聚类方法的现实图像生成的另一个优点。此外,我们将所提出的歧管算法扩展使用Dirichlet处理混合物(DPM)来模拟实际数据集,其中歧管数据和非歧管数据共存。合成数据的实验验证了我们的理论分析。运动分割,线圈20和3D行人的聚类结果表明,我们的方法可以显着提高聚类精度。手写数据库实验演示了图像生成功能。 (c)2019年elestvier有限公司保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号