...
首页> 外文期刊>Neural processing letters >An Image Clustering Auto-Encoder Based on Predefined Evenly-Distributed Class Centroids and MMD Distance
【24h】

An Image Clustering Auto-Encoder Based on Predefined Evenly-Distributed Class Centroids and MMD Distance

机译:基于预定义的均匀分布式质心和MMD距离的图像聚类自动编码器

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

In this paper, we propose a novel, effective and simpler end-to-end image clustering auto-encoder algorithm: ICAE. The algorithm uses predefined evenly-distributed class centroids (PEDCC) as the clustering centers, which ensures the inter-class distance of latent features is maximal, and adds data distribution constraint, data augmentation constraint, auto-encoder reconstruction constraint and Sobel smooth constraint to improve the clustering performance. Specifically, we perform one-to-one data augmentation to learn the more effective features. The data and the augmented data are simultaneously input into the autoencoder to obtain latent features and the augmented latent features whose similarity are constrained by an augmentation loss. Then, making use of the maximum mean discrepancy distance, we combine the latent features and augmented latent features to make their distribution close to the PEDCC distribution (uniform distribution between classes, Dirac distribution within the class) to further learn clustering-oriented features. At the same time, the MSE of the original input image and reconstructed image is used as reconstruction constraint, and the Sobel smooth loss to build generalization constraint to improve the generalization ability. Finally, extensive experiments on three common datasets MNIST, Fashion-MNIST, COIL20 are conducted. The experimental results show that the algorithm has achieved the best clustering results so far. In addition, we can use the predefined PEDCC class centers, and the decoder to clearly generate the samples of each class. The code can be downloaded at https://github. Com/zyWang- Power/Clustering!
机译:在本文中,我们提出了一种新颖,有效和更简单的端到端映像聚类自动编码器算法:ICAE。该算法使用预定义的均匀分布式Centroids(PEDCC)作为群集中心,这确保了潜在功能的帧间距离是最大的,并添加数据分布约束,数据增强约束,自动编码器重建约束和Sobel平滑约束提高聚类性能。具体而言,我们执行一对一的数据增强以了解更有效的功能。数据和增强数据同时输入到AutoEncoder中,以获得潜在的特征和增强的潜在特征,其相似度受到增强损耗的约束。然后,利用最大均值差异距离,我们组合了潜在的特征和增强潜在功能,使其分发接近PECCC分布(类之间的均匀分布,类内的DIRAC分布),以进一步学习面向群集的集群特征。同时,原始输入图像和重建图像的MSE用作重建约束,以及Sobel平滑丢失,以构建泛化约束来提高泛化能力。最后,对三个常见的数据集Mnist,时尚 - Mnist,线圈20进行了大量实验。实验结果表明,该算法到目前为止已达到最佳聚类结果。此外,我们可以使用预定义的PECCC类中心,以及解码器清楚地生成每个类的样本。代码可以在https:// github下载。 COM / ZYWANG - POWER / CLUSETING!

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号