首页> 外文会议>IEEE Image, Video, and Multidimensional Signal Processing Workshop >Learning Discriminative Representations for Big Data Clustering Using Similarity-Based Dimensionality Reduction
【24h】

Learning Discriminative Representations for Big Data Clustering Using Similarity-Based Dimensionality Reduction

机译:使用基于相似性的维度减少学习大数据聚类的判别表征

获取原文

摘要

Discriminative Clustering techniques simultaneously perform clustering and learn a representation that encourages the separability of the clusters. However, methods with high discriminative power tend to decrease clustering accuracy, since the cluster assignments are usually noisy. In this paper, a similarity-based dimensionality reduction method, that allows for learning regularized clustering-oriented representations and is able to efficiently scale to large datasets, is proposed. We avoid the pitfalls of highly discriminative methods, such as the Linear Discriminant Analysis (LDA), by maintaining a small similarity between the inter-cluster samples and a small dissimilarity between the intra-cluster samples instead of collapsing the intra-cluster samples and pushing the clusters as far apart as possible. Three datasets are used to demonstrate the ability of the proposed method to learn robust representations that improve the quality of the obtained clustering solutions over other clustering techniques.
机译:判别聚类技术同时执行群集并学习鼓励群集可分离的表示。然而,具有高鉴别力的方法倾向于降低聚类准确性,因为群集分配通常是嘈杂的。在本文中,提出了一种基于相似性的维度减少方法,其允许学习正则化聚类的表示和能够有效地缩放到大型数据集。我们避免了高度辨别方法的陷阱,例如线性判别分析(LDA),通过维持簇间样本与簇内样品之间的小的相似性而不是折叠簇内样品并推动群集尽可能远。三个数据集用于展示所提出的方法学习鲁棒表示的能力,从而通过其他聚类技术提高所获得的聚类解决方案的质量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号