首页> 外文会议>Knowledge Discovery and Data Mining, 2010. WKDD '10 >Clustering Data on Manifold with Local and Global Consistency
【24h】

Clustering Data on Manifold with Local and Global Consistency

机译:在具有本地和全局一致性的流形上对数据进行聚类

获取原文

摘要

Data clustering aims at finding the hidden patterns in a large collection of data and a large body of effective algorithms have been proposed to partition the data in the past three decades. However, most of the algorithms fail to handle data that expose a manifold structure which is common in many data-driven application, such as interpretation and recognition of video, handwritten character and image data. In this paper, we study the problem of clustering on manifold that aims to partition a set of input data into several clusters each of which contains data points from a simple low-dimensional manifold. We apply the basic assumption of local and global consistency on the manifold. A novel algorithm name CMLGC is proposed to find the proper clusters on the manifold. Our research can also be seen as an instance of manifold learning. The encouraging results on several synthetic and real-world data set are obtained which validate our proposed algorithm.
机译:数据聚类旨在发现大量数据中的隐藏模式,并且在过去的三十年中,已经提出了大量有效的算法来对数据进行分区。但是,大多数算法无法处理暴露出多种结构的数据,这种结构在许多数据驱动的应用程序中很常见,例如视频的解释和识别,手写字符和图像数据。在本文中,我们研究了在流形上进行聚类的问题,该问题旨在将一组输入数据划分为几个聚类,每个聚类包含来自简单低维流形的数据点。我们在流形上应用局部和全局一致性的基本假设。提出了一种新颖的算法名称CMLGC,以在流形上找到合适的聚类。我们的研究也可以看作是多元学习的一个实例。在几个综合的和真实的数据集上获得了令人鼓舞的结果,这些结果验证了我们提出的算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号