首页> 外文学位 >Kernel-based clustering and low rank approximation.
【24h】

Kernel-based clustering and low rank approximation.

机译:基于内核的聚类和低秩逼近。

获取原文
获取原文并翻译 | 示例

摘要

Clustering is an unsupervised data exploration scenario that is of fundamental importance to pattern recognition and machine learning. This thesis involves two types of clustering paradigms, the mixture models and graph-based clustering methods, with the primary focus on how to improve the scaling behavior of related algorithms for large-scale application. With regard to mixture models, we are interested in reducing the model complexity in terms of number of components. We propose a unified algorithm to simultaneously solve "model simplification" and "component clustering", and apply it with success in a number of learning algorithms using mixture models, such as density based clustering and SVM testing. For graph-based clustering, we propose the density weighted Nystrom method for solving large scale eigenvalue problems, which demonstrates encouraging performance in the normalized-cut and kernel principal component analysis. We further extend this to the low rank approximation of kernel matrices, which is the key component to scaling up the kernel machines. We provide an error analysis on the Nystrom low rank approximation, based on which a new sampling scheme is proposed. Our scheme is very efficient and numerically outperforms a number of state-of-the-art approaches such as incomplete Cholesky decomposition, the standard Nystrom method, and probabilistic sampling approaches.
机译:群集是一种无监督的数据探索方案,对于模式识别和机器学习至关重要。本文涉及两种类型的聚类范例:混合模型和基于图的聚类方法,主要关注于如何为大规模应用改进相关算法的缩放行为。关于混合模型,我们有兴趣降低组件数量方面的模型复杂性。我们提出了一种统一的算法来同时解决“模型简化”和“组件聚类”问题,并将其成功地应用于许多使用混合模型的学习算法中,例如基于密度的聚类和SVM测试。对于基于图的聚类,我们提出了用于解决大规模特征值问题的密度加权Nystrom方法,这在归一化割和核主成分分析中证明了令人鼓舞的性能。我们进一步将其扩展到内核矩阵的低秩逼近,这是扩大内核计算机规模的关键组成部分。我们提供了对Nystrom低秩近似的误差分析,在此基础上提出了一种新的采样方案。我们的方案非常有效,并且在数值上优于许多最新方法,例如不完全的Cholesky分解,标准Nystrom方法和概率抽样方法。

著录项

  • 作者

    Zhang, Kai.;

  • 作者单位

    Hong Kong University of Science and Technology (Hong Kong).;

  • 授予单位 Hong Kong University of Science and Technology (Hong Kong).;
  • 学科 Computer Science.
  • 学位 Ph.D.
  • 年度 2008
  • 页码 120 p.
  • 总页数 120
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号