首页> 外文学位 >An Empirical Study of Novel Approaches to Dimensionality Reduction and Applications.
【24h】

An Empirical Study of Novel Approaches to Dimensionality Reduction and Applications.

机译:对降维和应用新方法的实证研究。

获取原文
获取原文并翻译 | 示例

摘要

Dimensionality reduction is becoming increasingly important in the field of machine learning. In this thesis, we examine several traditional methods of dimensionality reduction, which include random projections, principal component analysis, singular value decomposition, kernel principal component analysis and discrete cosine transform. We also examine several existing applications of random projections (or dimensionality reduction, in general).;In their paper, Random projections in dimensionality reduction: Applications to image and text data (2001), Bingham and Manilla suggest the use of random projections for query matching in a situation where a set of documents, instead of one particular one, were searched for. This suggests another application of random projections, namely to reduce the complexity of the query process. In this thesis, we explain why this approach fails, and suggest three alternative approaches to reducing the complexity of the query process using dimensionality reduction. We also outline query-based dimensionality reduction methods that can be used for image and web data.;In each of the traditional approaches to dimensionality reduction (named above), each attribute in the reduced set is actually a linear combination of the attributes in the original data set. In this thesis, we take the position that true dimensionality reduction is obtained when the set of attributes in the reduced set is a proper subset of the attributes in the original data set, and we discuss seven novel approaches which satisfy this requirement. Using these seven approaches, as well as the RP and PCA approaches, we discuss several ways in which dimensionality reduction can be used for high dimensional clustering and classification.
机译:降维在机器学习领域变得越来越重要。本文研究了几种传统的降维方法,包括随机投影,主成分分析,奇异值分解,核主成分分析和离散余弦变换。我们还研究了几种现有的随机投影应用(通常是降维)。在他们的论文中,降维的随机投影:图像和文本数据的应用(2001年),宾厄姆和马尼拉建议使用随机投影进行查询在搜索一组文档而不是一个特定文档的情况下进行匹配。这建议了随机投影的另一种应用,即降低查询过程的复杂性。在本文中,我们解释了这种方法为什么会失败的原因,并提出了三种可选的方法来使用降维方法降低查询过程的复杂性。我们还概述了可用于图像和Web数据的基于查询的降维方法。在每种传统的降维方法(如上命名)中,约化集合中的每个属性实际上都是图元中属性的线性组合。原始数据集。在本文中,我们采取的立场是,当约简集中的属性集是原始数据集中属性的适当子集时,就可以获得真实的降维,并且我们讨论了满足此要求的七种新颖方法。使用这七种方法以及RP和PCA方法,我们讨论了降维可用于高维聚类和分类的几种方法。

著录项

  • 作者

    Nsang, Augustine Shey.;

  • 作者单位

    University of Cincinnati.;

  • 授予单位 University of Cincinnati.;
  • 学科 Information Science.;Computer Science.;Artificial Intelligence.
  • 学位 Ph.D.
  • 年度 2011
  • 页码 274 p.
  • 总页数 274
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号