首页> 外文学位 >Dimensionality reduction algorithms with applications to collaborative data and images.
【24h】

Dimensionality reduction algorithms with applications to collaborative data and images.

机译:降维算法及其在协作数据和图像中的应用。

获取原文
获取原文并翻译 | 示例

摘要

General dimensionality reduction techniques play important roles in various fields in machine learning. As a well studied problem, many existing algorithms have achieved wide success in specific fields. In this work, we view this problem from a different viewpoint.;We first focuses on collaborative data, which consist of ratings relating two distinct sets of objects: users and items. Much of the work with such data focuses on filtering: predicting unknown ratings for pairs of users and items. In this work, we propose a well-structured Bayesian network to model the collaborative data, and employ loopy belief propogation to estimate parameters of the network and perform filtering tasks. In addition, we are interested in the problem of visualizing the information in the collaborative data. Given all of the ratings, our task is to embed all of the users and items as points in the same Euclidean space. We would like to place users near items that they have rated (or would rate) high, and far away from those they would give low ratings. We pose this problem as a real-valued non-linear Bayesian network and employ Markov chain Monte Carlo and expectation maximization to find an embedding. We present a metric by which to judge the quality of a visualization.;We then extend the visualization framework to images, specifically to embed images. Embedding images into a low dimensional space has a wide range of applications: visualization, clustering, and preprocessing for supervised learning. Traditional dimension reduction algorithms assume that the examples densely populate the manifold. Image databases tend to break this assumption, having isolated islands of similar images instead. Here we extend our framework to achieve the embedding goal of preserving local image similarities based on their scale invariant feature transform (SIFT) vectors. We make no neighborhood assumptions in our embedding. Our algorithm can also embed the images in a discrete grid, useful for many visualization tasks.
机译:通用降维技术在机器学习的各个领域中发挥着重要作用。作为一个经过充分研究的问题,许多现有算法已在特定领域取得了广泛的成功。在这项工作中,我们从不同的角度看待这个问题。我们首先关注协作数据,该数据由与两个不同对象集有关的等级组成:用户和项目。这些数据的大部分工作都集中在过滤上:预测用户和物品对的未知等级。在这项工作中,我们提出了一种结构良好的贝叶斯网络来对协作数据进行建模,并采用循环信念传播来估计网络的参数并执行过滤任务。另外,我们对可视化协作数据中信息的问题感兴趣。给定所有等级,我们的任务是将所有用户和项目作为点嵌入到同一欧氏空间中。我们希望将用户放在他们(或将要评分)较高的项目附近,并远离他们会给其较低评分的项目。我们将此问题作为实值非线性贝叶斯网络提出,并采用马尔可夫链蒙特卡罗和期望最大化找到嵌入。我们提出了一种衡量可视化质量的指标。然后,我们将可视化框架扩展到图像,特别是嵌入图像。将图像嵌入低维空间具有广泛的应用:可视化,聚类和用于监督学习的预处理。传统的降维算法假设示例密集地填充了流形。图像数据库倾向于打破这种假设,取而代之的是孤立的类似图像的孤岛。在这里,我们扩展框架以实现基于局部不变特征变换(SIFT)向量保持局部图像相似性的嵌入目标。在嵌入过程中,我们不做任何邻里假设。我们的算法还可以将图像嵌入离散的网格中,这对于许多可视化任务很有用。

著录项

  • 作者

    Mei, Guobiao.;

  • 作者单位

    University of California, Riverside.;

  • 授予单位 University of California, Riverside.;
  • 学科 Artificial Intelligence.;Computer Science.
  • 学位 Ph.D.
  • 年度 2008
  • 页码 106 p.
  • 总页数 106
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 人工智能理论;自动化技术、计算机技术;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号