首页> 外文学位 >Scalable machine learning for massive datasets: Fast summation algorithms.
【24h】

Scalable machine learning for massive datasets: Fast summation algorithms.

机译:适用于海量数据集的可扩展机器学习:快速求和算法。

获取原文
获取原文并翻译 | 示例

摘要

Huge data sets containing millions of training examples with a large number of attributes are relatively easy to gather. However one of the bottlenecks for successful inference is the computational complexity of machine learning algorithms. Most state-of-the-art nonparametric machine learning algorithms have a computational complexity of either O (N2) or O (N3), where N is the number of training examples. This has seriously restricted the use of massive data sets. The bottleneck computational primitive at the heart of various algorithms is the multiplication of a structured matrix with a vector, which we refer to as matrix-vector product (MVP) primitive. The goal of my thesis is to speedup up some of these MVP primitives by fast approximate algorithms that scale as O (N) and also provide high accuracy guarantees . I use ideas from computational physics, scientific computing, and computational geometry to design these algorithms. The proposed algorithms have been applied to speedup kernel density estimation, optimal bandwidth estimation, projection pursuit, Gaussian process regression, implicit surface fitting, and ranking.
机译:相对容易收集包含数百万个具有大量属性的训练示例的庞大数据集。但是,成功推理的瓶颈之一是机器学习算法的计算复杂性。大多数最新的非参数机器学习算法的计算复杂度为O(N2)或O(N3),其中N是训练示例的数量。这严重限制了海量数据集的使用。各种算法的核心计算瓶颈是结构化矩阵与向量的乘积,我们称其为矩阵向量乘积(MVP)原语。本文的目标是通过缩放为O(N)的快速近似算法来加速其中一些MVP原语,并提供高精度的保证。我使用来自计算物理学,科学计算和计算几何的思想来设计这些算法。所提出的算法已应用于加速内核密度估计,最佳带宽估计,投影追踪,高斯过程回归,隐式表面拟合和排序。

著录项

  • 作者

    Raykar, Vikas Chandrakant.;

  • 作者单位

    University of Maryland, College Park.$bComputer Science.;

  • 授予单位 University of Maryland, College Park.$bComputer Science.;
  • 学科 Artificial Intelligence.; Computer Science.
  • 学位 Ph.D.
  • 年度 2007
  • 页码 218 p.
  • 总页数 218
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 人工智能理论;自动化技术、计算机技术;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号