首页> 外文会议>International conference on algorithms and discrete applied mathematics >Sketching for Data Streams and Numerical Linear Algebra
【24h】

Sketching for Data Streams and Numerical Linear Algebra

机译:绘制数据流和数值线性代数

获取原文

摘要

In the data stream model, data arrives in high volume and speed and there is not enough storage space to hold all the input. The input data is examined one record at a time upon arrival and processed. The typical processing done is randomized sketching that allows for sublinear storage, and often uses only poly-logarithmic space and time (per record). We present some interesting sketching solutions that have appeared in the literature for several statistical problems over data streams. We will also discuss some recent advances in algorithms for numerical linear algebra obtained using linear sketching. In this technique, an input matrix is compressed into a much smaller matrix by multiplying it with a random matrix chosen from certain distributions. The original problem is now solved, approximately to within factors of 1 with high probability, by computing over the smaller matrix. We will illustrate this method using the least squares regression problem.
机译:在数据流模型中,数据以高容量和高速度到达,并且没有足够的存储空间来容纳所有输入。输入数据到达时一次检查一个记录并进行处理。完成的典型处理是随机草图,允许进行次线性存储,并且通常仅使用对数空间和时间(每条记录)。我们提出了一些有趣的素描解决方案,这些解决方案已出现在文献中,涉及数据流中的一些统计问题。我们还将讨论使用线性素描获得的数字线性代数算法的一些最新进展。在这种技术中,通过将输入矩阵与从某些分布中选择的随机矩阵相乘,可以将其压缩为更小的矩阵。现在,通过对较小的矩阵进行计算,可以将原始问题解决的几率大约在1的范围之内。我们将使用最小二乘回归问题说明此方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号