首页> 外文学位 >Diffusion in computer science and statistics.
【24h】

Diffusion in computer science and statistics.

机译:计算机科学和统计学的扩散。

获取原文
获取原文并翻译 | 示例

摘要

In this thesis, we investigate diffusion as an algorithmic and analytic tool in statistics and computer science.;We address a question arising from computational linguistics, where we wish to understand the behavior of a network of agents modeled as nodes of a graph that adaptively modify their lexicon using data from their neighbors. By introducing a model of memory and a family of coalescing random walks, we prove that they eventually reach a consensus with probability 1.;We study distributed averaging on graphs and devise a distributed algorithm that is based on a diffusion process having two time scales.;Addressing the question of routing in a network, we use steady-state diffusions corresponding to electrical flow in a network of resistors for oblivious routing and prove that this scheme performs well under a variety of performance measures.;Based on a microscopic view of diffusion as an ensemble of particles executing independent Brownian motions, we develop the fastest currently known algorithm for computing the area of the boundary of a convex set. A similar technique is used to produce samplers for the boundaries of convex sets and smooth hypersurfaces that are the boundaries of open sets in Rn, assuming access to samplers for the interior. These algorithms are motivated by Goodness-of-Fit tests in statistics.;The halfplane capacity, a quantity often used to parameterize stochastic processes arising in statistical physics, known as Schramm-Loewner evolutions, is shown to be comparable to a more geometric notion.;We analyze a class of natural random walks on a Riemannian manifold, and give bounds on the mixing times in terms of the Cheeger constant and a notion of smoothness that relates the random walk to the metric underlying the manifold.;A Markov chain having a stationary distribution that is uniform on the interior of a polytope is developed. This is the first chain whose mixing time is strongly polynomial when initiated in the vicinity of the center of mass. This Markov chain can be interpreted as a random walk on a certain Riemannian manifold. The resulting algorithm for sampling polytopes outperforms known algorithms when the number of constraints is of the same order of magnitude as the dimension. We use a variant of this Markov chain to design a randomized version of Dikin's affine scaling algorithm for linear programming. We provide polynomial-time guarantees which do not exist for Dikin's algorithm.;Addressing a question from machine learning, under certain smoothness conditions, we prove that a form of weighted surface area is the limit of the weight of graph cuts in a family of random graphs arising in the context of clustering. This is done by relating both to the amount of diffusion across the surface in question.;Addressing a related issue on manifolds, we obtain an upper bound on the annealed entropy of the collection of open subsets of a manifold whose boundaries are well-conditioned. This result leads to an upper bound on the number of random samples needed before it is possible to accurately classify data lying on a manifold.
机译:在本文中,我们研究了作为统计和计算机科学中的算法和分析工具的扩散。;我们解决了由计算语言学引起的问题,我们希望了解建模为自适应修改图的节点的智能体网络的行为他们的词典使用邻居的数据。通过引入记忆模型和一系列合并的随机游走,我们证明它们最终以概率1达成共识。我们研究了图上的分布式平均,并设计了基于具有两个时间尺度的扩散过程的分布式算法。 ;解决了网络中的路由问题,我们使用与电阻器网络中的电流相对应的稳态扩散进行了无遗漏的路由,并证明了该方案在各种性能指标下的性能都很好。作为执行独立布朗运动的粒子的整体,我们开发了当前最快的算法,用于计算凸集边界的面积。假设访问内部的采样器,则使用类似的技术来生成凸集和光滑超曲面(它们是Rn中的开放集的边界)边界的采样器。这些算法是由统计学中的拟合优度检验激发的。半平面容量(通常用于参数化统计物理学中出现的随机过程的参数,被称为Schramm-Loewner演化)与更几何的概念可比。 ;我们分析了黎曼流形上的一类自然随机游动,并根据Cheeger常数和光滑度的概念给出了混合时间的界限,该概念将随机游动与流形下方的度量联系起来。形成了在多表位内部均匀的平稳分布。这是在质心附近启动时其混合时间是强多项式的第一条链。该马尔可夫链可以解释为在某个黎曼流形上的随机游动。当约束的数量与维相同数量级时,所得的用于采样多边形的算法优于已知算法。我们使用此Markov链的一种变体来设计Dikin仿射缩放算法的随机版本,以进行线性编程。我们提供了Dikin算法不存在的多项式时间保证。;针对机器学习中的一个问题,在一定的光滑度条件下,我们证明加权表面积的形式是图随机族中图割权重的极限图在聚类的背景下出现。通过将这两个问题与整个问题表面上的扩散量相关联来完成。解决流形上的一个相关问题,我们获得了边界条件良好的流形的开放子集集合的退火熵的上限。该结果导致在可以对流形上的数据进行准确分类之前所需的随机样本数量上限。

著录项

  • 作者

    Narayanan, Hariharan.;

  • 作者单位

    The University of Chicago.;

  • 授予单位 The University of Chicago.;
  • 学科 Statistics.;Computer Science.
  • 学位 Ph.D.
  • 年度 2009
  • 页码 219 p.
  • 总页数 219
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 统计学;自动化技术、计算机技术;
  • 关键词

  • 入库时间 2022-08-17 11:37:43

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号