首页> 外文学位 >Profiles of Patricia Tries.
【24h】

Profiles of Patricia Tries.

机译:帕特里夏(Patricia)的个人资料。

获取原文
获取原文并翻译 | 示例

摘要

Digital trees are data structures that represent sets of strings according to their shared prefix structure. In the most fundamental of such trees, a trie, each string in the set is represented by a sequence of edges, each representing a single letter of the string, starting at the root of the tree and ending at a leaf, the parent edge of which corresponds to the last letter of the longest prefix that the string shares with any other string in the set. A PATRICIA trie is a trie in which each non-branching path is compressed into a single edge. The external profile B n,k, defined to be the number of leaves at level k of a PATRICIA trie on n strings, is an important "summarizing'' parameter, in terms of which several other parameters of interest can be formulated. Here we derive precise asymptotics for the expected value and variance of Bn,k , as well as a central limit theorem with error bound on the characteristic function, for PATRICIA tries on n infinite binary strings generated by a memoryless source with bias p > 1/2 for k ∼ alphalog n with alpha ∈ (1/log(1/q) + epsilon, 1/log(1/ p) -- epsilon) for any fixed epsilon > 0 . In this range, E[Bn,k] = theta(Var[Bn,k ]) , and both are of the form theta(n beta(alpha)/√log n), where the theta hides bounded, periodic functions of log n whose Fourier series we explicitly determine. The compression property leads to extra terms in the Poisson functional equations for the profile which are not seen in tries or digital search trees, resulting in Mellin transforms which are only implicitly given in terms of the moments of Bm,j for various m and j . Thus, the proofs require information about the profile outside the main range of interest. We then extend our results to the boundaries of the central region, allowing analyses of the typical height and fillup level, both of which exhibit a surprising phase transition with respect to p . Our derivations rely on analytic techniques, including Mellin transforms, analytic de-Poissonization, the saddle point method, and careful bounding of complex functions.
机译:数字树是根据其共享前缀结构表示字符串集的数据结构。在此类树的最基本树中,一个trie,该集中的每个字符串都由一系列边表示,每个边代表该字符串的单个字母,从树的根部开始到叶子(叶的父边)结束它对应于字符串与集合中任何其他字符串共享的最长前缀的最后一个字母。 PATRICIA trie是一种Trie,其中每个非分支路径都压缩为单个边缘。外部轮廓B n,k定义为n个字符串上PATRICIA trie级别k上的叶子数,它是一个重要的“摘要”参数,根据该参数可以制定其他一些感兴趣的参数。推导Bn,k的期望值和方差的精确渐近性,以及对特征函数有误差限制的中心极限定理,因为PATRICIA尝试对由无记忆源产生的n个无限二进制字符串进行偏置,其中p> 1/2为对于任何固定的epsilon> 0,k〜alphalog n,其中α∈(1 / log(1 / q)+ epsilon,1 / log(1 / p)-epsilon)在此范围内,E [Bn,k] = theta (Var [Bn,k]),两者的形式均为theta(n beta(alpha)/√logn),其中theta隐藏了我们明确确定其Fourier级数的log n的有界周期函数。在配置文件的Poisson函数方程中增加了在尝试树或数字搜索树中看不到的额外项,从而导致梅林变换仅根据Bm,j的矩分别给出了各种m和j的值。因此,证明需要有关主要关注范围之外的个人资料的信息。然后,我们将结果扩展到中心区域的边界,从而可以分析典型的高度和填充水平,这两个部分都相对于p表现出令人惊讶的相变。我们的推导依赖于分析技术,包括梅林变换,分析去泊松,鞍点法和复杂函数的仔细界定。

著录项

  • 作者

    Magner, Abram N.;

  • 作者单位

    Purdue University.;

  • 授予单位 Purdue University.;
  • 学科 Computer science.;Mathematics.
  • 学位 Ph.D.
  • 年度 2015
  • 页码 108 p.
  • 总页数 108
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

  • 入库时间 2022-08-17 11:52:25

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号