...
首页> 外文期刊>SIAM Journal on Computing >Asymptotic behavior of the height in a digital search tree and the longest phrase of the Lempel-Ziv scheme
【24h】

Asymptotic behavior of the height in a digital search tree and the longest phrase of the Lempel-Ziv scheme

机译:数字搜索树中的高度的渐近行为和Lempel-Ziv方案的最长短语

获取原文
获取原文并翻译 | 示例

摘要

We study the height of a digital search tree (DST) built from n random strings generated by an unbiased memoryless source (i.e., all symbols are equally likely). We shall argue that the height of such a tree is equivalent to the length of the longest phrase in the Lempel-Ziv parsing scheme that partitions a random sequence into n phrases. We also analyze the longest phrase in the Lempel-Ziv scheme in which a string of fixed length m is parsed into a random number of phrases. In the course of our analysis, we shall identify four natural regions of the height distribution and characterize them asymptotically for large n. n particular, for the region where most of the probability mass is concentrated, the asymptotic distribution of the height exhibits an exponential of a Gaussian distribution (with an oscillating term) around the most probable value k(1) = right perpendicularlog(2) n + root 2log(2)n - log(2)(root 2log(2)n) + 1/log 2 - 1/2left perpendicular + 1. More precisely, we shall prove that the asymptotic distribution of a DST is concentrated on either the one point k(1) or the two points k(1) - 1 and k(1), which actually proves (slightly modified) Kesten's conjecture quoted in [Probab. Theory Related Fields, 79 (1988), pp. 509-542]. Finally, we compare our findings for DST with the asymptotic distributions of the height for other digital trees such as tries and PATRICIA tries. We derive these results by a combination of analytic methods such as generating functions, Laplace transform, the saddle point method, and ideas of applied mathematics such as linearization, asymptotic matching, and the WKB method. Our analysis makes certain assumptions about the forms of some of the asymptotic expansions as well as their asymptotic matching. We also present detailed numerical veri cation of our results. [References: 32]
机译:我们研究了一种数字搜索树(DST)的高度,该数字搜索树是由无偏的无记忆源(即所有符号均等可能)生成的n个随机字符串构建而成的。我们将争辩说,这种树的高度等于将随机序列划分为n个短语的Lempel-Ziv解析方案中最长短语的长度。我们还分析了Lempel-Ziv方案中最长的短语,其中将固定长度m的字符串解析为随机数量的短语。在我们的分析过程中,我们将确定高度分布的四个自然区域,并对大n渐近地表征它们。 n特别是,对于大部分概率质量都集中的区域,高度的渐近分布在最可能值k(1)= right verticallog(2)周围表现出高斯分布(具有振荡项)的指数。 + root 2log(2)n-log(2)(root 2log(2)n)+ 1 / log 2-1/2向左垂直+1。更确切地说,我们将证明DST的渐近分布集中于任一一个点k(1)或两个点k(1)-1和k(1),实际上证明了(稍作修改)[Probab。]中引用的Kesten猜想。与理论相关的领域,第79卷(1988),第509-542页]。最后,我们将DST的发现与其他数字树(例如尝试和PATRICIA尝试)的高度的渐近分布进行比较。我们通过分析方法(例如生成函数,Laplace变换,鞍点方法)和应用数学思想(例如线性化,渐近匹配和WKB方法)的组合得出这些结果。我们的分析对某些渐近展开的形式及其渐近匹配做出了某些假设。我们还介绍了我们的结果的详细数值验证。 [参考:32]

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号