Asymptotic behavior of the height in a digital search tree and the longest phrase of the Lempel-Ziv scheme

Kness C.; Szpankowski W.

首页> 外文期刊>SIAM Journal on Computing >Asymptotic behavior of the height in a digital search tree and the longest phrase of the Lempel-Ziv scheme

【24h】

Asymptotic behavior of the height in a digital search tree and the longest phrase of the Lempel-Ziv scheme

机译：数字搜索树中的高度的渐近行为和Lempel-Ziv方案的最长短语

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

团队文献服务 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

We study the height of a digital search tree (DST) built from n random strings generated by an unbiased memoryless source (i.e., all symbols are equally likely). We shall argue that the height of such a tree is equivalent to the length of the longest phrase in the Lempel-Ziv parsing scheme that partitions a random sequence into n phrases. We also analyze the longest phrase in the Lempel-Ziv scheme in which a string of fixed length m is parsed into a random number of phrases. In the course of our analysis, we shall identify four natural regions of the height distribution and characterize them asymptotically for large n. n particular, for the region where most of the probability mass is concentrated, the asymptotic distribution of the height exhibits an exponential of a Gaussian distribution (with an oscillating term) around the most probable value k(1) = right perpendicularlog(2) n + root 2log(2)n - log(2)(root 2log(2)n) + 1/log 2 - 1/2left perpendicular + 1. More precisely, we shall prove that the asymptotic distribution of a DST is concentrated on either the one point k(1) or the two points k(1) - 1 and k(1), which actually proves (slightly modified) Kesten's conjecture quoted in [Probab. Theory Related Fields, 79 (1988), pp. 509-542]. Finally, we compare our findings for DST with the asymptotic distributions of the height for other digital trees such as tries and PATRICIA tries. We derive these results by a combination of analytic methods such as generating functions, Laplace transform, the saddle point method, and ideas of applied mathematics such as linearization, asymptotic matching, and the WKB method. Our analysis makes certain assumptions about the forms of some of the asymptotic expansions as well as their asymptotic matching. We also present detailed numerical veri cation of our results. [References: 32]

机译：我们研究了一种数字搜索树（DST）的高度，该数字搜索树是由无偏的无记忆源（即所有符号均等可能）生成的n个随机字符串构建而成的。我们将争辩说，这种树的高度等于将随机序列划分为n个短语的Lempel-Ziv解析方案中最长短语的长度。我们还分析了Lempel-Ziv方案中最长的短语，其中将固定长度m的字符串解析为随机数量的短语。在我们的分析过程中，我们将确定高度分布的四个自然区域，并对大n渐近地表征它们。 n特别是，对于大部分概率质量都集中的区域，高度的渐近分布在最可能值k（1）= right verticallog（2）周围表现出高斯分布（具有振荡项）的指数。 + root 2log（2）n-log（2）（root 2log（2）n）+ 1 / log 2-1/2向左垂直+1。更确切地说，我们将证明DST的渐近分布集中于任一一个点k（1）或两个点k（1）-1和k（1），实际上证明了（稍作修改）[Probab。]中引用的Kesten猜想。与理论相关的领域，第79卷（1988），第509-542页]。最后，我们将DST的发现与其他数字树（例如尝试和PATRICIA尝试）的高度的渐近分布进行比较。我们通过分析方法（例如生成函数，Laplace变换，鞍点方法）和应用数学思想（例如线性化，渐近匹配和WKB方法）的组合得出这些结果。我们的分析对某些渐近展开的形式及其渐近匹配做出了某些假设。我们还介绍了我们的结果的详细数值验证。 [参考：32]

著录项

来源
《SIAM Journal on Computing 》 |2000年第3期| 共42页
作者
Kness C.; Szpankowski W.;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类应用数学 ;
关键词
Digital search trees; Lempel-ziv algorithm; Height distribution; Longest phrase distribution; Laplace transform; Saddle point method; Matched asymptotics; Linearization; Wkb method; Elliptic theta function; Algorithm; Compression;

机译：数字搜索树;Lempel-ziv算法;高度分布;最长短语分布;拉普拉斯变换;鞍点法;匹配渐近;线性化;Wkb法;椭圆theta函数;算法;压缩;

相似文献

外文文献
中文文献
专利

1. Asymptotic behavior of the height in a digital search tree and the longest phrase of the Lempel-Ziv scheme [J] . Kness C., Szpankowski W. SIAM Journal on Computing . 2000 ,第3期

机译：数字搜索树中的高度的渐近行为和Lempel-Ziv方案的最长短语
2. ASYMPTOTIC BEHAVIOR OF THE LEMPEL-ZIV PARSING SCHEME AND DIGITAL SEARCH TREES [J] . Jacquet P., Szpankowski W. Theoretical computer science . 1995 ,第1a2期

机译：Leppel-Ziv解析方案和数字搜索树的渐近行为
3. ASYMPTOTIC BEHAVIOR OF THE LEMPEL-ZIV PARSING SCHEME AND DIGITAL SEARCH TREES [J] . Jacquet P., Szpankowski W. Theoretical Computer Science . 1995 ,第1a2期

机译：Leppel-Ziv解析方案和数字搜索树的渐近行为
4. Height in a digital search tree and the longest phrase of the Lempel-Ziv scheme [C] . Charles Knessl, Wojciech Szpankowski Annual ACM-SIAM symposium on Discrete algorithms;ACM-SIAM symposium on Discrete algorithms . 2000

机译：数字搜索树中的高度和Lempel-Ziv方案的最长短语
5. Asymptotics and Interpretability of Decision Trees and Decision Tree Ensembles [D] . Zhou, Yichen. 2019

机译：决策树和决策树集合的渐近性和可解释性
6. Asymptotically optimal minimizers schemes [O] . Guillaume Marçais, Dan DeBlasio, Carl Kingsford -1

机译：渐近最优最小化器方案
7. Asymptotic Behavior of the Height in a Digital Search Tree and the Longest Phrase of the Lempel--Ziv Scheme [O] . Charles Knessl, Wojciech Szpankowski 2000

机译：数字搜索树中高度的渐近行为及LEMPEL - ZIV方案的最长短语
8. Searching the ASRS Database Using QUORUM Keyword Search, Phrase Search, Phrase Generation, and Phrase Discovery [R] . McGreevy, M. W. 2001

机译：使用QUORUm关键字搜索，短语搜索，短语生成和短语发现搜索asRs数据库

Asymptotic behavior of the height in a digital search tree and the longest phrase of the Lempel-Ziv scheme

摘要

著录项

相似文献

相关主题

期刊订阅