【24h】

Sparse Suffix Tree Construction in Small Space

机译:小空间中的后缀树稀疏构建

获取原文

摘要

We consider the problem of constructing a sparse suffix tree (or suffix array) for b suffixes of a given text T of length n, using only O(b) words of space during construction. Attempts at breaking the naive bound of Ω(nb) time for this problem can be traced back to the origins of string indexing in 1968. First results were only obtained in 1996, but only for the case where the suffixes were evenly spaced in T. In this paper there is no constraint on the locations of the suffixes. We show that the sparse suffix tree can be constructed in O(n log~2 b) time. To achieve this we develop a technique, which may be of independent interest, that allows to efficiently answer b longest common prefix queries on suffixes of T, using only O(b) space. We expect that this technique will prove useful in many other applications in which space usage is a concern. Our first solution is Monte-Carlo and outputs the correct tree with high probability. We then give a Las-Vegas algorithm which also uses O(b) space and runs in the same time bounds with high probability when b = O(n~(1/2)). Furthermore, additional tradeoffs between the space usage and the construction time for the Monte-Carlo algorithm are given.
机译:我们考虑在构造期间仅使用O(b)个空格为给定长度为n的给定文本T的b个后缀构造一个稀疏后缀树(或后缀数组)的问题。尝试打破Ω(nb)时间的天真界限的尝试可以追溯到1968年字符串索引的起源。最初的结果仅在1996年获得,但仅适用于后缀在T中均匀间隔的情况。在本文中,对后缀的位置没有任何限制。我们证明了稀疏后缀树可以在O(n log〜2 b)的时间内构造。为了实现这一点,我们开发了一种可能具有独立利益的技术,该技术允许仅使用O(b)空间有效地回答T后缀上的b个最长的公共前缀查询。我们希望该技术将在许多其他需要空间使用的应用中被证明是有用的。我们的第一个解决方案是蒙特卡洛,并以高概率输出正确的树。然后,我们给出了Las-Vegas算法,该算法也使用O(b)空间,并且当b = O(n〜(1/2))时,在相同的时间范围内运行的可能性很高。此外,还给出了蒙特卡洛算法在空间使用量和构建时间之间的其他折衷方案。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号