首页> 外文会议>Algorithms - ESA 2011 >Pattern Matching in Lempel-Ziv Compressed Strings: Fast, Simple, and Deterministic
【24h】

Pattern Matching in Lempel-Ziv Compressed Strings: Fast, Simple, and Deterministic

机译:Lempel-Ziv压缩字符串中的模式匹配:快速,简单和确定性

获取原文
获取原文并翻译 | 示例

摘要

Countless variants of the Lempel-Ziv compression are widely used in many real-life applications. This paper is concerned with a natural modification of the classical pattern matching problem inspired by the popularity of such compression methods: given an uncompressed pattern p[1.. m] and a Lempel-Ziv representation of a string t[1.. N], does p occur in t? Farach and Thorup [5] gave a randomized O(n log~2 N + m) time solution for this problem, where n is the size of the compressed representation of t. Building on the methods of [3] and [6], we improve their result by developing a faster and fully deterministic O(n log N + m) time algorithm with the same space complexity. Note that for highly compressible texts, log N might be of order n, so for such inputs the improvement is very significant. A small fragment of our method can be used to give an asymptotically optimal solution for the substring hashing problem considered by Farach and Muthukrishnan [4].
机译:Lempel-Ziv压缩的无数变体被广泛用于许多实际应用中。本文涉及对经典模式匹配问题的自然修改,该问题受此类压缩方法的普及启发:给定未压缩模式p [1..m]和字符串t [1..N]的Lempel-Ziv表示形式,p是否在t中出现? Farach和Thorup [5]为这个问题给出了一个随机的O(n log〜2 N / n + m)时间解,其中n是t压缩表示的大小。基于[3]和[6]的方法,我们通过开发具有相同空间复杂度的更快且完全确定的O(n log N / n + m)时间算法来提高其结果。注意,对于高度可压缩的文本,log N / n可能为n阶,因此对于此类输入,改进非常重要。我们的方法的一小部分可以用来为Farach和Muthukrishnan [4]考虑的子串散列问题提供渐近最优解。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号