首页> 外文期刊>Algorithmica >Fast Compressed Self-indexes with Deterministic Linear-Time Construction
【24h】

Fast Compressed Self-indexes with Deterministic Linear-Time Construction

机译:确定性线性时间构造的快速压缩自索引

获取原文
获取原文并翻译 | 示例

摘要

We introduce a compressed suffix array representation that, on a text T of length n over an alphabet of size sigma, can be built in O(n) deterministic time, within O(nlog sigma) bits of working space, and counts the number of occurrences of any pattern P in T in time O(|P|+loglogw sigma) on a RAM machine of w=Omega(logn)-bit words. This time is almost optimal for large alphabets (log sigma=Theta(logn)), and it outperforms all the other compressed indexes that can be built in linear deterministic time, as well as some others. The only faster indexes can be built in linear time only in expectation, or require Theta(nlogn) bits. For smaller alphabets, where log sigma=o(logn), we show how, by using space proportional to a compressed representation of the text, we can build in linear time an index that counts in time O(|P|/log sigma n+log sigma En) for any constant E>0. This is almost RAM-optimal in the typical case where w=Theta(logn).
机译:我们介绍一种压缩后缀数组表示形式,该表示形式在长度为n且长度为sigma的字母表上的文本T上,可以在O(nlog sigma)个工作空间位内的O(n)确定时间内建立,并计算在w =Ω(logn)位字的RAM机器上,在时间O(| P | + loglogw sigma)中T中出现任何模式P。对于大型字母(log sigma = Theta(logn)),此时间几乎是最佳时间,它优于线性确定时间可以建立的所有其他压缩索引以及其他一些压缩索引。只有在期望的情况下,才能在线性时间中构建唯一更快的索引,或者需要Theta(nlogn)位。对于较小的字母,其中log sigma = o(logn),我们说明如何通过使用与文本的压缩表示形式成比例的空间,如何在线性时间内建立一个以时间O(| P | / log sigma n + log sigma En)的任何常数E> 0。在w = Theta(logn)的典型情况下,这几乎是RAM最佳的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号