首页> 外文会议>Annual ACM-SIAM symposium on Discrete algorithms;ACM-SIAM symposium on Discrete algorithms >Succinct representations of lcp information and improvements in the compressed suffix arrays
【24h】

Succinct representations of lcp information and improvements in the compressed suffix arrays

机译:lcp信息的简洁表示和压缩后缀数组的改进

获取原文

摘要

We introduce two succinct data structures to solve various string problems. One is for storing the information of lcp, the longest common prefix, between suffixes in the suffix array, and the other is an improvement in the compressed suffix array which supports linear time counting queries for any pattern. The former occupies only 2n + o(n) bits for a text of length n for computing lcp between adjacent suffixes in lexicographic order in constant time, and 6n + o(n) bits between any two suffixes. No data structure in the literature attained linear size. The latter has size proportional to the text size and it is applicable to texts on any alphabet Σ such that |Σ| = logO(1) n. These space-economical data structures are useful in processing huge amounts of text data.
机译:我们引入了两个简洁的数据结构来解决各种字符串问题。一种是在后缀数组的后缀之间存储最长的公共前缀 lcp 的信息,另一种是对压缩后缀数组的改进,它支持对任何模式进行线性计时查询。前者仅占2 n + o n )位,而长度为 n 的文本用于计算 lcp 按字典顺序在相邻后缀之间保持恒定的时间,并且任意两个后缀之间有6 n + o n )位。文献中没有数据结构达到线性大小。后者的大小与文本大小成正比,并且适用于任何字母Σ上的文本,使得|Σ| = log O (1) n 。这些节省空间的数据结构可用于处理大量文本数据。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号