Succinct representations of lcp information and improvements in the compressed suffix arrays


Abstract

We introduce two succinct data structures to solve various string problems. One is for storing the information of lcp, the longest common prefix, between suffixes in the suffix array, and the other is an improvement in the compressed suffix array which supports linear time counting queries for any pattern. The former occupies only 2n + o(n) bits for a text of length n for computing lcp between adjacent suffixes in lexicographic order in constant time, and 6n + o(n) bits between any two suffixes. No data structure in the literature attained linear size. The latter has size proportional to the text size and it is applicable to texts on any alphabet Σ such that |Σ| = log^O(1) n. These space-economical data structures are useful in processing huge amounts of text data.
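The abstract does not spell out how the 2n + o(n)-bit bound for adjacent suffixes is reached. A minimal sketch of one well-known way to get there is given below, under stated assumptions: it relies on the fact that, when lcp values are listed in text order (value for the suffix starting at position i), the quantity value + i never decreases, so the gaps can be written in unary with at most 2n bits. The function names (`suffix_array`, `lcp_array`, `encode_lcp`, `adjacent_lcp`) are hypothetical, the suffix array is built naively, and the constant-time select index that accounts for the o(n) term is replaced by a plain Python list of 1-bit positions. It is an illustration, not the paper's implementation.

```python
def suffix_array(text):
    """Naive suffix array: start positions sorted by suffix (illustration only)."""
    return sorted(range(len(text)), key=lambda i: text[i:])


def lcp_array(text, sa):
    """lcp[r] = length of the common prefix of suffixes sa[r-1] and sa[r]; lcp[0] = 0."""
    n = len(text)
    rank = [0] * n
    for r, i in enumerate(sa):
        rank[i] = r
    lcp = [0] * n
    h = 0
    for i in range(n):  # visit suffixes in text order (Kasai et al. style)
        if rank[i] > 0:
            j = sa[rank[i] - 1]
            while i + h < n and j + h < n and text[i + h] == text[j + h]:
                h += 1
            lcp[rank[i]] = h
            if h > 0:
                h -= 1
        else:
            h = 0
    return lcp


def encode_lcp(sa, lcp):
    """Unary-encode the gaps of (lcp in text order) + i as a list of bits."""
    n = len(sa)
    rank = [0] * n
    for r, i in enumerate(sa):
        rank[i] = r
    bits, prev = [], 0
    for i in range(n):
        v = lcp[rank[i]] + i       # non-decreasing in i
        bits.extend([0] * (v - prev))
        bits.append(1)
        prev = v
    return bits


def adjacent_lcp(ones, sa, r):
    """Recover lcp[r] from the positions of the 1-bits (a stand-in for select)."""
    i = sa[r]
    return ones[i] - 2 * i         # zeros before the (i+1)-th one, minus i


text = "banana"
sa = suffix_array(text)
lcp = lcp_array(text, sa)
bits = encode_lcp(sa, lcp)
ones = [p for p, b in enumerate(bits) if b]
print(len(bits), [adjacent_lcp(ones, sa, r) for r in range(len(sa))])
```

Note that decoding needs the suffix array entry sa[r], so the bit vector complements a (compressed) suffix array rather than replacing it.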


Bibliographic record

Source
Annual ACM-SIAM Symposium on Discrete Algorithms | 2002 | pp. 225-232 | 8 pages

Author
Kunihiko Sadakane

Original format: PDF

Chinese Library Classification: algorithm theory
