Practical Linear-Time O(1)-Workspace Suffix Sorting for Constant Alphabets

GE NONG




Abstract

This article presents an O(n)-time algorithm called SACA-K for sorting the suffixes of an input string T[0, n-1] over an alphabet A[0, K-1]. The problem of sorting the suffixes of T is also known as constructing the suffix array (SA) for T. The theoretical memory usage of SACA-K is n log K + n log n + K log n bits. Moreover, we also have a practical implementation of SACA-K that uses n bytes + (n + 256) words and is suitable for strings over any alphabet up to full ASCII, where a word is log n bits. In our experiments, SACA-K outperforms SA-IS, which was previously the most time- and space-efficient linear-time SA construction algorithm (SACA). SACA-K is around 33% faster and uses a smaller deterministic workspace of K words, where the workspace is the space needed beyond the input string and the output SA. Given K = O(1), SACA-K runs in linear time and O(1) workspace. To the best of our knowledge, such a result is the first reported in the literature with a practical source code publicly available.
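
To make the abstract's terms concrete, the sketch below shows what a suffix array is and evaluates the two memory formulas quoted above. It is not SACA-K: the suffix array is built with a plain comparison sort, which is far from linear time, and only the output matches what a linear-time algorithm would produce. The function names are hypothetical, the logarithms are assumed to be base 2, and the 4-byte word size is an assumption for illustration (the abstract defines a word as log n bits).

```python
import math


def naive_suffix_array(text: str) -> list[int]:
    """Return the start positions of all suffixes of `text`, sorted lexicographically.

    Illustration only: a comparison sort over suffix slices, NOT the SACA-K algorithm.
    """
    return sorted(range(len(text)), key=lambda i: text[i:])


def theoretical_bits(n: int, k: int) -> float:
    """Evaluate n log K + n log n + K log n bits (base-2 logs assumed)."""
    return n * math.log2(k) + n * math.log2(n) + k * math.log2(n)


def practical_bytes(n: int, word_bytes: int = 4) -> int:
    """Evaluate n bytes + (n + 256) words, assuming `word_bytes` bytes per word."""
    return n + (n + 256) * word_bytes


print(naive_suffix_array("banana"))  # [5, 3, 1, 0, 4, 2]
print(theoretical_bits(1024, 4))     # 12328.0
print(practical_bytes(1000))         # 6024
```

For n = 1024 and K = 4, the three terms are 2048 bits for the input string, 10240 bits for the output SA, and 40 bits for the K-word workspace. Only the last term is workspace in the abstract's sense, and it does not grow with n when K is constant.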


Bibliographic details

Source: ACM Transactions on Information Systems, 2013, No. 3, pp. 15.1-15.15 (15 pages)
Author: GE NONG
Affiliation: Computer Science Department, Sun Yat-sen University, Guangzhou 510275, China
Indexed in: Science Citation Index (SCI); Engineering Index (EI)
Original format: PDF
Language: English
Keywords: Suffix array; sorting algorithm; linear time; O(1)-workspace


Similar literature

1. Lempel-Ziv Factorization in Linear-Time O(1)-Workspace for Constant Alphabets [J]. Weijun LIU. IEICE Transactions on Information and Systems. 2021, No. 12.
2. Optimal suffix sorting and LCP array construction for constant alphabets [J]. Louza Felipe A., Gog Simon, Telles Guilherme P. Information Processing Letters. 2017, issue feba.
3. Linear-time Suffix Sorting - A New Approach for Suffix Array Construction [J]. Uwe Baier. LIPIcs: Leibniz International Proceedings in Informatics. 2016, No. 30.
4. Linear-Time Construction of Compressed Suffix Arrays Using o(n log n)-Bit Working Space for Large Alphabets [C]. Joong Chae Na. Annual Symposium on Combinatorial Pattern Matching (CPM 2005); June 19-22, 2005; Jeju Island (KR). 2005.
5. Optimizing MIMO equalization of QAM signals based on constant modulus algorithm and alphabet matched algorithm [D]. Taiwo, Peter O. 2015.
6. Linear-time computation of minimal absent words using suffix array [O]. Carl Barton, Alice Heliou, Laurent Mouchard. 2014.
7. Linear-time Suffix Sorting - A New Approach for Suffix Array Construction [O]. Baier Uwe. 2016.
8. Simplified Linear-Time Jordan Sorting and Polygon Clipping [R]. Fung, K. Y., Nicholl, T. M., Tarjan, R. E. 1989.
