Two Efficient Algorithms for Linear Time Suffix Array Construction

Nong Ge; Zhang Sen; Chan Wai Hong

Abstract

In this paper, we present two efficient algorithms for linear-time suffix array construction. Both achieve linear time complexity through divide-and-conquer and recursion. What distinguishes them from other linear-time suffix array construction algorithms (SACAs) is the choice of substrings sampled for problem reduction, namely variable-length leftmost S-type (LMS) substrings and fixed-length d-critical substrings, together with the simple algorithms used to sort these samples: induced sorting for the variable-length LMS substrings and radix sorting for the fixed-length d-critical substrings. These very simple sorting mechanisms give our algorithms an elegant design framework and, in turn, surprisingly succinct implementations. The fully functional sample implementations of the proposed algorithms require only around 100 lines of C code each, which is only 1/10 of the implementation of the KA algorithm and comparable to that of the KS algorithm. The experimental results demonstrate that the two proposed algorithms yield the best time and space efficiency among all existing linear-time SACAs.
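
The abstract names LMS substrings without defining them. As a rough illustration only, the sketch below uses the standard definitions from the suffix-sorting literature: suffix i is S-type if it is lexicographically smaller than suffix i+1 and L-type otherwise, and an LMS position is an S-type position whose left neighbour is L-type. The function names `classify_suffixes` and `lms_positions` are hypothetical, the `"\0"` sentinel is an assumption (the input must not contain it), and this is not the authors' C implementation.

```python
def classify_suffixes(text):
    """Label each suffix of text + sentinel as 'S' (smaller) or 'L' (larger).

    Assumption: "\0" does not occur in text, so the appended sentinel is
    strictly smaller than every other character.
    """
    s = text + "\0"
    n = len(s)
    types = ["S"] * n  # the sentinel suffix is S-type by convention
    # Scan right to left: a suffix is L-type if its first character is larger
    # than the next one, or equal to it while the next suffix is L-type.
    for i in range(n - 2, -1, -1):
        if s[i] > s[i + 1] or (s[i] == s[i + 1] and types[i + 1] == "L"):
            types[i] = "L"
    return types


def lms_positions(types):
    """Return positions i where suffix i is S-type and suffix i-1 is L-type."""
    return [i for i in range(1, len(types))
            if types[i] == "S" and types[i - 1] == "L"]


if __name__ == "__main__":
    types = classify_suffixes("mmiissiissiippii")
    print("".join(types))        # LLSSLLSSLLSSLLLLS
    print(lms_positions(types))  # [2, 6, 10, 16]
```

Each LMS substring runs from one LMS position to the next, so in this example only four sampled substrings stand in for seventeen suffixes, which is the kind of problem reduction the abstract refers to.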

Bibliographic details

Source
IEEE Transactions on Computers, 2011, no. 10, pp. 1471-1484 (14 pages)
Authors
Nong Ge; Zhang Sen; Chan Wai Hong
Affiliation
Sun Yat-sen University, Guangzhou
Original format
PDF
Language
English
Keywords
Suffix array; divide-and-conquer; linear time
