...
首页> 外文期刊>Theoretical computer science >A quick tour on suffix arrays and compressed suffix arrays
【24h】

A quick tour on suffix arrays and compressed suffix arrays

机译:快速浏览后缀数组和压缩后缀数组

获取原文
获取原文并翻译 | 示例
           

摘要

Suffix arrays are a key data structure for solving a run of problems on texts and sequences, from data compression and information retrieval to biological sequence analysis and pattern discovery. In their simplest version, they can just be seen as a permutation of the elements in {1, 2,..., n}, encoding the sorted sequence of suffixes from a given text of length n, under the lexicographic order. Yet, they are on a par with ubiquitous and sophisticated suffix trees. Over the years, many interesting combinatorial properties have been devised for this special class of permutations: for instance, they can implicitly encode extra information, and they are a well characterized subset of the n! permutations. This paper gives a short tutorial on suffix arrays and their compressed version to explore and review some of their algorithmic features, discussing the space issues related to their usage in text indexing, combinatorial pattern matching, and data compression.
机译:后缀数组是一个关键的数据结构,用于解决一系列文本和序列问题,从数据压缩和信息检索到生物序列分析和模式发现。在其最简单的版本中,它们仅可以看作是{1,2,...,n}中元素的排列,按照字典顺序对来自长度为n的给定文本的后缀排序序列进行编码。但是,它们与无处不在的复杂后缀树相当。多年以来,针对这种特殊的排列类别,已经设计了许多有趣的组合属性:例如,它们可以隐式编码额外的信息,并且它们是n!的特征明确的子集。排列。本文提供了一个关于后缀数组及其压缩版本的简短教程,以探索和回顾它们的一些算法功能,并讨论了与它们在文本索引,组合模式匹配和数据压缩中的使用有关的空间问题。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号