首页> 外文会议>International Workshop on Combinatorial Algorithms >BWT Variants: A Combinatorial Investigation
【24h】

BWT Variants: A Combinatorial Investigation

机译:BWT变体:组合调查

获取原文

摘要

The Burrows-Wheeler transform (BWT) is a reversible transformation that was introduced in 1994 in the field of data compression and it has also become a fundamental tool for self-indexing data structures. It is a context-dependent transformation that produces a permutation of the input text that is likely to have runs of equal letters (clusters) longer than the ones in the original text. Such a combinatorial property of BWT make it a versatile tool also in several other applications, especially in bioinformatics [4, 6]. In [5], a complexity measure that counts the number of equal-letter runs produced by the BWT is introduced, by exploring how the number of clusters of the BWT output varies depending on the number of clusters of the input. Over the years other variants of the BWT have been proposed, without, however, obtaining transformations entirely capable of playing the same variegated role as the BWT. Very recently, a whole new class of transformations, called local ordering trasformation, has been introduced |3J. They have all the same prerogatives as BWT, i.e., they can be computed and inverted in linear time, they produce provably highly compressible output, and they support linear time pattern search directly on the transformed text. Such a class is a special case of a more general family of transformations based on context-adaptive alphabet orderings. This more general class includes also the alternating BWT (ABWT), another invertible transformation recently introduced in connection with a generalization of Lyndon words. Algorithmic and combinatorial issues on ABWT have been investigated in [1,2] In this talk an overview of the aforementioned transformations is presented, by focusing on some distinctive algorithmic and combinatorial features that could represent effective tools for investigating and handling the structure of the input text.
机译:Burrows-Wheeler变换(BWT)是可逆变换,于1994年在数据压缩领域引入,它也已成为用于自索引数据结构的基本工具。这是一个与上下文有关的转换,会产生输入文本的排列,该排列的等号字母(簇)的行距可能比原始文本中的字母(行)长。 BWT的这种组合特性使其在其他几个应用程序中也成为通用工具,尤其是在生物信息学中[4,6]。在[5]中,通过探索BWT输出的簇数如何根据输入的簇数变化而引入了一种复杂性度量,该度量对BWT产生的等字母游程数进行计数。多年来,已经提出了BWT的其他变体,但是没有获得完全能够起到与BWT相同的多样化作用的转换。最近,引入了一种全新的转换类,称为局部排序变换| 3J。它们具有与BWT相同的特权,即,它们可以在线性时间中进行计算和求逆,它们产生可证明的高度可压缩的输出,并且它们直接在转换后的文本上支持线性时间模式搜索。这样的类是基于上下文自适应字母顺序的更为通用的转换族的特例。这个更通用的类还包括交替BWT(ABWT),这是最近结合Lyndon单词的泛化引入的另一种可逆转换。关于ABWT的算法和组合问题已在[1,2]中进行了研究。在本次演讲中,通过着重介绍一些独特的算法和组合功能,它们可以代表研究和处理输入结构的有效工具,从而概述了上述转换。文本。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号