首页> 外文期刊>Software >Revisiting bounded context block-sorting transformations
【24h】

Revisiting bounded context block-sorting transformations

机译:重访有界上下文块排序转换

获取原文
获取原文并翻译 | 示例
       

摘要

The Burrows-Wheeler Transform (BWT) produces a permutation of a string X, denoted X~*, by sorting the n cyclic rotations of X into full lexicographical order and taking the last column of the resulting n x n matrix to be X~*. The transformation is reversible in O(n) time. In this paper, we consider an alteration to the process, called A-bwt, where rotations are only sorted to a depth k. We propose new approaches to the forward and reverse transform, and show that the methods are efficient in practice. More than a decade ago, two algorithms were independently discovered for reversing k-BWT, both of which run in O(nk) time. Two recent algorithms have lowered the bounds for the reverse transformation to O(n log A:) and O(n), respectively. We examine the practical performance for these reversal algorithms. We rind that the original O(nk) approach is most efficient in practice, and investigates new approaches, aimed at further speeding reversal, which store precomputed context boundaries in the compressed file. By explicitly encoding the context boundaries, we present an O{n) reversal technique that is both efficient and effective. Finally, our study elucidates an inherently cache-friendly - and hitherto unobserved - behavior in the reverse &-BWT, which could lead to new applications of the £-bwt transform. In contrast to previous empirical studies, we show that the partial transform can be reversed significantly faster than the full transform, without significantly affecting compression effectiveness.
机译:Burrows-Wheeler变换(BWT)通过将X的n个循环旋转按完全字典顺序排序并将所得n x n矩阵的最后一列取为X〜*,来生成表示为X〜*的字符串X的排列。该转换在O(n)时间中是可逆的。在本文中,我们考虑了一种称为A-bwt的过程更改,其中旋转仅分类到深度k。我们提出了正向和反向变换的新方法,并表明该方法在实践中是有效的。十多年前,独立发现了两种用于反转k-BWT的算法,两者均在O(nk)时间内运行。两种最新算法分别降低了逆向转换为O(n log A :)和O(n)的界限。我们检查了这些反向算法的实际性能。我们认为原始的O(nk)方法在实践中是最有效的,并且研​​究了旨在进一步加快反转速度的新方法,该方法将预先计算的上下文边界存储在压缩文件中。通过显式编码上下文边界,我们提出了一种高效的O(n)反转技术。最后,我们的研究阐明了反向&-BWT固有的对缓存友好的行为,并且迄今未曾观察到,这可能会导致£-bwt变换的新应用。与以前的经验研究相反,我们表明部分变换可以比完全变换更快地反转,而不会显着影响压缩效果。

著录项

  • 来源
    《Software》 |2012年第8期|p.1037-1054|共18页
  • 作者单位

    School of Computer Science & Information Technology, RM1T University, Melbourne VIC 3001, Australia;

    School of Computer Science & Information Technology, RM1T University, Melbourne VIC 3001, Australia;

    School of Computer Science & Information Technology, RM1T University, Melbourne VIC 3001, Australia;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    data compression; burrows-wheeler transform; block-sorting; suffix array;

    机译:数据压缩穴轮转换分块排序;后缀数组;
  • 入库时间 2022-08-17 13:03:48

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号